Building a Simple OCR App with Python | Automating Text Search
Skills:
CV Basics90%
Let's write a simple Python script to automate search over text in your local images with OCR using Python.
Let's get started! :)
๐ Chapters:
00:00 - Introduction and setting up the OCR search project
00:18 - Initial project setup requirements and Python package installations
00:32 - Importing necessary libraries
00:45 - Setting up the functions to scan images for text and loop over images in a folder
01:00 - Setting up EasyOCR reader for text detection
01:16 - Coding the OCR scan function
01:57 - Setting up the function to process recognized text
02:20 - Introduction of a template for the search images function
02:31 - Initial testing of the OCR scan function
03:00 - Execution of the OCR scan on an image for demonstration
03:16 - Starting the development of the search images function
04:12 - Explanation of how the search images function works
06:10 - Testing the search images function
07:28 - Transitioning the code to an application callable from terminal
08:20 - Developing a CLI tool to enable OCR search from terminal
10:26 - Parsing the arguments for the CLI tool
11:34 - Coding the logic to handle directory search and keyword recognition
12:30 - Handling single image OCR search in the CLI tool
14:31 - Running and testing the application in the terminal
15:02 - Conclusion and review of the project.
๐ Links:
- Medium Article with source code: https://medium.com/@lucas-soares/how-to-build-a-simple-ocr-search-app-for-your-local-files-1a6357c0a1d0
- Subscribe!: https://www.youtube.com/channel/UCu8WF59Scx9f3H1N_FgZUwQ
- Join Medium: https://lucas-soares.medium.com/membership
- Tiktok: https://www.tiktok.com/@enkrateialucca?lang=en
- Twitter: https://twitter.com/LucasEnkrateia
- LinkedIn: https://www.linkedin.com/in/lucas-soares-969044167/
#automation #python #productivity #datascience #artificialintelligence
Watch on YouTube โ
(saves to browser)
Sign in to unlock AI tutor explanation ยท โก30
Playlist
Uploads from Automata Learning Lab ยท Automata Learning Lab ยท 0 of 60
โ Previous
Next โ
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A Quick Tutorial on NLP Basics
Automata Learning Lab
Automating your Digital Morning Routine with Python
Automata Learning Lab
Exploring Problem Solving with Python and Jupyter Notebook #1
Automata Learning Lab
Summarize Papers with Python and GPT-3
Automata Learning Lab
An Experiment Tracking Tutorial with Mlflow and Keras
Automata Learning Lab
Automating Google Forms Submissions with Python
Automata Learning Lab
Productivity Tracking With Python and the Notion API
Automata Learning Lab
When your Machine Learning Model Fails Do This ;p
Automata Learning Lab
Machine Learning Tip#1 Practical Deep Learning Course
Automata Learning Lab
Machine Learning Tips: Deep Learning Monitor
Automata Learning Lab
Machine Learning Tips#5 MLOPs specialization in Coursera #machinelearning
Automata Learning Lab
Automatically Changing Desktop Wallpaper with Python and the Nasa Image API
Automata Learning Lab
Building an Image Classifier to Filter Out Unused Images From Your Photo Album with Machine Learning
Automata Learning Lab
Automating VS Code Snippets with Python
Automata Learning Lab
How to Set Up a Machine Learning Environment with Conda and Pip-Tools
Automata Learning Lab
9 Google Search Tips for Machine Learning
Automata Learning Lab
Thinking Tools
Automata Learning Lab
Automating Car Search with Python and Data Science
Automata Learning Lab
Generating Images from Text with Stable Diffusion and Hugging Face
Automata Learning Lab
A Practical Introduction to Data Science using the Spaceship Titanic Dataset from Kaggle
Automata Learning Lab
Jiu Jitsu App with Python and Streamlit
Automata Learning Lab
2 Apps for Coding In The Ipad Pro
Automata Learning Lab
From Tensorflow to Pytorch?
Automata Learning Lab
Building an Audio Transcription App with OpenAI Whisper and Streamlit
Automata Learning Lab
Productivity Tracking with Python Short Summary
Automata Learning Lab
Automating Expense Reports with Python
Automata Learning Lab
ChatGPT, Angry Pandas and AI Code
Automata Learning Lab
7 Strategies To Learn Anything Using ChatGPT
Automata Learning Lab
Building a Thought Summarization App with Whisper and GPT3
Automata Learning Lab
Visualize a Neural Net Learning Polynomial Functions
Automata Learning Lab
Automating Notion with Python
Automata Learning Lab
Pose Tracking for Jiu Jitsu - Update #jiujitsu #machinelearning
Automata Learning Lab
Update to my Pose Tracking for Jiu Jitsu Project #machinelearning #jiujitsu #ai #deeplearning
Automata Learning Lab
ChatGPT API Released by OpenAI
Automata Learning Lab
ChatGPT API Response Format #machinelearning #ai #datascience
Automata Learning Lab
Beyond Stable Diffusion with Composer | Automata Learning Lab Paper Series #1
Automata Learning Lab
Beyond Diffusion Models with Composer #machinelearning #ai
Automata Learning Lab
Machine Learning for Jiu Jitsu
Automata Learning Lab
Prompt Engineering Basics #machinelearning #gpt4 #chatgpt
Automata Learning Lab
Visual ChatGPT: Integrating Images with ChatGPT Paper Series#2
Automata Learning Lab
Visual ChatGPT #machinelearning #ai #artificialintelligence
Automata Learning Lab
LERF - Language Embeddings + NERF for Querying 3D Spaces #machinelearning #ai
Automata Learning Lab
Summarize Papers with Python and ChatGPT
Automata Learning Lab
Large Language Models can use Tools Now! #artificialintelligence #machinelearning #ai
Automata Learning Lab
Sparks of AGI in GPT4? #machinelearning #ai #agi #artificialintelligence
Automata Learning Lab
Toolformer: LLMs can use Tools! #chatgpt #llms #gpt4 #gpt3 #artificialintelligence
Automata Learning Lab
Talking to Your Notes with LangChain #artificialintelligence #llms #gpt4 #chatgpt
Automata Learning Lab
How to Talk to a PDF using LangChain and ChatGPT
Automata Learning Lab
Query Your Own Notes With LangChain
Automata Learning Lab
HuggingGPT #machinelearning #artificialintelligence #huggingface #gpt4 #chatgpt
Automata Learning Lab
Do as I Can Not as I Say Paper #artificialintelligence #llms #reinforcementlearning
Automata Learning Lab
Automating Anki Flashcards with OpenAI and GPT-4
Automata Learning Lab
Building A PDF Summarization App with Gradio and LangChain
Automata Learning Lab
Auto-GPT #artificialintelligence #gpt4 #llms #autogpt
Automata Learning Lab
DocGPT - Chat with Github #artificialintelligence #gpt4 #chatgpt
Automata Learning Lab
LLMs for Research and Planning #artificialintelligence #gpt4 #llms
Automata Learning Lab
How I Use ChatGPT for Interactive Language Learning
Automata Learning Lab
Building an Audio Transcription App with Gradio and Whisper
Automata Learning Lab
Summarizing and Querying Multiple Papers with LangChain
Automata Learning Lab
Mojo - The New AI Programming Language?
Automata Learning Lab
More on: CV Basics
View skill โRelated AI Lessons
โก
โก
โก
โก
I Built a CLI That Extracts Content From URLs and Turns It Into JSON for AI
Dev.to ยท Omar Fuentes
How to Evaluate AI Tools Without Being a Data Scientist
Medium ยท AI
I Automated My Morning Tech News with Claude โ Here's What Worked (and What Didn't)
Dev.to AI
Best AI Tool for Trading in India 2026
Dev.to AI
Chapters (20)
Introduction and setting up the OCR search project
0:18
Initial project setup requirements and Python package installations
0:32
Importing necessary libraries
0:45
Setting up the functions to scan images for text and loop over images in a fol
1:00
Setting up EasyOCR reader for text detection
1:16
Coding the OCR scan function
1:57
Setting up the function to process recognized text
2:20
Introduction of a template for the search images function
2:31
Initial testing of the OCR scan function
3:00
Execution of the OCR scan on an image for demonstration
3:16
Starting the development of the search images function
4:12
Explanation of how the search images function works
6:10
Testing the search images function
7:28
Transitioning the code to an application callable from terminal
8:20
Developing a CLI tool to enable OCR search from terminal
10:26
Parsing the arguments for the CLI tool
11:34
Coding the logic to handle directory search and keyword recognition
12:30
Handling single image OCR search in the CLI tool
14:31
Running and testing the application in the terminal
15:02
Conclusion and review of the project.
๐
Tutor Explanation
DeepCamp AI