Building a PDF Translation App with ChatGPT
Skills:
LLM Foundations90%
In this video I'll show you how to automatically translate entire PDF documents using Python and the ChatGPT API with a step-by-step tutorial. This video covers everything from loading the PDF to exporting the fully translated version. Understand token limitations, cost estimation, and see the final result in real-time. Hope you enjoy it! :)
📚 Chapters:
00:00 - Introduction to translating PDFs using API.
00:12 - Step-by-step guide: Loading, Extracting, and Translating PDF text.
00:22 - Using DocumentLoader and PDFLoader class in Python.
00:41 - Extracting text from PDF and inspecting document objects.
01:17 - Discussing token size limitations for translation API.
02:01 - Calculating token size for PDF content and estimating translation costs.
04:15 - Writing a function for getting translations via API.
05:39 - Preparing text content for translation automation.
06:03 - Creating a PDF with translated content using fpdf and unidecode.
08:14 - Installing required packages and setting up the workspace.
09:02 - Loading the PDF and starting the translation process.
10:21 - Troubleshooting translation issues.
11:01 - Adjusting code to fix errors and resume translation.
12:00 - Final code adjustments and translation execution.
13:05 - Succesfully translating, handling batched content.
14:46 - Resolving final list and integer error.
16:16 - Reviewing the translated document and concluding thoughts.
17:08 - Suggestions for improvement and potential future development.
18:00 - Final remarks and invitation to access the code and try it out.
18:45 - Closing and call to action for likes and subscription.
🔗 Links:
- Source code (UPDATED 2025!): https://github.com/EnkrateiaLucca/translate-pdf
- Subscribe!: https://www.youtube.com/channel/UCu8WF59Scx9f3H1N_FgZUwQ
- Tiktok: https://www.tiktok.com/@enkrateialucca?lang=en
- Twitter: https://twitter.com/LucasEnkrateia
- LinkedIn: https://www.linkedin.com/in/lucas-soares-969044167/
- Music from www.epidemicsound.com
Support the Chan
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from Automata Learning Lab · Automata Learning Lab · 0 of 60
← Previous
Next →
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
A Quick Tutorial on NLP Basics
Automata Learning Lab
Automating your Digital Morning Routine with Python
Automata Learning Lab
Exploring Problem Solving with Python and Jupyter Notebook #1
Automata Learning Lab
Summarize Papers with Python and GPT-3
Automata Learning Lab
An Experiment Tracking Tutorial with Mlflow and Keras
Automata Learning Lab
Automating Google Forms Submissions with Python
Automata Learning Lab
Productivity Tracking With Python and the Notion API
Automata Learning Lab
When your Machine Learning Model Fails Do This ;p
Automata Learning Lab
Machine Learning Tip#1 Practical Deep Learning Course
Automata Learning Lab
Machine Learning Tips: Deep Learning Monitor
Automata Learning Lab
Machine Learning Tips#5 MLOPs specialization in Coursera #machinelearning
Automata Learning Lab
Automatically Changing Desktop Wallpaper with Python and the Nasa Image API
Automata Learning Lab
Building an Image Classifier to Filter Out Unused Images From Your Photo Album with Machine Learning
Automata Learning Lab
Automating VS Code Snippets with Python
Automata Learning Lab
How to Set Up a Machine Learning Environment with Conda and Pip-Tools
Automata Learning Lab
9 Google Search Tips for Machine Learning
Automata Learning Lab
Thinking Tools
Automata Learning Lab
Automating Car Search with Python and Data Science
Automata Learning Lab
Generating Images from Text with Stable Diffusion and Hugging Face
Automata Learning Lab
A Practical Introduction to Data Science using the Spaceship Titanic Dataset from Kaggle
Automata Learning Lab
Jiu Jitsu App with Python and Streamlit
Automata Learning Lab
2 Apps for Coding In The Ipad Pro
Automata Learning Lab
From Tensorflow to Pytorch?
Automata Learning Lab
Building an Audio Transcription App with OpenAI Whisper and Streamlit
Automata Learning Lab
Productivity Tracking with Python Short Summary
Automata Learning Lab
Automating Expense Reports with Python
Automata Learning Lab
ChatGPT, Angry Pandas and AI Code
Automata Learning Lab
7 Strategies To Learn Anything Using ChatGPT
Automata Learning Lab
Building a Thought Summarization App with Whisper and GPT3
Automata Learning Lab
Visualize a Neural Net Learning Polynomial Functions
Automata Learning Lab
Automating Notion with Python
Automata Learning Lab
Pose Tracking for Jiu Jitsu - Update #jiujitsu #machinelearning
Automata Learning Lab
Update to my Pose Tracking for Jiu Jitsu Project #machinelearning #jiujitsu #ai #deeplearning
Automata Learning Lab
ChatGPT API Released by OpenAI
Automata Learning Lab
ChatGPT API Response Format #machinelearning #ai #datascience
Automata Learning Lab
Beyond Stable Diffusion with Composer | Automata Learning Lab Paper Series #1
Automata Learning Lab
Beyond Diffusion Models with Composer #machinelearning #ai
Automata Learning Lab
Machine Learning for Jiu Jitsu
Automata Learning Lab
Prompt Engineering Basics #machinelearning #gpt4 #chatgpt
Automata Learning Lab
Visual ChatGPT: Integrating Images with ChatGPT Paper Series#2
Automata Learning Lab
Visual ChatGPT #machinelearning #ai #artificialintelligence
Automata Learning Lab
LERF - Language Embeddings + NERF for Querying 3D Spaces #machinelearning #ai
Automata Learning Lab
Summarize Papers with Python and ChatGPT
Automata Learning Lab
Large Language Models can use Tools Now! #artificialintelligence #machinelearning #ai
Automata Learning Lab
Sparks of AGI in GPT4? #machinelearning #ai #agi #artificialintelligence
Automata Learning Lab
Toolformer: LLMs can use Tools! #chatgpt #llms #gpt4 #gpt3 #artificialintelligence
Automata Learning Lab
Talking to Your Notes with LangChain #artificialintelligence #llms #gpt4 #chatgpt
Automata Learning Lab
How to Talk to a PDF using LangChain and ChatGPT
Automata Learning Lab
Query Your Own Notes With LangChain
Automata Learning Lab
HuggingGPT #machinelearning #artificialintelligence #huggingface #gpt4 #chatgpt
Automata Learning Lab
Do as I Can Not as I Say Paper #artificialintelligence #llms #reinforcementlearning
Automata Learning Lab
Automating Anki Flashcards with OpenAI and GPT-4
Automata Learning Lab
Building A PDF Summarization App with Gradio and LangChain
Automata Learning Lab
Auto-GPT #artificialintelligence #gpt4 #llms #autogpt
Automata Learning Lab
DocGPT - Chat with Github #artificialintelligence #gpt4 #chatgpt
Automata Learning Lab
LLMs for Research and Planning #artificialintelligence #gpt4 #llms
Automata Learning Lab
How I Use ChatGPT for Interactive Language Learning
Automata Learning Lab
Building an Audio Transcription App with Gradio and Whisper
Automata Learning Lab
Summarizing and Querying Multiple Papers with LangChain
Automata Learning Lab
Mojo - The New AI Programming Language?
Automata Learning Lab
More on: LLM Foundations
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
I Tried Talking to 6 AI Tools… and It Felt Like Meeting 6 Different Personalities
Medium · AI
I Tried Talking to 6 AI Tools… and It Felt Like Meeting 6 Different Personalities
Medium · Machine Learning
The Day AI Stopped Reading Word-by-Word: A Story of “Attention”
Medium · Deep Learning
Stop Paying Michelin Prices for Kulcha: A Guide to LLM Unit Economics
Medium · LLM
Chapters (20)
Introduction to translating PDFs using API.
0:12
Step-by-step guide: Loading, Extracting, and Translating PDF text.
0:22
Using DocumentLoader and PDFLoader class in Python.
0:41
Extracting text from PDF and inspecting document objects.
1:17
Discussing token size limitations for translation API.
2:01
Calculating token size for PDF content and estimating translation costs.
4:15
Writing a function for getting translations via API.
5:39
Preparing text content for translation automation.
6:03
Creating a PDF with translated content using fpdf and unidecode.
8:14
Installing required packages and setting up the workspace.
9:02
Loading the PDF and starting the translation process.
10:21
Troubleshooting translation issues.
11:01
Adjusting code to fix errors and resume translation.
12:00
Final code adjustments and translation execution.
13:05
Succesfully translating, handling batched content.
14:46
Resolving final list and integer error.
16:16
Reviewing the translated document and concluding thoughts.
17:08
Suggestions for improvement and potential future development.
18:00
Final remarks and invitation to access the code and try it out.
18:45
Closing and call to action for likes and subscription.
🎓
Tutor Explanation
DeepCamp AI