Chat with your PDFs with Open Source OCR (Optical Character Recognition) & Mistral Lite

Name: Chat with your PDFs with Open Source OCR (Optical Character Recognition) & Mistral Lite
Uploaded: 2024-01-27T00:11:34+00:00
Channel: Brev
Description: Hi there! Harper Carroll from Brev.dev here. In this tutorial, we go through a pre-made Jupyter Notebook to run OCR (Optical Character Recognition) on o...

Brev · Beginner ·🧠 Large Language Models ·2y ago

Skills: LLM Foundations90%Prompt Craft80%LLM Engineering80%Fine-tuning LLMs70%

Hi there! Harper Carroll from Brev.dev here. In this tutorial, we go through a pre-made Jupyter Notebook to run OCR (Optical Character Recognition) on our uploaded PDFs to extract the text, and then we use Amazon's MistralLite to ask questions about those PDFs. Amazon's MistralLite is its fine-tuned version of Mistral 7B, which allows for context lengths of up to 32K tokens... this means we can fit more data into the model's prompt i.e. memory (with some lossiness as the context length is more utilized). Notebook: https://github.com/brevdev/notebooks/blob/main/ocr-pdf-analysis.ipynb More AI/ML notebooks: https://github.com/brevdev/notebooks/ Join the Discord: https://discord.gg/NVDyv7TUgJ Connect with me on 𝕏: https://x.com/HarperSCarroll Find me on Reels: https://instagram.com/harpercarrollai

Watch on YouTube ↗ (saves to browser)