Advanced Retrieval for AI with Chroma

Coursera Courses ↗ · Coursera

Open Course on Coursera

Free to audit · Opens on Coursera

Advanced Retrieval for AI with Chroma

Coursera · Advanced ·🔍 RAG & Vector Search ·1mo ago
Skills: RAG Basics90%
Information Retrieval (IR) and Retrieval Augmented Generation (RAG) are only effective if the information retrieved from a database as a result of a query is relevant to the query and its application. Too often, queries return semantically similar results but don’t answer the question posed. They may also return irrelevant material which can distract the LLM from the correct results. This course teaches advanced retrieval techniques to improve the relevancy of retrieved results. The techniques covered include: 1. Query Expansion: Expanding user queries improves information retrieval by including related concepts and keywords. Utilizing an LLM makes this traditional technique even more effective. Another form of expansion has the LLM suggest a possible answer to the query which is then included in the query. 2. Cross-encoder reranking: Reranking retrieval results to select the results most relevant to your query improves your results. 3. Training and utilizing Embedding Adapters: Adding an adapter layer to reshape embeddings can improve retrieval by emphasizing elements relevant to your application.
Watch on Coursera ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

When Should You Use Text2Cypher in a GraphRAG Pipeline
Learn when to use Text2Cypher in a GraphRAG pipeline to retrieve precise graph results from natural language questions
Dev.to AI
How to build a production RAG pipeline in Python (without a vector database)
Learn to build a production-ready RAG pipeline in Python without relying on a vector database, and understand the key considerations for a scalable and efficient implementation
Dev.to · Ayi NEDJIMI
Architecting Sub-150ms Hybrid RAG for Voice Agents: Combining pgvector, BM25, and Async FastAPI…
Learn how to architect a sub-150ms hybrid RAG for voice agents using pgvector, BM25, and Async FastAPI to serve large industrial catalogs
Medium · Python
Security Controls in Enterprise RAG: Keys, Audit Logs, and the Hierarchy That Prevents Role Elevation
Implement security controls in Enterprise RAG to prevent role elevation and ensure data integrity
Dev.to · Manjunath
Up next
Watch this before applying for jobs as a developer.
Tech With Tim
Watch →