Embedding Models: From Architecture to Implementation

External: Coursera Courses ↗ · Coursera

Open Course on External: Coursera

Free to audit · Opens on External: Coursera

Embedding Models: From Architecture to Implementation

Coursera · Intermediate ·🔍 RAG & Vector Search ·3mo ago

Skills: RAG Basics90%

Key Takeaways

Explores the architecture and implementation of embedding models for AI applications

Original Description

Join our new short course, Embedding Models: From Architecture to Implementation! Learn from Ofer Mendelevitch, Head of Developer Relations at Vectara. This course goes into the details of the architecture and capabilities of embedding models, which are used in many AI applications to capture the meaning of words and sentences. You will learn about the evolution of embedding models, from word to sentence embeddings, and build and train a simple dual encoder model. This hands-on approach will help you understand the technical concepts behind embedding models and how to use them effectively. In detail, you’ll: 1. Learn about word embedding, sentence embedding, and cross-encoder models; and how they can be used in RAG. 2. Understand how transformer models, specifically BERT (Bi-directional Encoder Representations from Transformers), are trained and used in semantic search systems. 3. Gain knowledge of the evolution of sentence embedding and understand how the dual encoder architecture was formed. 4. Use a contrastive loss to train a dual encoder model, with one encoder trained for questions and another for the responses. 5. Utilize separate encoders for question and answer in a RAG pipeline and see how it affects the retrieval compared to using a single encoder model. By the end of this course, you will understand word, sentence, and cross-encoder embedding models, and how transformer-based models like BERT are trained and used in semantic search. You will also learn how to train dual encoder models with contrastive loss and evaluate their impact on retrieval in a RAG pipeline.

Watch on External: Coursera ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

More on: RAG Basics

View skill →

High Performance (Realtime) RAG Chains: From Basic to Advanced

High Performance (Realtime) RAG Chains: From Basic to Advanced

Coding the Ultimate RAG Engine from Zero

Coding the Ultimate RAG Engine from Zero

Building Agentic RAG From Scratch in Pure Python

Building Agentic RAG From Scratch in Pure Python

Build an LLM and RAG-based Chat Application using AlloyDB and LangChain

I Built a RAG App to Decode Airline Bureaucracy (So You Don't Have To)

I Built a RAG App to Decode Airline Bureaucracy (So You Don't Have To)

Akamai Developers

RAG Demo for Beginners: Full Hands-On Tutorial in Tamil | Build Your Own RAG AI | Karthik's Show

RAG Demo for Beginners: Full Hands-On Tutorial in Tamil | Build Your Own RAG AI | Karthik's Show

Related Reads

Most RAG Hallucinations Are Retrieval Failures: How the Retrieval Brick Decides What the Model Can Invent

Learn how RAG hallucinations are often caused by retrieval failures and how fixing retrieval can reduce model inventions

Towards Data Science

Beyond Search: Building Knowledge Nexus — The Future of AI-Powered Enterprise Intelligence

Learn how to build an enterprise-grade RAG platform that turns static PDFs into an interactive Knowledge Graph, enabling AI-powered enterprise intelligence

Medium · Machine Learning

From Documents to Intelligent Answers: Building a RAG Agent from Scratch & Lessons Learned

Learn to build a RAG agent from scratch and discover key lessons for creating intelligent answer systems

Dev.to · Sri Deevi

Your RAG Eval Isn't Flaky. Your Retrieval Is Non-Deterministic.

Learn why your RAG evaluation may be returning different results despite using the same query, documents, and model, and how to address non-deterministic retrieval

Dev.to · Vasyl

4. Indexing PDF using Vector + Semantic Search in Azure AI Search with Document Intelligence | Chunk

Dewiride Technologies