Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,898 lessons

Skills in this topic

5 skills

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos: 19,459 | Reads: 5,439

Showing 5,439 reads from curated sources

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Train and run Stanford Alpaca on your own machine

We'll show you how to train Alpaca, a fine-tuned version of LLaMA that can respond to instructions like ChatGPT.

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Prompt Engineering

Prompt Engineering, also known as In-Context Prompting, refers to methods for how to communicate with LLM to steer its behavior for desired outcomes without u

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Transforming visual accessibility

Be My Eyes uses GPT-4 to transform visual accessibility.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Streamlining financial solutions for safety and growth

Stripe leverages GPT-4 to streamline user experience and combat fraud.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

We’ve created GPT-4, the latest milestone in OpenAI’s effort in scaling up deep learning. GPT-4 is a large multimodal model (accepting image and text inputs, em

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Powering virtual education for the classroom

Khan Academy explores the potential for GPT-4 in a limited pilot program.

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

HNSW+PQ - Exploring ANN algorithms Part 2.1

Implementing HNSW + Product Quantization (PQ) vector compression in Weaviate.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Planning for AGI and beyond

Our mission is to ensure that artificial general intelligence—AI systems that are generally smarter than humans—benefits all of humanity.

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Combining LangChain and Weaviate

LangChain is one of the most exciting new tools in AI. It helps overcome many limitations of LLMs, such as hallucination and limited input lengths.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

How should AI systems behave, and who should decide?

We’re clarifying how ChatGPT’s behavior is shaped and our plans for improving that behavior, allowing more user customization, and getting more public input int

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Introducing LoRA: A faster way to fine-tune Stable Diffusion

It's like DreamBooth, but much faster. And you can run it in the cloud on Replicate.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Introducing ChatGPT Plus

We’re launching a pilot subscription plan for ChatGPT, a conversational AI that can chat with you, answer follow-up questions, and challenge incorrect assumptio

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

New AI classifier for indicating AI-written text

We’re launching a classifier trained to distinguish between AI-written and human-written text.

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

The Transformer Family Version 2.0

Many new Transformer architecture improvements have been proposed since my last post on “The Transformer Family” about three years ago. Here I did a

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

OpenAI and Microsoft extend partnership

We’re happy to announce that OpenAI and Microsoft are extending our partnership.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

3D Asset Generation: AI for Game Development #3

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Welcome PaddlePaddle to the Hugging Face Hub

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Forecasting potential misuses of language models for disinformation campaigns and how to reduce risk

OpenAI researchers collaborated with Georgetown University’s Center for Security and Emerging Technology and the Stanford Internet Observatory to investigate ho

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Large Transformer Model Inference Optimization

[Updated on 2023-01-24: add a small section on Distillation.] Large transformer models are mainstream nowadays, creating SoTA results for a variety of tasks. T

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Delivering nuanced insights from customer feedback

Using GPT-3 to deliver fast, nuanced insights from customer feedback.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Fine-tuning GPT-3 to scale video creation

Fine-tuning GPT-3 to power and scale done-for-you video creation.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Creating next-gen characters

Using GPT-3 to create the next generation of AI-powered characters.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

The power of continuous learning

Lilian Weng works on Applied AI Research at OpenAI.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

New and improved embedding model

We are excited to announce a new embedding model which is significantly more capable, cost effective, and simpler to use.

Microsoft AI Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

A conversation with Kevin Scott: What’s next in AI

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

The Sphere Dataset in Weaviate

Learn how to import and query the Sphere dataset in Weaviate!

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Deep Learning with Proteins

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Using Stable Diffusion with Core ML on Apple Silicon

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Introducing ChatGPT

We’ve trained a model called ChatGPT which interacts in a conversational way. The dialogue format makes it possible for ChatGPT to answer followup questions, ad

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

An overview of inference solutions on Hugging Face

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Train and deploy a DreamBooth model on Replicate

With just a handful of images and a single API call, you can train a model, publish it to Replicate, and run predictions on it in the cloud.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Hugging Face Machine Learning Demos on arXiv

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Sentiment Analysis on Encrypted Data with Homomorphic Encryption

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

DALL·E API now available in public beta

Starting today, developers can begin building apps with the DALL·E API.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Accelerate your models with 🤗 Optimum Intel and OpenVINO

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Weaviate 1.16 release

Weaviate 1.16 introduces New Filter Operators, Distributed Backups, Centroid Module, Node Status API, Azure-based OIDC, and more. Learn all about it.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

🧨 Stable Diffusion in JAX / Flax !

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Optimization story: Bloom inference

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

DALL·E now available without waitlist

New users can start creating straight away. Lessons learned from deployment and improvements to our safety systems make wider availability possible.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

How 🤗 Accelerate runs very large models thanks to PyTorch

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Support for Hugging Face Inference API in Weaviate

Running ML Model Inference in production is hard. You can use Weaviate – a vector database – with the Hugging Face Inference module to delegate the heavy lifting.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

SetFit: Efficient Few-Shot Learning Without Prompts

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Some Math behind Neural Tangent Kernel

Neural networks are well known to be over-parameterized and can often easily fit data with near-zero training loss with decent generalization performance on tes

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Train your first Decision Transformer

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

How to train a Language Model with Megatron-LM

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Run Stable Diffusion on your M1 Mac’s GPU

How to run Stable Diffusion locally so you can hack on it

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Research Insights – Learning to Retrieve Passages without Supervision

Self-Supervised Retrieval can surpass BM25 and Supervised techniques. This technique also pairs very well alongside BM25 in Hybrid Retrieval. Learn more about i