Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,905

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,460 Reads 5,445

Showing 5,445 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

SetFit: Efficient Few-Shot Learning Without Prompts

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Some Math behind Neural Tangent Kernel

Neural networks are well known to be over-parameterized and can often easily fit data with near-zero training loss with decent generalization performance on tes

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Train your first Decision Transformer

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

How to train a Language Model with Megatron-LM

Run Stable Diffusion on your M1 Mac’s GPU

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Run Stable Diffusion on your M1 Mac’s GPU

How to run Stable Diffusion locally so you can hack on it

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Research Insights – Learning to Retrieve Passages without Supervision

Self-Supervised Retrieval can surpass BM25 and Supervised techniques. This technique also pairs very well alongside BM25 in Hybrid Retrieval. Learn more about i

Run Stable Diffusion with an API

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Run Stable Diffusion with an API

How to use Replicate to integrate Stable Diffusion into hacks, apps, and projects

Build a robot artist for your Discord server with Stable Diffusion, Replicate, and Fly.io

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Build a robot artist for your Discord server with Stable Diffusion, Replicate, and Fly.io

A tutorial for building a chat bot that replies to prompts with the output of a text-to-image model.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Our approach to alignment research

We are improving our AI systems’ ability to learn from human feedback and to assist humans at evaluating AI. Our goal is to build a sufficiently aligned AI syst

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Visualize proteins on Hugging Face Spaces

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Stable Diffusion with 🧨 Diffusers

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Pre-Train BERT with Hugging Face Transformers and Habana Gaudi

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Hugging Face's TensorFlow Philosophy

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Join us at Uncanny Spaces

We're bringing people together to explore what's being created with machine learning.

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Using Cross-Encoders as reranker in multistage vector search

Learn about bi-encoder and cross-encoder machine learning models, and why combining them could improve the vector search experience.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Comments on U.S. National AI Research Resource Interim Report

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Efficient training of language models to fill in the middle

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Faster Text Generation with TensorFlow and XLA

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Reducing bias and improving safety in DALL·E 2

Today, we are implementing a new technique so that DALL·E generates images of people that more accurately reflect the diversity of the world’s population.

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Exploring text to image models

The basics of using the API to create your own images from text.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

How to train your model dynamically using adversarial data

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

DALL·E 2: Extending creativity

As part of our DALL·E 2 research preview, more than 3,000 artists from more than 118 countries have incorporated DALL·E into their creative workflows. The artis

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

A new template for model READMEs

Inspired by model cards, we've created templates for documenting models on Replicate.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

DALL·E 2 pre-training mitigations

In order to share the magic of DALL·E 2 with a broad audience, we needed to reduce the risks associated with powerful image generation models. To this end, we p

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Learning to play Minecraft with Video PreTraining

We trained a neural network to play Minecraft by Video PreTraining (VPT) on a massive unlabeled video dataset of human Minecraft play, while using only a small

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

The AI-First Database Ecosystem

Learn about the vision of the AI-First Database Ecosystem, which drives the R&D of the databases of the future.

Microsoft AI Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Microsoft’s framework for building AI systems responsibly

The post Microsoft’s framework for building AI systems responsibly appeared first on The AI Blog .

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Evolution through large models

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

AI-written critiques help humans notice flaws

We trained “critique-writing” models to describe flaws in summaries. Human evaluators find flaws in summaries much more often when shown our model’s critiques.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Techniques for training large neural networks

Large neural networks are at the core of many recent advances in AI, but training them is a difficult engineering and research challenge which requires orchestr

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

The Annotated Diffusion Model

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Best practices for deploying language models

Cohere, OpenAI, and AI21 Labs have developed a preliminary set of best practices applicable to any organization developing or deploying large language models.

Microsoft AI Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

The opportunity at home – can AI drive innovation in personal assistant devices and sign language?

The post The opportunity at home – can AI drive innovation in personal assistant devices and sign language? appeared first on The AI Blog .

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Constraining CLIPDraw

An introduction to differentiable programming and the process of refining generative art models.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Powering next generation applications with OpenAI Codex

Codex is now powering 70 different applications across a variety of use cases through the OpenAI API.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Efficient Table Pre-training without Real Data: An Introduction to TAPEX

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

An Introduction to Q-Learning Part 2/2

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

How Sempre Health is leveraging the Expert Acceleration Program to accelerate their ML roadmap

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Putting ethical principles at the core of the research lifecycle

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Student Ambassador Program’s call for applications is open!

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Accelerated Inference with Optimum and Transformers Pipelines

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

We Raised $100 Million for Open & Collaborative Machine Learning 🚀

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Director of Machine Learning Insights

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Getting Started with Transformers on Habana Gaudi

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Introducing Hugging Face for Education 🤗

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

CO2 Emissions and the 🤗 Hub: Leading the Charge

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Measuring Goodhart’s law

Goodhart’s law famously says: “When a measure becomes a target, it ceases to be a good measure.” Although originally from economics, it’s something we have to g

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Introducing Decision Transformers on Hugging Face 🤗