Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,905
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,445 reads from curated sources

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
SetFit: Efficient Few-Shot Learning Without Prompts
Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Some Math behind Neural Tangent Kernel
Neural networks are well known to be over-parameterized and can often easily fit data with near-zero training loss with decent generalization performance on tes
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Train your first Decision Transformer
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
How to train a Language Model with Megatron-LM
Run Stable Diffusion on your M1 Mac’s GPU
Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Run Stable Diffusion on your M1 Mac’s GPU
How to run Stable Diffusion locally so you can hack on it
Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Research Insights – Learning to Retrieve Passages without Supervision
Self-Supervised Retrieval can surpass BM25 and Supervised techniques. This technique also pairs very well alongside BM25 in Hybrid Retrieval. Learn more about i
Run Stable Diffusion with an API
Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Run Stable Diffusion with an API
How to use Replicate to integrate Stable Diffusion into hacks, apps, and projects
Build a robot artist for your Discord server with Stable Diffusion, Replicate, and Fly.io
Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Build a robot artist for your Discord server with Stable Diffusion, Replicate, and Fly.io
A tutorial for building a chat bot that replies to prompts with the output of a text-to-image model.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Our approach to alignment research
We are improving our AI systems’ ability to learn from human feedback and to assist humans at evaluating AI. Our goal is to build a sufficiently aligned AI syst
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Visualize proteins on Hugging Face Spaces
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Stable Diffusion with 🧨 Diffusers
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Pre-Train BERT with Hugging Face Transformers and Habana Gaudi
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Hugging Face's TensorFlow Philosophy
Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Join us at Uncanny Spaces
We're bringing people together to explore what's being created with machine learning.
Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Using Cross-Encoders as reranker in multistage vector search
Learn about bi-encoder and cross-encoder machine learning models, and why combining them could improve the vector search experience.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Comments on U.S. National AI Research Resource Interim Report
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Efficient training of language models to fill in the middle
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Faster Text Generation with TensorFlow and XLA
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Reducing bias and improving safety in DALL·E 2
Today, we are implementing a new technique so that DALL·E generates images of people that more accurately reflect the diversity of the world’s population.
Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Exploring text to image models
The basics of using the API to create your own images from text.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
How to train your model dynamically using adversarial data
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
DALL·E 2: Extending creativity
As part of our DALL·E 2 research preview, more than 3,000 artists from more than 118 countries have incorporated DALL·E into their creative workflows. The artis
Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
A new template for model READMEs
Inspired by model cards, we've created templates for documenting models on Replicate.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
DALL·E 2 pre-training mitigations
In order to share the magic of DALL·E 2 with a broad audience, we needed to reduce the risks associated with powerful image generation models. To this end, we p
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Learning to play Minecraft with Video PreTraining
We trained a neural network to play Minecraft by Video PreTraining (VPT) on a massive unlabeled video dataset of human Minecraft play, while using only a small
Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
The AI-First Database Ecosystem
Learn about the vision of the AI-First Database Ecosystem, which drives the R&D of the databases of the future.
Microsoft AI Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Microsoft’s framework for building AI systems responsibly
The post Microsoft’s framework for building AI systems responsibly appeared first on The AI Blog .
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Evolution through large models
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
AI-written critiques help humans notice flaws
We trained “critique-writing” models to describe flaws in summaries. Human evaluators find flaws in summaries much more often when shown our model’s critiques.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Techniques for training large neural networks
Large neural networks are at the core of many recent advances in AI, but training them is a difficult engineering and research challenge which requires orchestr
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
The Annotated Diffusion Model
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Best practices for deploying language models
Cohere, OpenAI, and AI21 Labs have developed a preliminary set of best practices applicable to any organization developing or deploying large language models.
Microsoft AI Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
The opportunity at home – can AI drive innovation in personal assistant devices and sign language?
The post The opportunity at home – can AI drive innovation in personal assistant devices and sign language? appeared first on The AI Blog .
Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Constraining CLIPDraw
An introduction to differentiable programming and the process of refining generative art models.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Powering next generation applications with OpenAI Codex
Codex is now powering 70 different applications across a variety of use cases through the OpenAI API.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Efficient Table Pre-training without Real Data: An Introduction to TAPEX
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
An Introduction to Q-Learning Part 2/2
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
How Sempre Health is leveraging the Expert Acceleration Program to accelerate their ML roadmap
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Putting ethical principles at the core of the research lifecycle
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Student Ambassador Program’s call for applications is open!
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Accelerated Inference with Optimum and Transformers Pipelines
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
We Raised $100 Million for Open & Collaborative Machine Learning 🚀
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Director of Machine Learning Insights
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Getting Started with Transformers on Habana Gaudi
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Introducing Hugging Face for Education 🤗
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
CO2 Emissions and the 🤗 Hub: Leading the Charge
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Measuring Goodhart’s law
Goodhart’s law famously says: “When a measure becomes a target, it ceases to be a good measure.” Although originally from economics, it’s something we have to g
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Introducing Decision Transformers on Hugging Face 🤗