Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,439 reads from curated sources

Replicate Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Train and run Stanford Alpaca on your own machine
We'll show you how to train Alpaca, a fine-tuned version of LLaMA that can respond to instructions like ChatGPT.
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Prompt Engineering
Prompt Engineering , also known as In-Context Prompting , refers to methods for how to communicate with LLM to steer its behavior for desired outcomes without u
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
Transforming visual accessibility
Be My Eyes uses GPT-4 to transform visual accessibility.
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
Streamlining financial solutions for safety and growth
Stripe leverages GPT-4 to streamline user experience and combat fraud.
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
GPT-4
We’ve created GPT-4, the latest milestone in OpenAI’s effort in scaling up deep learning. GPT-4 is a large multimodal model (accepting image and text inputs, em
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
Powering virtual education for the classroom
Khan Academy explores the potential for GPT-4 in a limited pilot program.
Weaviate Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
HNSW+PQ - Exploring ANN algorithms Part 2.1
Implementing HNSW + Product Quantization (PQ) vector compression in Weaviate.
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
Planning for AGI and beyond
Our mission is to ensure that artificial general intelligence—AI systems that are generally smarter than humans—benefits all of humanity.
Weaviate Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Combining LangChain and Weaviate
LangChain is one of the most exciting new tools in AI. It helps overcome many limitations of LLMs, such as hallucination and limited input lengths.
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
How should AI systems behave, and who should decide?
We’re clarifying how ChatGPT’s behavior is shaped and our plans for improving that behavior, allowing more user customization, and getting more public input int

Replicate Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Introducing LoRA: A faster way to fine-tune Stable Diffusion
It's like DreamBooth, but much faster. And you can run it in the cloud on Replicate.
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
Introducing ChatGPT Plus
We’re launching a pilot subscription plan for ChatGPT, a conversational AI that can chat with you, answer follow-up questions, and challenge incorrect assumptio
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
New AI classifier for indicating AI-written text
We’re launching a classifier trained to distinguish between AI-written and human-written text.
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
The Transformer Family Version 2.0
Many new Transformer architecture improvements have been proposed since my last post on “The Transformer Family” about three years ago. Here I did a
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
OpenAI and Microsoft extend partnership
We’re happy to announce that OpenAI and Microsoft are extending our partnership.
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
3D Asset Generation: AI for Game Development #3
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Welcome PaddlePaddle to the Hugging Face Hub
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
Forecasting potential misuses of language models for disinformation campaigns and how to reduce risk
OpenAI researchers collaborated with Georgetown University’s Center for Security and Emerging Technology and the Stanford Internet Observatory to investigate ho
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Large Transformer Model Inference Optimization
[Updated on 2023-01-24: add a small section on Distillation .] Large transformer models are mainstream nowadays, creating SoTA results for a variety of tasks. T
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
Delivering nuanced insights from customer feedback
Using GPT-3 to deliver fast, nuanced insights from customer feedback.
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
Fine-tuning GPT-3 to scale video creation
Fine-tuning GPT-3 to power and scale done-for-you video creation.
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
Creating next-gen characters
Using GPT-3 to create the next generation of AI-powered characters.
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
The power of continuous learning
Lilian Weng works on Applied AI Research at OpenAI.
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
New and improved embedding model
We are excited to announce a new embedding model which is significantly more capable, cost effective, and simpler to use.
Microsoft AI Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
A conversation with Kevin Scott: What’s next in AI
The post A conversation with Kevin Scott: What’s next in AI appeared first on The AI Blog .
Weaviate Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
The Sphere Dataset in Weaviate
Learn how to import and query the Sphere dataset in Weaviate!
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Deep Learning with Proteins
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Using Stable Diffusion with Core ML on Apple Silicon
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
Introducing ChatGPT
We’ve trained a model called ChatGPT which interacts in a conversational way. The dialogue format makes it possible for ChatGPT to answer followup questions, ad
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
An overview of inference solutions on Hugging Face

Replicate Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Train and deploy a DreamBooth model on Replicate
With just a handful of images and a single API call, you can train a model, publish it to Replicate, and run predictions on it in the cloud.
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Hugging Face Machine Learning Demos on arXiv
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Sentiment Analysis on Encrypted Data with Homomorphic Encryption
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
DALL·E API now available in public beta
Starting today, developers can begin building apps with the DALL·E API.
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Accelerate your models with 🤗 Optimum Intel and OpenVINO
Weaviate Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Weaviate 1.16 release
Weaviate 1.16 introduces New Filter Operators, Distributed Backups, Centroid Module, Node Status API, Azure-based OIDC, and more. Lear all about it.
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
🧨 Stable Diffusion in JAX / Flax !
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Optimization story: Bloom inference
OpenAI News
🧠 Large Language Models
⚡ AI Lesson
3y ago
DALL·E now available without waitlist
New users can start creating straight away. Lessons learned from deployment and improvements to our safety systems make wider availability possible.
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
How 🤗 Accelerate runs very large models thanks to PyTorch
Weaviate Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Support for Hugging Face Inference API in Weaviate
Running ML Model Inference in production is hard. You can use Weaviate – a vector database – with Hugging Face Inference module to delegate the heavy lifting.
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
SetFit: Efficient Few-Shot Learning Without Prompts
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Some Math behind Neural Tangent Kernel
Neural networks are well known to be over-parameterized and can often easily fit data with near-zero training loss with decent generalization performance on tes
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Train your first Decision Transformer
Hugging Face Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
How to train a Language Model with Megatron-LM

Replicate Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Run Stable Diffusion on your M1 Mac’s GPU
How to run Stable Diffusion locally so you can hack on it
Weaviate Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Research Insights – Learning to Retrieve Passages without Supervision
Self-Supervised Retrieval can surpass BM25 and Supervised techniques. This technique also pairs very well alongside BM25 in Hybrid Retrieval. Learn more about i
DeepCamp AI