Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,905 lessons
Skills in this topic
LLM Foundations (beginner): Explain how transformers generate text
Prompt Craft (beginner): Write zero-shot and few-shot prompts
LLM Engineering (intermediate): Call LLM APIs with function/tool use
Fine-tuning LLMs (advanced): Prepare fine-tuning datasets
Multimodal LLMs (advanced): Use GPT-4V / Claude Vision for image understanding

Showing 5,445 reads from curated sources

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Machine Learning Experts - Margaret Mitchell
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Announcing the 🤗 AI Research Residency Program
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
New GPT-3 capabilities: Edit & insert
We’ve released new versions of GPT-3 and Codex which can edit or insert content into existing text, rather than just completing existing text.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
A research agenda for assessing the economic impacts of code generation models
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Economic impacts research at OpenAI
Call for expressions of interest to study the economic impacts of large language models.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Lessons learned on language model safety and misuse
We describe our latest thinking in the hope of helping other AI developers address safety and misuse of deployed models.
Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Learning with not Enough Data Part 2: Active Learning
This is part 2 of what to do when facing a limited amount of labeled data for supervised learning tasks. This time we will get some amount of human labeling wor
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Solving (some) formal math olympiad problems
We built a neural theorem prover for Lean that learned to solve a variety of challenging high-school olympiad problems, including problems from the AMC12 and AI
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Aligning language models to follow instructions
We’ve trained language models that are much better at following user intentions than GPT-3 while also making them more truthful and less toxic, using techniques
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Introducing text and code embeddings
We are introducing embeddings, a new endpoint in the OpenAI API that makes it easy to perform natural language and code tasks like semantic search, clustering,
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Supercharged Searching on the 🤗 Hub
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Gradio is joining Hugging Face!
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
WebGPT: Improving the factual accuracy of language models through web browsing
We’ve fine-tuned GPT-3 to more accurately answer open-ended questions using a text-based web browser.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Perceiver IO: a scalable, fully-attentional model that works on any modality
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Customizing GPT-3 for your application
Fine-tune with a single command.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
OpenAI Residency
As part of our effort to support and develop AI talent, we’re excited to announce the OpenAI Residency.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Accelerating PyTorch distributed fine-tuning with Intel technologies
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
OpenAI’s API now available with no waitlist
Wider availability made possible by safety progress.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Fine-Tune XLSR-Wav2Vec2 for low-resource ASR with 🤗 Transformers
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Solving math word problems
We’ve trained a system that solves grade school math problems with nearly twice the accuracy of a fine-tuned GPT-3 model. It solves about 90% as many problems a
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
The Age of Machine Learning As Code Has Arrived
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Fine tuning CLIP with Remote Sensing (Satellite) images and captions
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Hosting your Models and Datasets on Hugging Face Spaces using Streamlit
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Showcase Your Projects in Spaces using Gradio
Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
How to Train Really Large Models on Many GPUs?
[Updated on 2022-03-13: add expert choice routing.] [U
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Summarizing books with human feedback
Scaling human oversight of AI systems for tasks that are difficult to evaluate.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Introducing Optimum: The Optimization Toolkit for Transformers at Scale
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Helen Toner joins OpenAI’s board of directors
Today, we’re excited to announce the appointment of Helen Toner to our board of directors.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
OpenAI Codex
We’ve created an improved version of OpenAI Codex, our AI system that translates natural language to code, and we are releasing it through our API in private be
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Introducing Triton: Open-source GPU programming for neural networks
We’re releasing Triton 1.0, an open-source Python-like programming language which enables researchers with no CUDA experience to write highly efficient GPU code
Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4y ago
Distill Hiatus
After five years, Distill will be taking a break.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Improving language model behavior by training on a curated dataset
Our latest research finds we can improve language model behavior with respect to specific behavioral values by fine-tuning on a small, curated dataset.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Few-shot learning in practice: GPT-Neo and the 🤗 Accelerated Inference API
Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Contrastive Representation Learning
The goal of contrastive representation learning is to learn such an embedding space in which similar sample pairs stay close to each other while dissimilar ones
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
OpenAI Scholars 2021: Final projects
We’re proud to announce that the 2021 class of OpenAI Scholars has completed our six-month mentorship program and has produced an open-source research project
Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4y ago
Adversarial Reprogramming of Neural Cellular Automata
Reprogramming Neural CA to exhibit novel behaviour, using adversarial attacks.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Will Hurd joins OpenAI’s board of directors
OpenAI is committed to developing general-purpose artificial intelligence that benefits all humanity, and we believe that achieving our goal requires expertise
Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5y ago
Branch Specialization
When a neural network layer is divided into multiple branches, neurons self-organize into coherent groupings.
Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Weaviate 1.2 release - transformer models
Weaviate v1.2 introduced support for transformers (DistilBERT, BERT, RoBERTa, Sentence-BERT, etc.) to vectorize and semantically search through your data.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago
GPT-3 powers the next generation of apps
Over 300 applications are delivering GPT-3–powered search, conversation, text completion, and other advanced AI features through our API.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
The Partnership: Amazon SageMaker and Hugging Face
Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Reducing Toxicity in Language Models
Large pretrained language models are trained over a sizable collection of online data. They unavoidably acquire certain toxic behavior
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago
Multimodal neurons in artificial neural networks
We’ve discovered neurons in CLIP that respond to the same concept whether presented literally, symbolically, or conceptually. This may explain CLIP’s accuracy i
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Simple considerations for simple people building fancy neural networks
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Retrieval Augmented Generation with Huggingface Transformers and Ray
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Hugging Face on PyTorch / XLA TPUs
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Faster TensorFlow models in Hugging Face Transformers