Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,908

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,459 Reads 5,449

Showing 5,449 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Simple considerations for simple people building fancy neural networks

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Retrieval Augmented Generation with Huggingface Transformers and Ray

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Hugging Face on PyTorch / XLA TPUs

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Faster TensorFlow models in Hugging Face Transformers

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Fit More and Train Faster With ZeRO via DeepSpeed and FairScale

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

Organizational update from OpenAI

It’s been a year of dramatic change and growth at OpenAI.

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5y ago

Understanding RL Vision

With diverse environments, we can analyze, diagnose and edit deep reinforcement learning models using attribution.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Hyperparameter Search with Transformers and Ray Tune

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

OpenAI licenses GPT-3 technology to Microsoft

OpenAI has agreed to license GPT-3 to Microsoft for their own products and services.

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5y ago

Thread: Differentiable Self-organizing Systems

A collection of articles and comments with the goal of understanding how to design robust and general purpose self-organizing systems.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

OpenAI Scholars 2020: Final projects

Our third class of OpenAI Scholars presented their final projects at virtual Demo Day, showcasing their research results from over the past five months.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

Procgen and MineRL Competitions

We’re excited to announce that OpenAI is co-organizing two NeurIPS 2020 competitions with AIcrowd, Carnegie Mellon University, and DeepMind, using Procgen Bench

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

We’re releasing an API for accessing new AI models developed by OpenAI.

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Exploration Strategies in Deep Reinforcement Learning

[Updated on 2020-06-17: Add “exploration via disagreement” in the “Forward Dynamics” section . Exploitation versus ex

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

AI and efficiency

We’re releasing an analysis showing that since 2012 the amount of compute needed to train a neural net to the same performance on ImageNet classification has be

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

We’re introducing Jukebox, a neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artist styles. We’re releas

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

Improving verifiability in AI development

We’ve contributed to a multi-stakeholder report by 58 co-authors at 30 organizations, including the Centre for the Future of Intelligence, Mila, Schwartz Reisma

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

OpenAI standardizes on PyTorch

We are standardizing OpenAI’s deep learning framework on PyTorch.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

Procgen Benchmark

We’re releasing Procgen Benchmark, 16 simple-to-use procedurally-generated environments which provide a direct measure of how quickly a reinforcement learning a

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

We’re releasing Safety Gym, a suite of environments and tools for measuring progress towards reinforcement learning agents that respect safety constraints while

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 6y ago

Self-Supervised Representation Learning

[Updated on 2020-01-09: add a new section on Contrastive Predictive Coding ]. [Updated on 2020-04-13: add a “Momentum Contra

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

GPT-2: 1.5B release

As the final model release of GPT-2’s staged release, we’re releasing the largest version (1.5B parameters) of GPT-2 along with code and model weights to facili

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

Solving Rubik’s Cube with a robot hand

We’ve trained a pair of neural networks to solve the Rubik’s Cube with a human-like robot hand. The neural networks are trained entirely in simulation, using th

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

OpenAI Scholars 2020: Applications open

We are now accepting applications for our third class of OpenAI Scholars.

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6y ago

The Paths Perspective on Value Learning

A closer look at how Temporal Difference Learning merges paths of experience for greater statistical efficiency

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

Fine-tuning GPT-2 from human preferences

We’ve fine-tuned the 774M parameter GPT-2 language model using human feedback for various tasks, successfully matching the preferences of the external human lab

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 6y ago

Evolution Strategies

Stochastic gradient descent is a universal choice for optimizing deep learning models. However, it is not the only option. With bl

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

Testing robustness against unforeseen adversaries

We’ve developed a method to assess whether a neural network classifier can reliably defend against adversarial attacks not seen during training. Our method yiel

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

GPT-2: 6-month follow-up

We’re releasing the 774 million parameter GPT-2 language model after the release of our small 124M model in February, staged release of our medium 355M model in

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6y ago

A Discussion of 'Adversarial Examples Are Not Bugs, They Are Features': Learning from Incorrectly Labeled Data

Section 3.2 of Ilyas et al. (2019) shows that training a model on only adversarial errors leads to non-trivial generalization on the original test set. We show

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

Microsoft invests in and partners with OpenAI to support us building beneficial AGI

Microsoft is investing $1 billion in OpenAI to support us building artificial general intelligence (AGI) with widely distributed economic benefits. We’re partne

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

Why responsible AI development needs cooperation on safety

We’ve written a policy research paper identifying four strategies that can be used today to improve the likelihood of long-term industry cooperation on safety n

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 6y ago

Meta Reinforcement Learning

In my earlier post on meta-learning , the problem is mainly defined in the context of few-shot classificati

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

OpenAI Robotics Symposium 2019

We hosted the first OpenAI Robotics Symposium on April 27, 2019.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

OpenAI Scholars 2019: Final projects

Our second class of OpenAI Scholars has concluded, with all eight scholars producing an exciting final project showcased at Scholars Demo Day at OpenAI.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 6y ago

OpenAI Fellows Fall 2018: Final projects

Our second class of OpenAI Fellows has wrapped up, with each Fellow going from a machine learning beginner to core OpenAI contributor in the course of a 6-month

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 6y ago

Domain Randomization for Sim2Real Transfer

In Robotics, one of the hardest problems is how to make your model transfer to the real world. Due to the sample inefficiency of deep RL algorithms and the cost

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 7y ago

We’ve created MuseNet, a deep neural network that can generate 4-minute musical compositions with 10 different instruments, and can combine styles from country

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 7y ago

Generative modeling with sparse transformers

We’ve developed the Sparse Transformer, a deep neural network which sets new records at predicting what comes next in a sequence—whether text, images, or sound.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 7y ago

OpenAI Five defeats Dota 2 world champions

OpenAI Five is the first AI to beat the world champions in an esports game, having won two back-to-back games versus the world champion Dota 2 team, OG, at Fina

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 7y ago

OpenAI Five Finals

We’ll be holding our final live event for OpenAI Five at 11:30am PT on April 13.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 7y ago

Implicit generation and generalization methods for energy-based models

We’ve made progress towards stable and scalable training of energy-based models (EBMs) resulting in better sample quality and generalization ability than existi

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 7y ago

OpenAI Scholars 2019: Meet our Scholars

Our class of eight scholars (out of 550 applicants) brings together collective expertise in literature, philosophy, cell biology, statistics, economics, quantum

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 7y ago

Introducing Activation Atlases

We’ve created activation atlases (in collaboration with Google researchers), a new technique for visualizing what interactions between neurons can represent. As

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 7y ago

Neural MMO: A massively multiagent game environment

We’re releasing a Neural MMO, a massively multiagent game environment for reinforcement learning agents. Our platform supports a large, variable number of agent

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 7y ago

Spinning Up in Deep RL: Workshop review

On February 2, we held our first Spinning Up Workshop as part of our new education initiative at OpenAI.

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 7y ago

AI Safety Needs Social Scientists

If we want to train AI to do what humans want, we need to study humans.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 7y ago

AI safety needs social scientists

We’ve written a paper arguing that long-term AI safety research needs social scientists to ensure AI alignment algorithms succeed when actual humans are involve