Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,905

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,460 Reads 5,445

Showing 5,445 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Machine Learning Experts - Margaret Mitchell

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Announcing the 🤗 AI Research Residency Program

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

New GPT-3 capabilities: Edit & insert

We’ve released new versions of GPT-3 and Codex which can edit or insert content into existing text, rather than just completing existing text.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

A research agenda for assessing the economic impacts of code generation models

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Economic impacts research at OpenAI

Call for expressions of interest to study the economic impacts of large language models.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Lessons learned on language model safety and misuse

We describe our latest thinking in the hope of helping other AI developers address safety and misuse of deployed models.

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Learning with not Enough Data Part 2: Active Learning

This is part 2 of what to do when facing a limited amount of labeled data for supervised learning tasks. This time we will get some amount of human labeling wor

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Solving (some) formal math olympiad problems

We built a neural theorem prover for Lean that learned to solve a variety of challenging high-school olympiad problems, including problems from the AMC12 and AI

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Aligning language models to follow instructions

We’ve trained language models that are much better at following user intentions than GPT-3 while also making them more truthful and less toxic, using techniques

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Introducing text and code embeddings

We are introducing embeddings, a new endpoint in the OpenAI API that makes it easy to perform natural language and code tasks like semantic search, clustering,

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Supercharged Searching on the 🤗 Hub

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Gradio is joining Hugging Face!

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

WebGPT: Improving the factual accuracy of language models through web browsing

We’ve fine-tuned GPT-3 to more accurately answer open-ended questions using a text-based web browser.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Perceiver IO: a scalable, fully-attentional model that works on any modality

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Customizing GPT-3 for your application

Fine-tune with a single command.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

OpenAI Residency

As part of our effort to support and develop AI talent, we’re excited to announce the OpenAI Residency.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Accelerating PyTorch distributed fine-tuning with Intel technologies

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

OpenAI’s API now available with no waitlist

Wider availability made possible by safety progress.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Fine-Tune XLSR-Wav2Vec2 for low-resource ASR with 🤗 Transformers

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Solving math word problems

We’ve trained a system that solves grade school math problems with nearly twice the accuracy of a fine-tuned GPT-3 model. It solves about 90% as many problems a

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

The Age of Machine Learning As Code Has Arrived

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Fine tuning CLIP with Remote Sensing (Satellite) images and captions

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Hosting your Models and Datasets on Hugging Face Spaces using Streamlit

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Showcase Your Projects in Spaces using Gradio

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

How to Train Really Large Models on Many GPUs?

[Updated on 2022-03-13: add expert choice routing .] [U

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Summarizing books with human feedback

Scaling human oversight of AI systems for tasks that are difficult to evaluate.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Introducing Optimum: The Optimization Toolkit for Transformers at Scale

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Helen Toner joins OpenAI’s board of directors

Today, we’re excited to announce the appointment of Helen Toner to our board of directors.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

We’ve created an improved version of OpenAI Codex, our AI system that translates natural language to code, and we are releasing it through our API in private be

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Introducing Triton: Open-source GPU programming for neural networks

We’re releasing Triton 1.0, an open-source Python-like programming language which enables researchers with no CUDA experience to write highly efficient GPU code

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4y ago

After five years, Distill will be taking a break.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Improving language model behavior by training on a curated dataset

Our latest research finds we can improve language model behavior with respect to specific behavioral values by fine-tuning on a small, curated dataset.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Few-shot learning in practice: GPT-Neo and the 🤗 Accelerated Inference API

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Contrastive Representation Learning

The goal of contrastive representation learning is to learn such an embedding space in which similar sample pairs stay close to each other while dissimilar ones

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

OpenAI Scholars 2021: Final projects

We’re proud to announce that the 2021 class of OpenAI Scholars has completed our six-month mentorship program and have produced an open-source research project

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4y ago

Adversarial Reprogramming of Neural Cellular Automata

Reprogramming Neural CA to exhibit novel behaviour, using adversarial attacks.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Will Hurd joins OpenAI’s board of directors

OpenAI is committed to developing general-purpose artificial intelligence that benefits all humanity, and we believe that achieving our goal requires expertise

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5y ago

Branch Specialization

When a neural network layer is divided into multiple branches, neurons self-organize into coherent groupings.

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Weaviate 1.2 release - transformer models

Weaviate v1.2 introduced support for transformers (DistilBERT, BERT, RoBERTa, Sentence-BERT, etc) to vectorize and semantically search through your data.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

GPT-3 powers the next generation of apps

Over 300 applications are delivering GPT-3–powered search, conversation, text completion, and other advanced AI features through our API.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

The Partnership: Amazon SageMaker and Hugging Face

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Reducing Toxicity in Language Models

Large pretrained language models are trained over a sizable collection of online data. They unavoidably acquire certain toxic behavior

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

Multimodal neurons in artificial neural networks

We’ve discovered neurons in CLIP that respond to the same concept whether presented literally, symbolically, or conceptually. This may explain CLIP’s accuracy i

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Simple considerations for simple people building fancy neural networks

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Retrieval Augmented Generation with Huggingface Transformers and Ray

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Hugging Face on PyTorch / XLA TPUs

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Faster TensorFlow models in Hugging Face Transformers