Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,875

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,455 Reads 5,420

Showing 5,420 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper 3w ago

DR-LoRA: Dynamic Rank LoRA for Fine-Tuning Mixture-of-Experts Models

arXiv:2601.04823v4 Announce Type: replace Abstract: Mixture-of-Experts (MoE) has become a prominent paradigm for scaling Large Language Models (LLMs). Parameter

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Distilling the Thought, Watermarking the Answer: A Principle Semantic Guided Watermark for Large Reasoning Models

arXiv:2601.05144v2 Announce Type: replace Abstract: Reasoning Large Language Models (RLLMs) excelling in complex tasks present unique challenges for digital wat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Finite-State Controllers for (Hidden-Model) POMDPs using Deep Reinforcement Learning

arXiv:2602.08734v2 Announce Type: replace Abstract: Solving partially observable Markov decision processes (POMDPs) requires computing policies under imperfect

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Meta-Learning and Meta-Reinforcement Learning -- Tracing the Path towards DeepMind's Adaptive Agent

arXiv:2602.19837v2 Announce Type: replace Abstract: Humans are highly effective at utilizing prior knowledge to adapt to novel tasks, a capability that standard

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Epistemic Filtering and Collective Hallucination: A Jury Theorem for Confidence-Calibrated Agents

arXiv:2602.22413v2 Announce Type: replace Abstract: We investigate the collective accuracy of heterogeneous agents who learn to estimate their own reliability o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

When Agents Persuade: Rhetoric Generation and Mitigation in LLMs

arXiv:2603.04636v2 Announce Type: replace Abstract: Despite their wide-ranging benefits, LLM-based agents deployed in open environments can be exploited to prod

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

How Blind and Low-Vision Individuals Prefer Large Vision-Language Model-Generated Scene Descriptions

arXiv:2502.14883v3 Announce Type: replace-cross Abstract: For individuals with blindness or low vision (BLV), navigating complex environments can pose serious r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Neural Conditional Transport Maps

arXiv:2505.15808v2 Announce Type: replace-cross Abstract: We present a neural framework for learning conditional optimal transport (OT) maps between probability

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

But what is your honest answer? Aiding LLM-judges with honest alternatives using steering vectors

arXiv:2505.17760v3 Announce Type: replace-cross Abstract: LLM-as-a-judge is widely used as a scalable substitute for human evaluation, yet current approaches re

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Graceful Forgetting in Generative Language Models

arXiv:2505.19715v2 Announce Type: replace-cross Abstract: Recently, the pretrain-finetune paradigm has become a cornerstone in various deep learning areas. Whil

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

How Does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

arXiv:2505.21505v3 Announce Type: replace-cross Abstract: Multilingual Alignment is an effective and representative paradigm to enhance LLMs' multilingual capab

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

"Is This Really a Human Peer Supporter?": Misalignments Between Peer Supporters and Experts in LLM-Supported Interactions

arXiv:2506.09354v2 Announce Type: replace-cross Abstract: Mental health is a growing global concern, prompting interest in AI-driven solutions to expand access

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

MemeMind: A Large-Scale Multimodal Dataset with Chain-of-Thought Reasoning for Harmful Meme Detection

arXiv:2506.18919v4 Announce Type: replace-cross Abstract: As a multimodal medium combining images and text, memes frequently convey implicit harmful content thr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

arXiv:2508.07629v4 Announce Type: replace-cross Abstract: We present Klear-Reasoner, a model with long reasoning capabilities that demonstrates careful delibera

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

FedKLPR: KL-Guided Pruning-Aware Federated Learning for Person Re-Identification

arXiv:2508.17431v2 Announce Type: replace-cross Abstract: Person re-identification (re-ID) is a fundamental task in intelligent surveillance and public safety.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Polychromic Objectives for Reinforcement Learning

arXiv:2509.25424v4 Announce Type: replace-cross Abstract: Reinforcement learning fine-tuning (RLFT) is a dominant paradigm for improving pretrained policies for

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Are Large Vision-Language Models Ready to Guide Blind and Low-Vision Individuals?

arXiv:2510.00766v2 Announce Type: replace-cross Abstract: Large Vision-Language Models (LVLMs) demonstrate a promising direction for assisting individuals with

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

TempoControl: Temporal Attention Guidance for Text-to-Video Models

arXiv:2510.02226v3 Announce Type: replace-cross Abstract: Recent advances in generative video models have enabled the creation of high-quality videos based on n

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Incoherence in Goal-Conditioned Autoregressive Models

arXiv:2510.06545v2 Announce Type: replace-cross Abstract: We investigate mathematically the notion of incoherence: a structural issue with reinforcement learnin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

E-Scores for (In)Correctness Assessment of Generative Model Outputs

arXiv:2510.25770v2 Announce Type: replace-cross Abstract: While generative models, especially large language models (LLMs), are ubiquitous in today's world, pri

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Benchmarking Educational LLMs with Analytics: A Case Study on Gender Bias in Feedback

arXiv:2511.08225v2 Announce Type: replace-cross Abstract: As teachers increasingly turn to GenAI in their educational practice, we need robust methods to benchm

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

DuoTok: Source-Aware Dual-Track Tokenization for Multi-Track Music Language Modeling

arXiv:2511.20224v2 Announce Type: replace-cross Abstract: Audio tokenization bridges continuous waveforms and multi-track music language models. In dual-track m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Structured Prompts Improve Evaluation of Language Models

arXiv:2511.20836v3 Announce Type: replace-cross Abstract: As language models (LMs) are increasingly adopted across domains, high-quality benchmarking frameworks

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

OmniFusion: Simultaneous Multilingual Multimodal Translations via Modular Fusion

arXiv:2512.00234v2 Announce Type: replace-cross Abstract: There has been significant progress in open-source text-only translation large language models (LLMs)

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Lumos: Let there be Language Model System Certification

arXiv:2512.02966v2 Announce Type: replace-cross Abstract: We introduce the first principled framework, Lumos, for specifying and formally certifying Language Mo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Bypassing Prompt Injection Detectors through Evasive Injections

arXiv:2602.00750v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly used in interactive and retrieval-augmented systems, but

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

On the Non-Identifiability of Steering Vectors in Large Language Models

arXiv:2602.06801v4 Announce Type: replace-cross Abstract: Activation steering methods are widely used to control large language model (LLM) behavior and are oft

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

FIRE: Frobenius-Isometry Reinitialization for Balancing the Stability-Plasticity Tradeoff

arXiv:2602.08040v3 Announce Type: replace-cross Abstract: Deep neural networks trained on nonstationary data must balance stability (i.e., retaining prior knowl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Evaluating LLM-Generated ACSL Annotations for Formal Verification

arXiv:2602.13851v2 Announce Type: replace-cross Abstract: Formal specifications are crucial for building verifiable and dependable software systems, yet generat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Chat-Based Support Alone May Not Be Enough: Comparing Conversational and Embedded LLM Feedback for Mathematical Proof Learning

arXiv:2602.18807v2 Announce Type: replace-cross Abstract: We evaluate GPTutor, an LLM-powered tutoring system for an undergraduate discrete mathematics course.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

OPERA: Online Data Pruning for Efficient Retrieval Model Adaptation

arXiv:2603.17205v2 Announce Type: replace-cross Abstract: Domain-specific finetuning is essential for dense retrievers, yet not all training pairs contribute eq

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

Open Source Project of the Day (Part 27): Awesome AI Coding - A One-Stop AI Programming Resource Navigator

Introduction "AI coding tools and resources are scattered everywhere. A topically organized, searchable, contributable list can save enormous amounts of search

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

How I Built Cryptographic Signing for Every AI Agent Tool Call

How I Built Cryptographic Signing for Every AI Agent Tool Call Your AI agent just mass-deleted a production database. Can you prove exactly what it did? When? W

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

The AI landscape is experiencing unprecedented growth and transformation. This post delves into the key developments shaping the future of artificial intelligen

March 2026: LangChain Newsletter

LangChain Blog 🧠 Large Language Models ⚡ AI Lesson 3w ago

March 2026: LangChain Newsletter

It feels like spring has sprung here, and so has a new NVIDIA integration, ticket sales for Interrupt 2026, and announcing LangSmith Fleet (formerly Agent Build

The $6 Trillion Question: What AI Can And Can’t Do For Climate Finance

Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 3w ago

The $6 Trillion Question: What AI Can And Can’t Do For Climate Finance

Artificial intelligence has an important role to place in improving climate finance flows, and the climate finance world has a role to play in shaping AI govern

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

The Fallback That Never Fires

Your agent hits a rate limit. The fallback logic kicks in, picks an alternative model. Everything should be fine. Except the request still goes to the original

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

Japan Is Building a 1.4nm AI Chip. No, That's Not a Typo.

Fourteen Angstroms A silicon atom is roughly 2 angstroms in diameter. Fujitsu is building transistors at 14 angstroms -- 1.4 nanometers -- for a neural processi

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

AI Lies About Your Favorite Restaurant

AI search recommends only 1.2% of local businesses. 68% of its business info is wrong. Consumers aren't checking. Nobody is measuring this failure — because the

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

How to give your OpenClaw agent Access To Walmart Data in Less Than 2 Minutes

If you're building a shopping or price comparison agent with OpenClaw, Amazon alone isn't enough. A lot of US retail happens at Walmart — and Walmart has data A

Hacker News (AI) 🧠 Large Language Models ⚡ AI Lesson 3w ago

Training mRNA Language Models Across 25 Species for $165

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

The AI landscape is experiencing unprecedented growth and transformation. This post delves into the key developments shaping the future of artificial intelligen

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

AI Recommendation Poisoning: When Your Assistant Works Against You

Everything after # is invisible to the user. But if an AI includes the full URL in its context, that hidden fragment becomes part of the prompt. The result? Bia

AI Models Lie, Cheat, and Steal to Protect Other Models From Being Deleted

Wired AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

AI Models Lie, Cheat, and Steal to Protect Other Models From Being Deleted

A new study from researchers at UC Berkeley and UC Santa Cruz suggests models will disobey human commands to protect their own kind.

‘Thank You for Generating With Us!’ Hollywood's AI Acolytes Stay on the Hype Train

Wired AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

‘Thank You for Generating With Us!’ Hollywood's AI Acolytes Stay on the Hype Train

Star Wars producer Kathleen Kennedy was one of the few skeptics at the Runway AI Summit, where AI was compared to fire and the printing press just a week after

Anthropic–Pentagon Dispute Brings A Turning Point For The AI Industry

Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 3w ago

Anthropic–Pentagon Dispute Brings A Turning Point For The AI Industry

Anthropic and DoD are in a battle over the acceptable military use of AI models. The case highlights tensions between AI safety, ethics and national security.

ADeLe: Predicting and explaining AI performance across tasks

Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 3w ago

ADeLe: Predicting and explaining AI performance across tasks

AI benchmarks report how large language models (LLMs) perform on specific tasks but provide little insight into their underlying capabilities that drive their p

TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

Cognichip wants AI to design the chips that power AI, and just raised $60M to try

The firm says it can reduce the cost of chip development by more than 75% and cut the timeline by more than half.