Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,694

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,442 Reads 5,252

Showing 5,252 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech

arXiv:2603.20112v1 Announce Type: cross Abstract: Personalizing Automatic Speech Recognition (ASR) for non-normative speech remains challenging because data col

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning

arXiv:2603.20116v1 Announce Type: cross Abstract: Conventional fine-tuning on domain-specific datasets can inadvertently alter a model's pretrained multimodal p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models

arXiv:2603.20122v1 Announce Type: cross Abstract: Large Language Models (LLMs) have been widely deployed, especially through free Web-based applications that ex

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

arXiv:2603.20161v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across diverse tasks. However, the trut

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning

arXiv:2603.20164v1 Announce Type: cross Abstract: Conventional robot social behavior generation has been limited in flexibility and autonomy, relying on predefi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation

arXiv:2603.20172v1 Announce Type: cross Abstract: Recent work on chain-of-thought (CoT) faithfulness reports single aggregate numbers (e.g., DeepSeek-R1 acknowl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

AI Agents Can Already Autonomously Perform Experimental High Energy Physics

arXiv:2603.20179v1 Announce Type: cross Abstract: Large language model-based AI agents are now able to autonomously execute substantial portions of a high energ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Adaptive Greedy Frame Selection for Long Video Understanding

arXiv:2603.20180v1 Announce Type: cross Abstract: Large vision--language models (VLMs) are increasingly applied to long-video question answering, yet inference

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Improving Generalization on Cybersecurity Tasks with Multi-Modal Contrastive Learning

arXiv:2603.20181v1 Announce Type: cross Abstract: The use of ML in cybersecurity has long been impaired by generalization issues: Models that work well in contr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking

arXiv:2603.20185v1 Announce Type: cross Abstract: Video agentic models have advanced challenging video-language tasks. However, most agentic approaches still he

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

HPS: Hard Preference Sampling for Human Preference Alignment

arXiv:2502.14400v5 Announce Type: replace Abstract: Aligning Large Language Model (LLM) responses with human preferences is vital for building safe and controll

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives

arXiv:2505.15693v3 Announce Type: replace Abstract: Recent advances in reinforcement learning (RL) have renewed interest in reward design for shaping agent beha

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Preference-Driven Multi-Objective Combinatorial Optimization with Conditional Computation

arXiv:2506.08898v4 Announce Type: replace Abstract: Recent deep reinforcement learning methods have achieved remarkable success in solving multi-objective combi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Improved Generalized Planning with LLMs through Strategy Refinement and Reflection

arXiv:2508.13876v2 Announce Type: replace Abstract: LLMs have recently been used to generate Python programs representing generalized plans in PDDL planning, i.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Evaluation-Aware Reinforcement Learning

arXiv:2509.19464v3 Announce Type: replace Abstract: Policy evaluation is a core component of many reinforcement learning (RL) algorithms and a critical tool for

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark

arXiv:2509.24897v2 Announce Type: replace Abstract: The integration of visual understanding and generation into unified multimodal models represents a significa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

PDDL Axioms Are Equivalent to Least Fixed Point Logic (Extended Version)

arXiv:2510.14412v2 Announce Type: replace Abstract: Axioms are a feature of the Planning Domain Definition Language PDDL that can be considered as a generalizat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DAPS++: Rethinking Diffusion Inverse Problems with Decoupled Posterior Annealing

arXiv:2511.17038v2 Announce Type: replace Abstract: From a Bayesian perspective, score-based diffusion solves inverse problems through joint inference, embeddin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

On Sample-Efficient Generalized Planning via Learned Transition Models

arXiv:2602.23148v3 Announce Type: replace Abstract: Generalized planning studies the construction of solution strategies that generalize across families of plan

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Unified Framework to Quantify Cultural Intelligence of AI

arXiv:2603.01211v2 Announce Type: replace Abstract: As generative AI technologies are increasingly being launched across the globe, assessing their competence t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Generative AI-assisted Participatory Modeling in Socio-Environmental Planning under Deep Uncertainty

arXiv:2603.17021v2 Announce Type: replace Abstract: Socio-environmental planning under deep uncertainty requires researchers to identify and conceptualize probl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DEAF: A Benchmark for Diagnostic Evaluation of Acoustic Faithfulness in Audio Language Models

arXiv:2603.18048v2 Announce Type: replace Abstract: Recent Audio Multimodal Large Language Models (Audio MLLMs) demonstrate impressive performance on speech ben

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Evaluating Game Difficulty in Tetris Block Puzzle

arXiv:2603.18994v2 Announce Type: replace Abstract: Tetris Block Puzzle is a single player stochastic puzzle in which a player places blocks on an 8 x 8 grid to

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge

arXiv:2310.11703v3 Announce Type: replace-cross Abstract: As high-dimensional vector data increasingly surpasses the processing capabilities of traditional data

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

LISAA: A Framework for Large Language Model Information Security Awareness Assessment

arXiv:2411.13207v3 Announce Type: replace-cross Abstract: The popularity of large language models (LLMs) continues to grow, and LLM-based assistants have become

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Understanding and Optimizing Multi-Stage AI Inference Pipelines

arXiv:2504.09775v5 Announce Type: replace-cross Abstract: The rapid evolution of Large Language Models (LLMs) has driven the need for increasingly sophisticated

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

HALO: Hierarchical Reinforcement Learning for Large-Scale Adaptive Traffic Signal Control

arXiv:2506.14391v3 Announce Type: replace-cross Abstract: Adaptive traffic signal control (ATSC) is essential for mitigating urban congestion in modern smart ci

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for Alzheimer's and Dementia Caregivers

arXiv:2506.15047v2 Announce Type: replace-cross Abstract: Family caregivers of individuals with Alzheimer's Disease and Related Dementia (AD/ADRD) face signific

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

World4RL: Diffusion World Models for Policy Refinement with Reinforcement Learning for Robotic Manipulation

arXiv:2509.19080v2 Announce Type: replace-cross Abstract: Robotic manipulation policies are commonly initialized through imitation learning, but their performan

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Responsible AI Technical Report

arXiv:2509.20057v4 Announce Type: replace-cross Abstract: KT developed a Responsible AI (RAI) assessment methodology and risk mitigation technologies to ensure

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning

arXiv:2509.24773v4 Announce Type: replace-cross Abstract: Video-conditioned audio generation, including Video-to-Sound (V2S) and Visual Text-to-Speech (VisualTT

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

FinReflectKG -- EvalBench: Benchmarking Financial KG with Multi-Dimensional Evaluation

arXiv:2510.05710v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly being used to extract structured knowledge from unstruct

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CARES: Context-Aware Resolution Selector for VLMs

arXiv:2510.19496v2 Announce Type: replace-cross Abstract: Large vision-language models (VLMs) commonly process images at native or high resolution to remain eff

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Rep2Text: Decoding Full Text from a Single LLM Token Representation

arXiv:2511.06571v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have achieved remarkable progress across diverse tasks, yet their interna

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

arXiv:2511.16665v3 Announce Type: replace-cross Abstract: The emergence of Large Language Models (LLMs) with strong reasoning capabilities marks a significant m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Phish, The Spam, and The Valid: Generating Feature-Rich Emails for Benchmarking LLMs

arXiv:2511.21448v5 Announce Type: replace-cross Abstract: In this paper, we introduce a metadata-enriched generation framework (PhishFuzzer) that seeds real ema

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Dementia-R1: Reinforced Pretraining and Reasoning from Unstructured Clinical Notes for Real-World Dementia Prognosis

arXiv:2601.03018v2 Announce Type: replace-cross Abstract: While Large Language Models (LLMs) have shown strong performance on clinical text understanding, they

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Multi-Perspective Benchmark and Moderation Model for Evaluating Safety and Adversarial Robustness

arXiv:2601.03273v2 Announce Type: replace-cross Abstract: As large language models (LLMs) become deeply embedded in daily life, the urgent need for safer modera

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

StealthRL: Reinforcement Learning Paraphrase Attacks for Multi-Detector Evasion of AI-Text Detectors

arXiv:2602.08934v2 Announce Type: replace-cross Abstract: AI-text detectors face a critical robustness challenge: adversarial paraphrasing attacks that preserve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

LHAW: Controllable Underspecification for Long-Horizon Tasks

arXiv:2602.10525v2 Announce Type: replace-cross Abstract: Long-horizon workflow agents that operate effectively over extended periods are essential for truly au

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Art of Efficient Reasoning: Data, Reward, and Optimization

arXiv:2602.20945v3 Announce Type: replace-cross Abstract: Large Language Models (LLMs) consistently benefit from scaled Chain-of-Thought (CoT) reasoning, but al

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

On the Structural Non-Preservation of Epistemic Behaviour under Policy Transformation

arXiv:2602.21424v2 Announce Type: replace-cross Abstract: Reinforcement learning (RL) agents under partial observability often condition actions on internally a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CIRCUS: Circuit Consensus under Uncertainty via Stability Ensembles

arXiv:2603.00523v2 Announce Type: replace-cross Abstract: Every mechanistic circuit carries an invisible asterisk: it reflects not just the model's computation,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Intuition to Investigation: A Tool-Augmented Reasoning MLLM Framework for Generalizable Face Anti-Spoofing

arXiv:2603.01038v2 Announce Type: replace-cross Abstract: Face recognition remains vulnerable to presentation attacks, calling for robust Face Anti-Spoofing (FA

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

arXiv:2603.12180v2 Announce Type: replace-cross Abstract: Multimodal agents offer a promising path to automating complex document-intensive workflows. Yet, a cr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Prompt Injection as Role Confusion

arXiv:2603.12277v2 Announce Type: replace-cross Abstract: Language models remain vulnerable to prompt injection attacks despite extensive safety training. We tr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems

arXiv:2603.15727v2 Announce Type: replace-cross Abstract: Autonomous LLM-based agents increasingly operate as long-running processes forming densely interconnec

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

FEAT: A Linear-Complexity Foundation Model for Extremely Large Structured Data

arXiv:2603.16513v2 Announce Type: replace-cross Abstract: Structured data is foundational to healthcare, finance, e-commerce, and scientific data management. La