Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,961

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,461 Reads 5,500

Showing 5,500 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Surrogates, Spikes, and Sparsity: Performance Analysis and Characterization of SNN Hyperparameters on Hardware

arXiv:2603.24891v1 Announce Type: cross Abstract: Spiking Neural Networks (SNNs) offer inherent advantages for low-power inference through sparse, event-driven

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Sovereign AI at the Front Door of Care: A Physically Unidirectional Architecture for Secure Clinical Intelligence

arXiv:2603.24898v1 Announce Type: cross Abstract: We present a Sovereign AI architecture for clinical triage in which all inference is performed on-device and i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Shaping the Future of Mathematics in the Age of AI

arXiv:2603.24914v1 Announce Type: cross Abstract: Artificial intelligence is transforming mathematics at a speed and scale that demand active engagement from th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Evaluating adaptive and generative AI-based feedback and recommendations in a knowledge-graph-integrated programming learning system

arXiv:2603.24940v1 Announce Type: cross Abstract: This paper introduces the design and development of a framework that integrates a large language model (LLM) w

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators

arXiv:2603.24986v1 Announce Type: cross Abstract: Large language model based health agents are increasingly used by health consumers and clinicians to interpret

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Learning Rollout from Sampling:An R1-Style Tokenized Traffic Simulation Model

arXiv:2603.24989v1 Announce Type: cross Abstract: Learning diverse and high-fidelity traffic simulations from human driving demonstrations is crucial for autono

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models

arXiv:2603.25015v1 Announce Type: cross Abstract: System prompt instructions that cooperate in English compete in Spanish, with the same semantic content, but o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Closing the Confidence-Faithfulness Gap in Large Language Models

arXiv:2603.25052v1 Announce Type: cross Abstract: Large language models (LLMs) tend to verbalize confidence scores that are largely detached from their actual a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities

arXiv:2603.25056v1 Announce Type: cross Abstract: System prompt configuration can make the difference between near-total phishing blindness and near-perfect det

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visualization

arXiv:2603.25063v1 Announce Type: cross Abstract: Recent agentic systems demonstrate that large language models can generate scientific visualizations from natu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Pixelis: Reasoning in Pixels, from Seeing to Acting

arXiv:2603.25091v1 Announce Type: cross Abstract: Most vision-language systems are static observers: they describe pixels, do not act, and cannot safely improve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization

arXiv:2603.25099v1 Announce Type: cross Abstract: We present a framework in which a large language model (LLM) acts as an online adaptive controller for SIMP to

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Layer-Specific Lipschitz Modulation for Fault-Tolerant Multimodal Representation Learning

arXiv:2603.25103v1 Announce Type: cross Abstract: Modern multimodal systems deployed in industrial and safety-critical environments must remain reliable under p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

arXiv:2603.25112v1 Announce Type: cross Abstract: Standard evaluation of LLM confidence relies on calibration metrics (ECE, Brier score) that conflate two disti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Reinforcement learning for quantum processes with memory

arXiv:2603.25138v1 Announce Type: cross Abstract: In reinforcement learning, an agent interacts sequentially with an environment to maximize a reward, receiving

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

SAVe: Self-Supervised Audio-visual Deepfake Detection Exploiting Visual Artifacts and Audio-visual Misalignment

arXiv:2603.25140v1 Announce Type: cross Abstract: Multimodal deepfakes can exhibit subtle visual artifacts and cross-modal inconsistencies, which remain challen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

FD$^2$: A Dedicated Framework for Fine-Grained Dataset Distillation

arXiv:2603.25144v1 Announce Type: cross Abstract: Dataset distillation (DD) compresses a large training set into a small synthetic set, reducing storage and tra

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Factors Influencing the Quality of AI-Generated Code: A Synthesis of Empirical Evidence

arXiv:2603.25146v1 Announce Type: cross Abstract: Context: The rapid adoption of AI-assisted code generation tools, such as large language models (LLMs), is tra

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models

arXiv:2603.25155v1 Announce Type: cross Abstract: Multimodal large language models are promising for clinical visual question answering tasks, but scaling to 3D

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

PIDP-Attack: Combining Prompt Injection with Database Poisoning Attacks on Retrieval-Augmented Generation Systems

arXiv:2603.25164v1 Announce Type: cross Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across a wide range of applications. How

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model

arXiv:2603.25184v1 Announce Type: cross Abstract: Reinforcement learning (RL) has become essential for post-training large language models (LLMs) in reasoning t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Probing the Lack of Stable Internal Beliefs in LLMs

arXiv:2603.25187v1 Announce Type: cross Abstract: Persona-driven large language models (LLMs) require consistent behavioral tendencies across interactions to si

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Decade-Scale Benchmark Evaluating LLMs' Clinical Practice Guidelines Detection and Adherence in Multi-turn Conversations

arXiv:2603.25196v1 Announce Type: cross Abstract: Clinical practice guidelines (CPGs) play a pivotal role in ensuring evidence-based decision-making and improvi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction

arXiv:2603.25209v1 Announce Type: cross Abstract: Generating long videos using pre-trained video diffusion models, which are typically trained on short clips, p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Wireless World Model for AI-Native 6G Networks

arXiv:2603.25216v1 Announce Type: cross Abstract: Integrating AI into the physical layer is a cornerstone of 6G networks. However, current data-driven approache

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

WebTestBench: Evaluating Computer-Use Agents towards End-to-End Automated Web Testing

arXiv:2603.25226v1 Announce Type: cross Abstract: The emergence of Large Language Models (LLMs) has catalyzed a paradigm shift in programming, giving rise to "v

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA

arXiv:2603.25243v1 Announce Type: cross Abstract: Large language models and autonomous agents are increasingly explored for EDA automation, but many existing in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Activation Matters: Test-time Activated Negative Labels for OOD Detection with Vision-Language Models

arXiv:2603.25250v1 Announce Type: cross Abstract: Out-of-distribution (OOD) detection aims to identify samples that deviate from in-distribution (ID). One popul

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MolQuest: A Benchmark for Agentic Evaluation of Abductive Reasoning in Chemical Structure Elucidation

arXiv:2603.25253v1 Announce Type: cross Abstract: Large language models (LLMs) hold considerable potential for advancing scientific discovery, yet systematic as

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CRAFT: Grounded Multi-Agent Coordination Under Partial Information

arXiv:2603.25268v1 Announce Type: cross Abstract: We introduce CRAFT, a multi-agent benchmark for evaluating pragmatic communication in large language models un

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Revealing the influence of participant failures on model quality in cross-silo Federated Learning

arXiv:2603.25289v1 Announce Type: cross Abstract: Federated Learning (FL) is a paradigm for training machine learning (ML) models in collaborative settings whil

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer's Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study

arXiv:2603.25322v1 Announce Type: cross Abstract: Alzheimer's disease (AD) is a growing global health challenge as populations age, and timely, accurate diagnos

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

arXiv:2603.25325v1 Announce Type: cross Abstract: Weight pruning is a standard technique for compressing large language models, yet its effect on learned intern

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

GlowQ: Group-Shared LOw-Rank Approximation for Quantized LLMs

arXiv:2603.25385v1 Announce Type: cross Abstract: Quantization techniques such as BitsAndBytes, AWQ, and GPTQ are widely used as a standard method in deploying

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Causal Framework for Evaluating ICU Discharge Strategies

arXiv:2603.25397v1 Announce Type: cross Abstract: In this applied paper, we address the difficult open problem of when to discharge patients from the Intensive

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Shape and Substance: Dual-Layer Side-Channel Attacks on Local Vision-Language Models

arXiv:2603.25403v1 Announce Type: cross Abstract: On-device Vision-Language Models (VLMs) promise data privacy via local execution. However, we show that the ar

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Decidable By Construction: Design-Time Verification for Trustworthy AI

arXiv:2603.25414v1 Announce Type: cross Abstract: A prevailing assumption in machine learning is that model correctness must be enforced after the fact. We obse

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Temporally Decoupled Diffusion Planning for Autonomous Driving

arXiv:2603.25462v1 Announce Type: cross Abstract: Motion planning in dynamic urban environments requires balancing immediate safety with long-term goals. While

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Maximum Entropy Behavior Exploration for Sim2Real Zero-Shot Reinforcement Learning

arXiv:2603.25464v1 Announce Type: cross Abstract: Zero-shot reinforcement learning (RL) algorithms aim to learn a family of policies from a reward-free dataset,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

arXiv:2603.25562v1 Announce Type: cross Abstract: On-policy distillation (OPD) is appealing for large language model (LLM) post-training because it evaluates te

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Are LLMs Overkill for Databases?: A Study on the Finiteness of SQL

arXiv:2603.25568v1 Announce Type: cross Abstract: Translating natural language to SQL for data retrieval has become more accessible thanks to code generation LL

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

TAAC: A gate into Trustable Audio Affective Computing

arXiv:2603.25570v1 Announce Type: cross Abstract: With the emergence of AI techniques for depression diagnosis, the conflict between high demand and limited sup

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verification

arXiv:2603.25613v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) have recently been explored as face verification systems that determi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers

arXiv:2603.25638v1 Announce Type: cross Abstract: Through an analysis of arXiv papers, we report several shifts in word usage that are likely driven by large la

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots

arXiv:2603.25646v1 Announce Type: cross Abstract: This paper presents an experimental platform for studying intentional-state attribution toward a non-humanoid

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Measuring What Matters -- or What's Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors

arXiv:2603.25674v1 Announce Type: cross Abstract: Automated systems have been widely adopted across the educational testing industry for open-response assessmen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Unified Memory Perspective for Probabilistic Trustworthy AI

arXiv:2603.25692v1 Announce Type: cross Abstract: Trustworthy artificial intelligence increasingly relies on probabilistic computation to achieve robustness, in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase

arXiv:2603.25697v1 Announce Type: cross Abstract: Code production is now a commodity; the bottleneck is knowing what to build and proving it works. We present t