Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,516

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,400 Reads 5,116

Showing 5,116 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

TokenDance: Token-to-Token Music-to-Dance Generation with Bidirectional Mamba

arXiv:2603.27314v1 Announce Type: new Abstract: Music-to-dance generation has broad applications in virtual reality, dance education, and digital character anim

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

CounterMoral: Editing Morals in Language Models

arXiv:2603.27338v1 Announce Type: new Abstract: Recent advancements in language model technology have significantly enhanced the ability to edit factual informa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Beyond Completion: Probing Cumulative State Tracking to Predict LLM Agent Performance

arXiv:2603.27343v1 Announce Type: new Abstract: Task-completion rate is the standard proxy for LLM agent capability, but models with identical completion scores

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

LLM Readiness Harness: Evaluation, Observability, and CI Gates for LLM/RAG Applications

arXiv:2603.27355v1 Announce Type: new Abstract: We present a readiness harness for LLM and RAG applications that turns evaluation into a deployment decision wor

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Defend: Automated Rebuttals for Peer Review with Minimal Author Guidance

arXiv:2603.27360v1 Announce Type: new Abstract: Rebuttal generation is a critical component of the peer review process for scientific papers, enabling authors t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Heterogeneous Debate Engine: Identity-Grounded Cognitive Architecture for Resilient LLM-Based Ethical Tutoring

arXiv:2603.27404v1 Announce Type: new Abstract: Large Language Models (LLMs) are being increasingly used as autonomous agents in complex reasoning tasks, openin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Greedy Is a Strong Default: Agents as Iterative Optimizers

arXiv:2603.27415v1 Announce Type: new Abstract: Classical optimization algorithms--hill climbing, simulated annealing, population-based methods--generate candid

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

AstraAI: LLMs, Retrieval, and AST-Guided Assistance for HPC Codebases

arXiv:2603.27423v1 Announce Type: new Abstract: We present AstraAI, a command-line interface (CLI) coding framework for high-performance computing (HPC) softwar

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Dual-Stage LLM Framework for Scenario-Centric Semantic Interpretation in Driving Assistance

arXiv:2603.27536v1 Announce Type: new Abstract: Advanced Driver Assistance Systems (ADAS) increasingly rely on learning-based perception, yet safety-relevant fa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

DSevolve: Enabling Real-Time Adaptive Scheduling on Dynamic Shop Floor with LLM-Evolved Heuristic Portfolios

arXiv:2603.27628v1 Announce Type: new Abstract: In dynamic manufacturing environments, disruptions such as machine breakdowns and new order arrivals continuousl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

SkyNet: Belief-Aware Planning for Partially-Observable Stochastic Games

arXiv:2603.27751v1 Announce Type: new Abstract: In 2019, Google DeepMind released MuZero, a model-based reinforcement learning method that achieves strong resul

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

CARV: A Diagnostic Benchmark for Compositional Analogical Reasoning in Multimodal LLMs

arXiv:2603.27958v1 Announce Type: new Abstract: Analogical reasoning tests a fundamental aspect of human cognition: mapping the relation from one pair of object

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

SARL: Label-Free Reinforcement Learning by Rewarding Reasoning Topology

arXiv:2603.27977v1 Announce Type: new Abstract: Reinforcement learning has become central to improving large reasoning models, but its success still relies heav

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

When Choices Become Priors: Contrastive Decoding for Scientific Figure Multiple-Choice QA

arXiv:2603.28026v1 Announce Type: new Abstract: Scientific figure multiple-choice question answering (MCQA) requires models to reason over diverse visual eviden

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Beyond the Answer: Decoding the Behavior of LLMs as Scientific Reasoners

arXiv:2603.28038v1 Announce Type: new Abstract: As Large Language Models (LLMs) achieve increasingly sophisticated performance on complex reasoning tasks, curre

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Meta-Harness: End-to-End Optimization of Model Harnesses

arXiv:2603.28052v1 Announce Type: new Abstract: The performance of large language model (LLM) systems depends not only on model weights, but also on their harne

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

SLOW: Strategic Logical-inference Open Workspace for Cognitive Adaptation in AI Tutoring

arXiv:2603.28062v1 Announce Type: new Abstract: While Large Language Models (LLMs) have demonstrated remarkable fluency in educational dialogues, most generativ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reasoning

arXiv:2603.28135v1 Announce Type: new Abstract: Recent test-time reasoning methods improve performance by generating more candidate chains or searching over lar

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

PReD: An LLM-based Foundation Multimodal Model for Electromagnetic Perception, Recognition, and Decision

arXiv:2603.28183v1 Announce Type: new Abstract: Multimodal Large Language Models have demonstrated powerful cross-modal understanding and reasoning capabilities

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

EpiPersona: Persona Projection and Episode Coupling for Pluralistic Preference Modeling

arXiv:2603.28197v1 Announce Type: new Abstract: Pluralistic alignment is essential for adapting large language models (LLMs) to the diverse preferences of indiv

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Reasoning as Energy Minimization over Structured Latent Trajectories

arXiv:2603.28248v1 Announce Type: new Abstract: Single-shot neural decoders commit to answers without iterative refinement, while chain-of-thought methods intro

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Evaluating LLMs for Answering Student Questions in Introductory Programming Courses

arXiv:2603.28295v1 Announce Type: new Abstract: The rapid emergence of Large Language Models (LLMs) presents both opportunities and challenges for programming e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

CoE: Collaborative Entropy for Uncertainty Quantification in Agentic Multi-LLM Systems

arXiv:2603.28360v1 Announce Type: new Abstract: Uncertainty estimation in multi-LLM systems remains largely single-model-centric: existing methods quantify unce

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Deep Research of Deep Research: From Transformer to Agent, From AI to AI for Science

arXiv:2603.28361v1 Announce Type: new Abstract: With the advancement of large language models (LLMs) in their knowledge base and reasoning capabilities, their i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

COvolve: Adversarial Co-Evolution of Large-Language-Model-Generated Policies and Environments via Two-Player Zero-Sum Game

arXiv:2603.28386v1 Announce Type: new Abstract: A central challenge in building continually improving agents is that training environments are typically static

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

The Scaffold Effect: How Prompt Framing Drives Apparent Multimodal Gains in Clinical VLM Evaluation

arXiv:2603.28387v1 Announce Type: new Abstract: Trustworthy clinical AI requires that performance gains reflect genuine evidence integration rather than surface

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

T-Norm Operators for EU AI Act Compliance Classification: An Empirical Comparison of Lukasiewicz, Product, and G\"odel Semantics in a Neuro-Symbolic Reasoning System

arXiv:2603.28558v1 Announce Type: new Abstract: We present a first comparative pilot study of three t-norm operators -- Lukasiewicz (T_L), Product (T_P), and G\

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models

arXiv:2603.28590v1 Announce Type: new Abstract: Large language models (LLMs) can generate chains of thought (CoTs) that are not always causally responsible for

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Seeing with You: Perception-Reasoning Coevolution for Multimodal Reasoning

arXiv:2603.28618v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) has substantially enhanced the reasoning capabilities of m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

The Ultimate Tutorial for AI-driven Scale Development in Generative Psychometrics: Releasing AIGENIE from its Bottle

arXiv:2603.28643v1 Announce Type: new Abstract: Psychological scale development has traditionally required extensive expert involvement, iterative revision, and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Not Search, But Scan: Benchmarking MLLMs on Scan-Oriented Academic Paper Reasoning

arXiv:2603.28651v1 Announce Type: new Abstract: With the rapid progress of multimodal large language models (MLLMs), AI already performs well at literature retr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Exploring Cultural Variations in Moral Judgments with Large Language Models

arXiv:2506.12433v2 Announce Type: cross Abstract: Large Language Models (LLMs) have shown strong performance across many tasks, but their ability to capture cul

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

SimulCost: A Cost-Aware Benchmark and Toolkit for Automating Physics Simulations with LLMs

arXiv:2603.20253v1 Announce Type: cross Abstract: Evaluating LLM agents for scientific tasks has focused on token costs while ignoring tool-use costs like simul

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

M-RAG: Making RAG Faster, Stronger, and More Efficient

arXiv:2603.26667v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) has become a widely adopted paradigm for enhancing the reliability of lar

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Bridge-RAG: An Abstract Bridge Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter

arXiv:2603.26668v1 Announce Type: cross Abstract: As an important paradigm for enhancing the generation quality of Large Language Models (LLMs), retrieval-augme

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

ReCQR: Incorporating conversational query rewriting to improve Multimodal Image Retrieval

arXiv:2603.26669v1 Announce Type: cross Abstract: With the rise of multimodal learning, image retrieval plays a crucial role in connecting visual information wi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Can AI be a Teaching Partner? Evaluating ChatGPT, Gemini, and DeepSeek across Three Teaching Strategies

arXiv:2603.26673v1 Announce Type: cross Abstract: There are growing promises that Large Language Models (LLMs) can support students' learning by providing expla

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

AlpsBench: An LLM Personalization Benchmark for Real-Dialogue Memorization and Preference Alignment

arXiv:2603.26680v1 Announce Type: cross Abstract: As Large Language Models (LLMs) evolve into lifelong AI assistants, LLM personalization has become a critical

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

LITTA: Late-Interaction and Test-Time Alignment for Visually-Grounded Multimodal Retrieval

arXiv:2603.26683v1 Announce Type: cross Abstract: Retrieving relevant evidence from visually rich documents such as textbooks, technical reports, and manuals is

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

SpatialPoint: Spatial-aware Point Prediction for Embodied Localization

arXiv:2603.26690v1 Announce Type: cross Abstract: Embodied intelligence fundamentally requires a capability to determine where to act in 3D space. We formalize

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Complementarity-Preserving Generative Theory for Multimodal ECG Synthesis: A Quantum-Inspired Approach

arXiv:2603.26695v1 Announce Type: cross Abstract: Multimodal deep learning has substantially improved electrocardiogram (ECG) classification by jointly leveragi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

The Cognitive Divergence: AI Context Windows, Human Attention Decline, and the Delegation Feedback Loop

arXiv:2603.26707v1 Announce Type: cross Abstract: This paper documents and theorises a self-reinforcing dynamic between two measurable trends: the exponential e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Agentic AI for Human Resources: LLM-Driven Candidate Assessment

arXiv:2603.26710v1 Announce Type: cross Abstract: In this work, we present a modular and interpretable framework that uses Large Language Models (LLMs) to autom

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Brain-inspired AI for Edge Intelligence: a systematic review

arXiv:2603.26722v1 Announce Type: cross Abstract: While Spiking Neural Networks (SNNs) promise to circumvent the severe Size, Weight, and Power (SWaP) constrain

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

SEAR: Schema-Based Evaluation and Routing for LLM Gateways

arXiv:2603.26728v1 Announce Type: cross Abstract: Evaluating production LLM responses and routing requests across providers in LLM gateways requires fine-graine

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Contextual inference from single objects in Vision-Language models

arXiv:2603.26731v1 Announce Type: cross Abstract: How much scene context a single object carries is a well-studied question in human scene perception, yet how t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Distilled Large Language Model-Driven Dynamic Sparse Expert Activation Mechanism

arXiv:2603.26735v1 Announce Type: cross Abstract: High inter-class similarity, extreme scale variation, and limited computational budgets hinder reliable visual

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Beyond Static Visual Tokens: Structured Sequential Visual Chain-of-Thought Reasoning

arXiv:2603.26737v1 Announce Type: cross Abstract: Current multimodal LLMs encode images as static visual prefixes and rely on text-based reasoning, lacking goal