Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,908

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,459 Reads 5,449

Showing 5,449 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

GUIDE: Resolving Domain Bias in GUI Agents through Real-Time Web Video Retrieval and Plug-and-Play Annotation

arXiv:2603.26266v1 Announce Type: new Abstract: Large vision-language models have endowed GUI agents with strong general capabilities for interface understandin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

AIRA_2: Overcoming Bottlenecks in AI Research Agents

arXiv:2603.26499v1 Announce Type: new Abstract: Existing research has identified three structural performance bottlenecks in AI research agents: (1) synchronous

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

CADSmith: Multi-Agent CAD Generation with Programmatic Geometric Validation

arXiv:2603.26512v1 Announce Type: new Abstract: Existing methods for text-to-CAD generation either operate in a single pass with no geometric verification or re

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Stabilizing Rubric Integration Training via Decoupled Advantage Normalization

arXiv:2603.26535v1 Announce Type: new Abstract: We propose Process-Aware Policy Optimization (PAPO), a method that integrates process-level evaluation into Grou

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models

arXiv:2603.25750v1 Announce Type: cross Abstract: As the paradigm of AI shifts from text-based LLMs to Speech Language Models (SLMs), there is a growing demand

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Consistency Amplifies: How Behavioral Variance Shapes Agent Accuracy

arXiv:2603.25764v1 Announce Type: cross Abstract: As LLM-based agents are deployed in production systems, understanding their behavioral consistency (whether th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

ETA-VLA: Efficient Token Adaptation via Temporal Fusion and Intra-LLM Sparsification for Vision-Language-Action Models

arXiv:2603.25766v1 Announce Type: cross Abstract: The integration of Vision-Language-Action (VLA) models into autonomous driving systems offers a unified framew

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

UCAgent: An End-to-End Agent for Block-Level Functional Verification

arXiv:2603.25768v1 Announce Type: cross Abstract: Functional verification remains a critical bottleneck in modern IC development cycles, accounting for approxim

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

IncreRTL: Traceability-Guided Incremental RTL Generation under Requirement Evolution

arXiv:2603.25769v1 Announce Type: cross Abstract: Large language models (LLMs) have shown promise in generating RTL code from natural-language descriptions, but

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

ReCUBE: Evaluating Repository-Level Context Utilization in Code Generation

arXiv:2603.25770v1 Announce Type: cross Abstract: Large Language Models (LLMs) have recently emerged as capable coding assistants that operate over large codeba

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Empowering Epidemic Response: The Role of Reinforcement Learning in Infectious Disease Control

arXiv:2603.25771v1 Announce Type: cross Abstract: Reinforcement learning (RL), owing to its adaptability to various dynamic systems in many real-world scenarios

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Beyond identifiability: Learning causal representations with few environments and finite samples

arXiv:2603.25796v1 Announce Type: cross Abstract: We provide explicit, finite-sample guarantees for learning causal representations from data with a sublinear n

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

MAGNET: Autonomous Expert Model Generation via Decentralized Autoresearch and BitNet Training

arXiv:2603.25813v1 Announce Type: cross Abstract: We present MAGNET (Model Autonomously Growing Network), a decentralized system for autonomous generation, trai

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

arXiv:2603.25823v1 Announce Type: cross Abstract: Beneath the stunning visual fidelity of modern AIGC models lies a "logical desert", where systems fail tasks t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

A Compression Perspective on Simplicity Bias

arXiv:2603.25839v1 Announce Type: cross Abstract: Deep neural networks exhibit a simplicity bias, a well-documented tendency to favor simple functions over comp

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

GazeQwen: Lightweight Gaze-Conditioned LLM Modulation for Streaming Video Understanding

arXiv:2603.25841v1 Announce Type: cross Abstract: Current multimodal large language models (MLLMs) cannot effectively utilize eye-gaze information for video und

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Why Safety Probes Catch Liars But Miss Fanatics

arXiv:2603.25861v1 Announce Type: cross Abstract: Activation-based probes have emerged as a promising approach for detecting deceptively aligned AI systems by i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

GUIDE: A Benchmark for Understanding and Assisting Users in Open-Ended GUI Tasks

arXiv:2603.25864v1 Announce Type: cross Abstract: Graphical User Interface (GUI) agents have the potential to assist users in interacting with complex software

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

On Integrating Resilience and Human Oversight into LLM-Assisted Modeling Workflows for Digital Twins

arXiv:2603.25898v1 Announce Type: cross Abstract: LLM-assisted modeling holds the potential to rapidly build executable Digital Twins of complex systems from on

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Good Scores, Bad Data: A Metric for Multimodal Coherence

arXiv:2603.25924v1 Announce Type: cross Abstract: Multimodal AI systems are evaluated by downstream task accuracy, but high accuracy does not mean the underlyin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

DiReCT: Disentangled Regularization of Contrastive Trajectories for Physics-Refined Video Generation

arXiv:2603.25931v1 Announce Type: cross Abstract: Flow-matching video generators produce temporally coherent, high-fidelity outputs yet routinely violate elemen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Reinforcing Structured Chain-of-Thought for Video Understanding

arXiv:2603.25942v1 Announce Type: cross Abstract: Multi-modal Large Language Models (MLLMs) show promise in video understanding. However, their reasoning often

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

When Chain-of-Thought Backfires: Evaluating Prompt Sensitivity in Medical Language Models

arXiv:2603.25960v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly deployed in medical settings, yet their sensitivity to prompt fo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Policy-Guided World Model Planning for Language-Conditioned Visual Navigation

arXiv:2603.25981v1 Announce Type: cross Abstract: Navigating to a visually specified goal given natural language instructions remains a fundamental challenge in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants

arXiv:2603.26008v1 Announce Type: cross Abstract: While powerful in image-conditioned generation, multimodal large language models (MLLMs) can display uneven pe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

H-Node Attack and Defense in Large Language Models

arXiv:2603.26045v1 Announce Type: cross Abstract: We present H-Node Adversarial Noise Cancellation (H-Node ANC), a mechanistic framework that identifies, exploi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

MuDD: A Multimodal Deception Detection Dataset and GSR-Guided Progressive Distillation for Non-Contact Deception Detection

arXiv:2603.26064v1 Announce Type: cross Abstract: Non-contact automatic deception detection remains challenging because visual and auditory deception cues often

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

When Identities Collapse: A Stress-Test Benchmark for Multi-Subject Personalization

arXiv:2603.26078v1 Announce Type: cross Abstract: Subject-driven text-to-image diffusion models have achieved remarkable success in preserving single identities

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Selective Deficits in LLM Mental Self-Modeling in a Behavior-Based Test of Theory of Mind

arXiv:2603.26089v1 Announce Type: cross Abstract: The ability to represent oneself and others as agents with knowledge, intentions, and belief states that guide

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning

arXiv:2603.26098v1 Announce Type: cross Abstract: While self-supervised learning (SSL) has revolutionized audio representation, the excessive parameterization a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

"Oops! ChatGPT is Temporarily Unavailable!": A Diary Study on Knowledge Workers' Experiences of LLM Withdrawal

arXiv:2603.26099v1 Announce Type: cross Abstract: LLMs have become deeply embedded in knowledge work, raising concerns about growing dependency and the potentia

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

SkinGPT-X: A Self-Evolving Collaborative Multi-Agent System for Transparent and Trustworthy Dermatological Diagnosis

arXiv:2603.26122v1 Announce Type: cross Abstract: While recent advancements in Large Language Models have significantly advanced dermatological diagnosis, monol

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Finding Distributed Object-Centric Properties in Self-Supervised Transformers

arXiv:2603.26127v1 Announce Type: cross Abstract: Self-supervised Vision Transformers (ViTs) like DINO show an emergent ability to discover objects, typically o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

SWE-PRBench: Benchmarking AI Code Review Quality Against Pull Request Feedback

arXiv:2603.26130v1 Announce Type: cross Abstract: We introduce SWE-PRBench, a benchmark of 350 pull requests with human-annotated ground truth for evaluating AI

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Sparse Auto-Encoders and Holism about Large Language Models

arXiv:2603.26207v1 Announce Type: cross Abstract: Does Large Language Model (LLM) technology suggest a meta-semantic picture i.e. a picture of how words and com

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Towards GUI Agents: Vision-Language Diffusion Models for GUI Grounding

arXiv:2603.26211v1 Announce Type: cross Abstract: Autoregressive (AR) vision-language models (VLMs) have long dominated multimodal understanding, reasoning, and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Clawed and Dangerous: Can We Trust Open Agentic Systems?

arXiv:2603.26221v1 Announce Type: cross Abstract: Open agentic systems combine LLM-based planning with external capabilities, persistent memory, and privileged

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Automating Domain-Driven Design: Experience with a Prompting Framework

arXiv:2603.26244v1 Announce Type: cross Abstract: Domain-driven design (DDD) is a powerful design technique for architecting complex software systems. This pape

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Physics-Informed Neural Networks and Sequence Encoder: Application to heating and early cooling of thermo-stamping process

arXiv:2603.26245v1 Announce Type: cross Abstract: In a previous work (Elaarabi et al., 2025b), the Sequence Encoder for online dynamical system identification (

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

ARTA: Adaptive Mixed-Resolution Token Allocation for Efficient Dense Feature Extraction

arXiv:2603.26258v1 Announce Type: cross Abstract: We present ARTA, a mixed-resolution coarse-to-fine vision transformer for efficient dense feature extraction.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Working Notes on Late Interaction Dynamics: Analyzing Targeted Behaviors of Late Interaction Models

arXiv:2603.26259v1 Announce Type: cross Abstract: While Late Interaction models exhibit strong retrieval performance, many of their underlying dynamics remain u

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Knowdit: Agentic Smart Contract Vulnerability Detection with Auditing Knowledge Summarization

arXiv:2603.26270v1 Announce Type: cross Abstract: Smart contracts govern billions of dollars in decentralized finance (DeFi), yet automated vulnerability detect

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

PhysVid: Physics Aware Local Conditioning for Generative Video Models

arXiv:2603.26285v1 Announce Type: cross Abstract: Generative video models achieve high visual fidelity but often violate basic physical principles, limiting rel

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Preference-Aligned LoRA Merging: Preserving Subspace Coverage and Addressing Directional Anisotropy

arXiv:2603.26299v1 Announce Type: cross Abstract: Merging multiple Low-Rank Adaptation (LoRA) modules is promising for constructing general-purpose systems, yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Label-Free Cross-Task LoRA Merging with Null-Space Compression

arXiv:2603.26317v1 Announce Type: cross Abstract: Model merging combines independently fine-tuned checkpoints without joint multi-task training. In the era of f

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

From Human Cognition to Neural Activations: Probing the Computational Primitives of Spatial Reasoning in LLMs

arXiv:2603.26323v1 Announce Type: cross Abstract: As spatial intelligence becomes an increasingly important capability for foundation models, it remains unclear

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

CALRK-Bench: Evaluating Context-Aware Legal Reasoning in Korean Law

arXiv:2603.26332v1 Announce Type: cross Abstract: Legal reasoning requires not only the application of legal rules but also an understanding of the context in w

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

UNIFERENCE: A Discrete Event Simulation Framework for Developing Distributed AI Models

arXiv:2603.26469v1 Announce Type: cross Abstract: Developing and evaluating distributed inference algorithms remains difficult due to the lack of standardized t