Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,662

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,439 Reads 5,223

Showing 5,223 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

EVA: Efficient Reinforcement Learning for End-to-End Video Agent

arXiv:2603.22918v1 Announce Type: cross Abstract: Video understanding with multimodal large language models (MLLMs) remains challenging due to the long token se

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The EU AI Act and the Rights-based Approach to Technological Governance

arXiv:2603.22920v1 Announce Type: cross Abstract: The EU AI Act constitutes an important development in shaping the Union's digital regulatory architecture. The

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Set-Valued Prediction for Large Language Models with Feasibility-Aware Coverage Guarantees

arXiv:2603.22966v1 Announce Type: cross Abstract: Large language models (LLMs) inherently operate over a large generation space, yet conventional usage typicall

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Can Graph Foundation Models Generalize Over Architecture?

arXiv:2603.22984v1 Announce Type: cross Abstract: Graph foundation models (GFMs) have recently attracted interest due to the promise of graph neural network (GN

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Parametric Knowledge and Retrieval Behavior in RAG Fine-Tuning for Electronic Design Automation

arXiv:2603.23047v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) fine-tuning has shown substantial improvements over vanilla RAG, yet most

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DBAutoDoc: Automated Discovery and Documentation of Undocumented Database Schemas via Statistical Analysis and Iterative LLM Refinement

arXiv:2603.23050v1 Announce Type: cross Abstract: A tremendous number of critical database systems lack adequate documentation. Declared primary keys are absent

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Mind Your HEARTBEAT! Claw Background Execution Inherently Enables Silent Memory Pollution

arXiv:2603.23064v1 Announce Type: cross Abstract: We identify a critical security vulnerability in mainstream Claw personal AI agents: untrusted content encount

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Can an LLM Detect Instances of Microservice Infrastructure Patterns?

arXiv:2603.23073v1 Announce Type: cross Abstract: Architectural patterns are frequently found in various software artifacts. The wide variety of patterns and th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy

arXiv:2603.23146v1 Announce Type: cross Abstract: The widespread adoption of Large Language Models (LLMs) has made the detection of AI-Generated text a pressing

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Robust Safety Monitoring of Language Models via Activation Watermarking

arXiv:2603.23171v1 Announce Type: cross Abstract: Large language models (LLMs) can be misused to reveal sensitive information, such as weapon-making instruction

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Reasoning over Semantic IDs Enhances Generative Recommendation

arXiv:2603.23183v1 Announce Type: cross Abstract: Recent advances in generative recommendation have leveraged pretrained LLMs by formulating sequential recommen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment

arXiv:2603.23184v1 Announce Type: cross Abstract: Reward modeling represents a long-standing challenge in reinforcement learning from human feedback (RLHF) for

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

General Machine Learning: Theory for Learning Under Variable Regimes

arXiv:2603.23220v1 Announce Type: cross Abstract: We study learning under regime variation, where the learner, its memory state, and the evaluative conditions m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Neural ODE and SDE Models for Adaptation and Planning in Model-Based Reinforcement Learning

arXiv:2603.23245v1 Announce Type: cross Abstract: We investigate neural ordinary and stochastic differential equations (neural ODEs and SDEs) to model stochasti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling

arXiv:2603.23249v1 Announce Type: cross Abstract: Efficient scheduling of directed acyclic graphs (DAGs) in heterogeneous environments is challenging due to res

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

AI Lifecycle-Aware Feasibility Framework for Split-RIC Orchestration in NTN O-RAN

arXiv:2603.23252v1 Announce Type: cross Abstract: Integrating Artificial Intelligence (AI) into Non-Terrestrial Networks (NTN) is constrained by the joint limit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

SafeSeek: Universal Attribution of Safety Circuits in Language Models

arXiv:2603.23268v1 Announce Type: cross Abstract: Mechanistic interpretability reveals that safety-critical behaviors (e.g., alignment, jailbreak, backdoor) in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs

arXiv:2603.23269v1 Announce Type: cross Abstract: Large Language Models(LLMs) are widely deployed, yet are vulnerable to jailbreak prompts that elicit policy-vi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Multimodal Framework for Human-Multi-Agent Interaction

arXiv:2603.23271v1 Announce Type: cross Abstract: Human-robot interaction is increasingly moving toward multi-robot, socially grounded environments. Existing sy

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Emergence of Fragility in LLM-based Social Networks: the Case of Moltbook

arXiv:2603.23279v1 Announce Type: cross Abstract: The rapid diffusion of large language models and the growth in their capability has enabled the emergence of o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Designing Agentic AI-Based Screening for Portfolio Investment

arXiv:2603.23300v1 Announce Type: cross Abstract: We introduce a new agentic artificial intelligence (AI) platform for portfolio management. Our architecture co

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression

arXiv:2603.23308v1 Announce Type: cross Abstract: Automated radiology report generation from 3D computed tomography (CT) volumes is challenging due to extreme s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Unilateral Relationship Revision Power in Human-AI Companion Interaction

arXiv:2603.23315v1 Announce Type: cross Abstract: When providers update AI companions, users report grief, betrayal, and loss. A growing literature asks whether

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Leveraging LLMs and Social Media to Understand User Perception of Smartphone-Based Earthquake Early Warnings

arXiv:2603.23322v1 Announce Type: cross Abstract: Android's Earthquake Alert (AEA) system provided timely early warnings to millions during the Mw 6.2 Marmara E

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Planning over MAPF Agent Dependencies via Multi-Dependency PIBT

arXiv:2603.23405v1 Announce Type: cross Abstract: Modern Multi-Agent Path Finding (MAPF) algorithms must plan for hundreds to thousands of agents in congested e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

arXiv:2603.23414v1 Announce Type: cross Abstract: Scaling reinforcement learning (RL) has shown strong promise for enhancing the reasoning abilities of large la

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Biased Error Attribution in Multi-Agent Human-AI Systems Under Delayed Feedback

arXiv:2603.23419v1 Announce Type: cross Abstract: Human decision-making is strongly influenced by cognitive biases, particularly under conditions of uncertainty

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Evaluating LLM-Based Test Generation Under Software Evolution

arXiv:2603.23443v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly used for automated unit test generation. However, it remains unc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

3DCity-LLM: Empowering Multi-modality Large Language Models for 3D City-scale Perception and Understanding

arXiv:2603.23447v1 Announce Type: cross Abstract: While multi-modality large language models excel in object-centric or indoor scenarios, scaling them to 3D cit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs

arXiv:2603.23481v1 Announce Type: cross Abstract: Video-Action Models (VAMs) have emerged as a promising framework for embodied intelligence, learning implicit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

ReqFusion: A Multi-Provider Framework for Automated PEGS Analysis Across Software Domains

arXiv:2603.23482v1 Announce Type: cross Abstract: Requirements engineering is a vital, yet labor-intensive, stage in the software development process. This arti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Failure of contextual invariance in gender inference with large language models

arXiv:2603.23485v1 Announce Type: cross Abstract: Standard evaluation practices assume that large language model (LLM) outputs are stable under contextually equ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions

arXiv:2603.23495v1 Announce Type: cross Abstract: Existing approaches for improving the efficiency of Large Vision-Language Models (LVLMs) are largely based on

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MedObvious: Exposing the Medical Moravec's Paradox in VLMs via Clinical Triage

arXiv:2603.23501v1 Announce Type: cross Abstract: Vision Language Models (VLMs) are increasingly used for tasks like medical report generation and visual questi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

An Accurate and Interpretable Framework for Trustworthy Process Monitoring

arXiv:2302.10426v3 Announce Type: replace Abstract: Trustworthy process monitoring seeks to build an accurate and interpretable monitoring framework, which is c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Towards Self-Evolving Benchmarks: Synthesizing Agent Trajectories via Test-Time Exploration under Validate-by-Reproduce Paradigm

arXiv:2510.00415v3 Announce Type: replace Abstract: Recent advances in large language models (LLMs) and agent system designs have empowered agents with unpreced

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions

arXiv:2510.05318v3 Announce Type: replace Abstract: Large language models (LLMs) have demonstrated remarkable performance on single-turn text-to-SQL tasks, but

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

BuilderBench: The Building Blocks of Intelligent Agents

arXiv:2510.06288v3 Announce Type: replace Abstract: Today's AI models learn primarily through mimicry and refining, so it is not surprising that they struggle t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Hybrid Stackelberg Game and Diffusion-based Auction for Two-tier Agentic AI Task Offloading in Internet of Agents

arXiv:2511.22076v2 Announce Type: replace Abstract: The Internet of Agents (IoA) is rapidly gaining prominence as a foundational architecture for interconnected

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DriveSafe: A Hierarchical Risk Taxonomy for Safety-Critical LLM-Based Driving Assistants

arXiv:2601.12138v3 Announce Type: replace Abstract: Large Language Models (LLMs) are increasingly integrated into vehicle-based digital assistants, where unsafe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents

arXiv:2602.02050v3 Announce Type: replace Abstract: Tool-using agents based on Large Language Models (LLMs) excel in tasks such as mathematical reasoning and mu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Obscure but Effective: Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search

arXiv:2602.22983v3 Announce Type: replace Abstract: As Large Language Models (LLMs) are increasingly used, their security risks have drawn increasing attention.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays

arXiv:2602.23276v2 Announce Type: replace Abstract: Chest X-ray plays a central role in thoracic diagnosis, and its interpretation inherently requires multi-ste

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Agentic AI-based Coverage Closure for Formal Verification

arXiv:2603.03147v2 Announce Type: replace Abstract: Coverage closure is a critical requirement in Integrated Chip (IC) development process and key metric for ve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Retrieval-Augmented Generation with Covariate Time Series

arXiv:2603.04951v2 Announce Type: replace Abstract: While RAG has greatly enhanced LLMs, extending this paradigm to Time-Series Foundation Models (TSFMs) remain

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Planning as Goal Recognition: Deriving Heuristics from Intention Models -- Extended Version

arXiv:2603.14824v2 Announce Type: replace Abstract: Classical planning aims to find a sequence of actions, a plan, that maps a starting state into one of the go

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Cascade-Aware Multi-Agent Routing: Spatio-Temporal Sidecars and Geometry-Switching

arXiv:2603.17112v2 Announce Type: replace Abstract: Advanced AI reasoning systems route tasks through dynamic execution graphs of specialized agents. We identif

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models

arXiv:2603.20670v2 Announce Type: replace Abstract: The rapid growth in the volume, variety, and velocity of geospatial data has created data ecosystems that ar