Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,662
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,223 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
EVA: Efficient Reinforcement Learning for End-to-End Video Agent
arXiv:2603.22918v1 Announce Type: cross Abstract: Video understanding with multimodal large language models (MLLMs) remains challenging due to the long token se
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
The EU AI Act and the Rights-based Approach to Technological Governance
arXiv:2603.22920v1 Announce Type: cross Abstract: The EU AI Act constitutes an important development in shaping the Union's digital regulatory architecture. The
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Set-Valued Prediction for Large Language Models with Feasibility-Aware Coverage Guarantees
arXiv:2603.22966v1 Announce Type: cross Abstract: Large language models (LLMs) inherently operate over a large generation space, yet conventional usage typicall
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Can Graph Foundation Models Generalize Over Architecture?
arXiv:2603.22984v1 Announce Type: cross Abstract: Graph foundation models (GFMs) have recently attracted interest due to the promise of graph neural network (GN
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Parametric Knowledge and Retrieval Behavior in RAG Fine-Tuning for Electronic Design Automation
arXiv:2603.23047v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) fine-tuning has shown substantial improvements over vanilla RAG, yet most
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
DBAutoDoc: Automated Discovery and Documentation of Undocumented Database Schemas via Statistical Analysis and Iterative LLM Refinement
arXiv:2603.23050v1 Announce Type: cross Abstract: A tremendous number of critical database systems lack adequate documentation. Declared primary keys are absent
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Mind Your HEARTBEAT! Claw Background Execution Inherently Enables Silent Memory Pollution
arXiv:2603.23064v1 Announce Type: cross Abstract: We identify a critical security vulnerability in mainstream Claw personal AI agents: untrusted content encount
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Can an LLM Detect Instances of Microservice Infrastructure Patterns?
arXiv:2603.23073v1 Announce Type: cross Abstract: Architectural patterns are frequently found in various software artifacts. The wide variety of patterns and th
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy
arXiv:2603.23146v1 Announce Type: cross Abstract: The widespread adoption of Large Language Models (LLMs) has made the detection of AI-Generated text a pressing
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Robust Safety Monitoring of Language Models via Activation Watermarking
arXiv:2603.23171v1 Announce Type: cross Abstract: Large language models (LLMs) can be misused to reveal sensitive information, such as weapon-making instruction
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Reasoning over Semantic IDs Enhances Generative Recommendation
arXiv:2603.23183v1 Announce Type: cross Abstract: Recent advances in generative recommendation have leveraged pretrained LLMs by formulating sequential recommen
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment
arXiv:2603.23184v1 Announce Type: cross Abstract: Reward modeling represents a long-standing challenge in reinforcement learning from human feedback (RLHF) for
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
General Machine Learning: Theory for Learning Under Variable Regimes
arXiv:2603.23220v1 Announce Type: cross Abstract: We study learning under regime variation, where the learner, its memory state, and the evaluative conditions m
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Neural ODE and SDE Models for Adaptation and Planning in Model-Based Reinforcement Learning
arXiv:2603.23245v1 Announce Type: cross Abstract: We investigate neural ordinary and stochastic differential equations (neural ODEs and SDEs) to model stochasti
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling
arXiv:2603.23249v1 Announce Type: cross Abstract: Efficient scheduling of directed acyclic graphs (DAGs) in heterogeneous environments is challenging due to res
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
AI Lifecycle-Aware Feasibility Framework for Split-RIC Orchestration in NTN O-RAN
arXiv:2603.23252v1 Announce Type: cross Abstract: Integrating Artificial Intelligence (AI) into Non-Terrestrial Networks (NTN) is constrained by the joint limit
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
SafeSeek: Universal Attribution of Safety Circuits in Language Models
arXiv:2603.23268v1 Announce Type: cross Abstract: Mechanistic interpretability reveals that safety-critical behaviors (e.g., alignment, jailbreak, backdoor) in
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs
arXiv:2603.23269v1 Announce Type: cross Abstract: Large Language Models(LLMs) are widely deployed, yet are vulnerable to jailbreak prompts that elicit policy-vi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
A Multimodal Framework for Human-Multi-Agent Interaction
arXiv:2603.23271v1 Announce Type: cross Abstract: Human-robot interaction is increasingly moving toward multi-robot, socially grounded environments. Existing sy
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Emergence of Fragility in LLM-based Social Networks: the Case of Moltbook
arXiv:2603.23279v1 Announce Type: cross Abstract: The rapid diffusion of large language models and the growth in their capability has enabled the emergence of o
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Designing Agentic AI-Based Screening for Portfolio Investment
arXiv:2603.23300v1 Announce Type: cross Abstract: We introduce a new agentic artificial intelligence (AI) platform for portfolio management. Our architecture co
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression
arXiv:2603.23308v1 Announce Type: cross Abstract: Automated radiology report generation from 3D computed tomography (CT) volumes is challenging due to extreme s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Unilateral Relationship Revision Power in Human-AI Companion Interaction
arXiv:2603.23315v1 Announce Type: cross Abstract: When providers update AI companions, users report grief, betrayal, and loss. A growing literature asks whether
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Leveraging LLMs and Social Media to Understand User Perception of Smartphone-Based Earthquake Early Warnings
arXiv:2603.23322v1 Announce Type: cross Abstract: Android's Earthquake Alert (AEA) system provided timely early warnings to millions during the Mw 6.2 Marmara E
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Planning over MAPF Agent Dependencies via Multi-Dependency PIBT
arXiv:2603.23405v1 Announce Type: cross Abstract: Modern Multi-Agent Path Finding (MAPF) algorithms must plan for hundreds to thousands of agents in congested e
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling
arXiv:2603.23414v1 Announce Type: cross Abstract: Scaling reinforcement learning (RL) has shown strong promise for enhancing the reasoning abilities of large la
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Biased Error Attribution in Multi-Agent Human-AI Systems Under Delayed Feedback
arXiv:2603.23419v1 Announce Type: cross Abstract: Human decision-making is strongly influenced by cognitive biases, particularly under conditions of uncertainty
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Evaluating LLM-Based Test Generation Under Software Evolution
arXiv:2603.23443v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly used for automated unit test generation. However, it remains unc
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
3DCity-LLM: Empowering Multi-modality Large Language Models for 3D City-scale Perception and Understanding
arXiv:2603.23447v1 Announce Type: cross Abstract: While multi-modality large language models excel in object-centric or indoor scenarios, scaling them to 3D cit
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs
arXiv:2603.23481v1 Announce Type: cross Abstract: Video-Action Models (VAMs) have emerged as a promising framework for embodied intelligence, learning implicit
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
ReqFusion: A Multi-Provider Framework for Automated PEGS Analysis Across Software Domains
arXiv:2603.23482v1 Announce Type: cross Abstract: Requirements engineering is a vital, yet labor-intensive, stage in the software development process. This arti
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Failure of contextual invariance in gender inference with large language models
arXiv:2603.23485v1 Announce Type: cross Abstract: Standard evaluation practices assume that large language model (LLM) outputs are stable under contextually equ
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions
arXiv:2603.23495v1 Announce Type: cross Abstract: Existing approaches for improving the efficiency of Large Vision-Language Models (LVLMs) are largely based on
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
MedObvious: Exposing the Medical Moravec's Paradox in VLMs via Clinical Triage
arXiv:2603.23501v1 Announce Type: cross Abstract: Vision Language Models (VLMs) are increasingly used for tasks like medical report generation and visual questi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
An Accurate and Interpretable Framework for Trustworthy Process Monitoring
arXiv:2302.10426v3 Announce Type: replace Abstract: Trustworthy process monitoring seeks to build an accurate and interpretable monitoring framework, which is c
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Towards Self-Evolving Benchmarks: Synthesizing Agent Trajectories via Test-Time Exploration under Validate-by-Reproduce Paradigm
arXiv:2510.00415v3 Announce Type: replace Abstract: Recent advances in large language models (LLMs) and agent system designs have empowered agents with unpreced
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions
arXiv:2510.05318v3 Announce Type: replace Abstract: Large language models (LLMs) have demonstrated remarkable performance on single-turn text-to-SQL tasks, but
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
BuilderBench: The Building Blocks of Intelligent Agents
arXiv:2510.06288v3 Announce Type: replace Abstract: Today's AI models learn primarily through mimicry and refining, so it is not surprising that they struggle t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Hybrid Stackelberg Game and Diffusion-based Auction for Two-tier Agentic AI Task Offloading in Internet of Agents
arXiv:2511.22076v2 Announce Type: replace Abstract: The Internet of Agents (IoA) is rapidly gaining prominence as a foundational architecture for interconnected
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
DriveSafe: A Hierarchical Risk Taxonomy for Safety-Critical LLM-Based Driving Assistants
arXiv:2601.12138v3 Announce Type: replace Abstract: Large Language Models (LLMs) are increasingly integrated into vehicle-based digital assistants, where unsafe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents
arXiv:2602.02050v3 Announce Type: replace Abstract: Tool-using agents based on Large Language Models (LLMs) excel in tasks such as mathematical reasoning and mu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Obscure but Effective: Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search
arXiv:2602.22983v3 Announce Type: replace Abstract: As Large Language Models (LLMs) are increasingly used, their security risks have drawn increasing attention.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays
arXiv:2602.23276v2 Announce Type: replace Abstract: Chest X-ray plays a central role in thoracic diagnosis, and its interpretation inherently requires multi-ste
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Agentic AI-based Coverage Closure for Formal Verification
arXiv:2603.03147v2 Announce Type: replace Abstract: Coverage closure is a critical requirement in Integrated Chip (IC) development process and key metric for ve
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Retrieval-Augmented Generation with Covariate Time Series
arXiv:2603.04951v2 Announce Type: replace Abstract: While RAG has greatly enhanced LLMs, extending this paradigm to Time-Series Foundation Models (TSFMs) remain
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Planning as Goal Recognition: Deriving Heuristics from Intention Models -- Extended Version
arXiv:2603.14824v2 Announce Type: replace Abstract: Classical planning aims to find a sequence of actions, a plan, that maps a starting state into one of the go
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Cascade-Aware Multi-Agent Routing: Spatio-Temporal Sidecars and Geometry-Switching
arXiv:2603.17112v2 Announce Type: replace Abstract: Advanced AI reasoning systems route tasks through dynamic execution graphs of specialized agents. We identif
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models
arXiv:2603.20670v2 Announce Type: replace Abstract: The rapid growth in the volume, variety, and velocity of geospatial data has created data ecosystems that ar