Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,662

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,439 Reads 5,223

Showing 5,223 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Graphs RAG at Scale: Beyond Retrieval-Augmented Generation With Labeled Property Graphs and Resource Description Framework for Complex and Unknown Search Spaces

arXiv:2603.22340v1 Announce Type: cross Abstract: Recent advances in Retrieval-Augmented Generation (RAG) have revolutionized knowledge-intensive tasks, yet tra

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

arXiv:2603.22341v1 Announce Type: cross Abstract: While prior red-teaming efforts have focused on eliciting harmful text outputs from large language models (LLM

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement

arXiv:2603.22352v1 Announce Type: cross Abstract: Recent progress in reinforcement learning with verifiable rewards (RLVR) offers a practical path to self-impro

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Early Discoveries of Algorithmist I: Promise of Provable Algorithm Synthesis at Scale

arXiv:2603.22363v1 Announce Type: cross Abstract: Designing algorithms with provable guarantees that also work well in practice remains difficult, requiring bot

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MCLR: Improving Conditional Modeling in Visual Generative Models via Inter-Class Likelihood-Ratio Maximization and Establishing the Equivalence between Classifier-Free Guidance and Alignment Objectives

arXiv:2603.22364v1 Announce Type: cross Abstract: Diffusion models have achieved state-of-the-art performance in generative modeling, but their success often re

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Q-AGNN: Quantum-Enhanced Attentive Graph Neural Network for Intrusion Detection

arXiv:2603.22365v1 Announce Type: cross Abstract: With the rapid growth of interconnected devices, accurately detecting malicious activities in network traffic

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Modeling Quantum Federated Autoencoder for Anomaly Detection in IoT Networks

arXiv:2603.22366v1 Announce Type: cross Abstract: We propose a Quantum Federated Autoencoder for Anomaly Detection, a framework that leverages quantum federated

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window

arXiv:2603.22367v1 Announce Type: cross Abstract: Large Language Models (LLMs) deployed as autonomous agents commonly use Retrieval-Augmented Generation (RAG),

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

FAAR: Format-Aware Adaptive Rounding for NVFP4

arXiv:2603.22370v1 Announce Type: cross Abstract: Deploying large language models (LLMs) on edge devices requires extremely low-bit quantization. Ultra-low prec

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Three Creates All: You Only Sample 3 Steps

arXiv:2603.22375v1 Announce Type: cross Abstract: Diffusion models deliver high-fidelity generation but remain slow at inference time due to many sequential net

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Agents with Cloud Computing Access

arXiv:2603.22376v1 Announce Type: cross Abstract: Recent advances in AI agents for software engineering and scientific discovery have demonstrated remarkable ca

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Instruction-Tuned, but Not More Verifiable Instruction-Following: A Cross-Task Diagnosis for LoRA Adapters

arXiv:2603.22379v1 Announce Type: cross Abstract: Adapters are often selected and deployed based on nominal labels (e.g., instruction-tuned), which implicitly s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Learning When to Act: Interval-Aware Reinforcement Learning with Predictive Temporal Structure

arXiv:2603.22384v1 Announce Type: cross Abstract: Autonomous agents operating in continuous environments must decide not only what to do, but when to act. We in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Latent Style-based Quantum Wasserstein GAN for Drug Design

arXiv:2603.22399v1 Announce Type: cross Abstract: The development of new drugs is a tedious, time-consuming, and expensive process, for which the average costs

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CaP-X: A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation

arXiv:2603.22435v1 Announce Type: cross Abstract: "Code-as-Policy" considers how executable code can complement data-intensive Vision-Language-Action (VLA) meth

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

arXiv:2603.22446v1 Announce Type: cross Abstract: Reinforcement learning with verifiable rewards (RLVR) has significantly improved reasoning in large language m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

LLM-guided headline rewriting for clickability enhancement without clickbait

arXiv:2603.22459v1 Announce Type: cross Abstract: Enhancing reader engagement while preserving informational fidelity is a central challenge in controllable tex

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Stability-Preserving Online Adaptation of Neural Closed-loop Maps

arXiv:2603.22469v1 Announce Type: cross Abstract: The growing complexity of modern control tasks calls for controllers that can react online as objectives and d

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures

arXiv:2603.22473v1 Announce Type: cross Abstract: Hybrid language models combining attention with state space models (SSMs) or linear attention offer improved e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Cognitive Training for Language Models: Towards General Capabilities via Cross-Entropy Games

arXiv:2603.22479v1 Announce Type: cross Abstract: Defining a constructive process to build general capabilities for language models in an automatic manner is co

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Tiny Inference-Time Scaling with Latent Verifiers

arXiv:2603.22492v1 Announce Type: cross Abstract: Inference-time scaling has emerged as an effective way to improve generative models at test time by using a ve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Do Large Language Models Reduce Research Novelty? Evidence from Information Systems Journals

arXiv:2603.22510v1 Announce Type: cross Abstract: Large language models such as ChatGPT have increased scholarly output, but whether this productivity boost pro

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface

arXiv:2603.22519v1 Announce Type: cross Abstract: Textual Large Language Models (LLMs) provide a simple and familiar interface: a string of text is used for bot

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

GraphRAG for Engineering Diagrams: ChatP&ID Enables LLM Interaction with P&IDs

arXiv:2603.22528v1 Announce Type: cross Abstract: Large Language Models (LLMs) combined with Retrieval-Augmented Generation (RAG) and knowledge graphs offer new

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos

arXiv:2603.22529v1 Announce Type: cross Abstract: Multimodal AI agents are increasingly automating complex real-world workflows that involve online web executio

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving

arXiv:2603.22577v1 Announce Type: cross Abstract: Large Language Models (LLMs) have demonstrated potential in code generation, yet they struggle with the multi-

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?

arXiv:2603.22582v1 Announce Type: cross Abstract: Chain-of-thought (CoT) reasoning has been proposed as a transparency mechanism for large language models in sa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

flexvec: SQL Vector Retrieval with Programmatic Embedding Modulation

arXiv:2603.22587v1 Announce Type: cross Abstract: As AI agents become the primary consumers of retrieval APIs, there is an opportunity to expose more of the ret

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Language Models Can Explain Visual Features via Steering

arXiv:2603.22593v1 Announce Type: cross Abstract: Sparse Autoencoders uncover thousands of features in vision models, yet explaining these features without requ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Do Consumers Accept AIs as Moral Compliance Agents?

arXiv:2603.22617v1 Announce Type: cross Abstract: Consumers are generally resistant to Artificial Intelligence (AI) involvement in moral decision-making, percei

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

To Agree or To Be Right? The Grounding-Sycophancy Tradeoff in Medical Vision-Language Models

arXiv:2603.22623v1 Announce Type: cross Abstract: Vision-language models (VLMs) adapted to the medical domain have shown strong performance on visual question a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Learning to Trust: How Humans Mentally Recalibrate AI Confidence Signals

arXiv:2603.22634v1 Announce Type: cross Abstract: Productive human-AI collaboration requires appropriate reliance, yet contemporary AI systems are often miscali

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

AwesomeLit: Towards Hypothesis Generation with Agent-Supported Literature Research

arXiv:2603.22648v1 Announce Type: cross Abstract: There are different goals for literature research, from understanding an unfamiliar topic to generate hypothes

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

PopResume: Causal Fairness Evaluation of LLM/VLM Resume Screeners with Population-Representative Dataset

arXiv:2603.22714v1 Announce Type: cross Abstract: We present PopResume, a population-representative resume dataset for causal fairness auditing of LLM- and VLM-

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

KALAVAI: Predicting When Independent Specialist Fusion Works -- A Quantitative Model for Post-Hoc Cooperative LLM Training

arXiv:2603.22755v1 Announce Type: cross Abstract: Independently trained domain specialists can be fused post-hoc into a single model that outperforms any indivi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DALDALL: Data Augmentation for Lexical and Semantic Diverse in Legal Domain by leveraging LLM-Persona

arXiv:2603.22765v1 Announce Type: cross Abstract: Data scarcity remains a persistent challenge in low-resource domains. While existing data augmentation methods

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Overload to Convergence: Supporting Multi-Issue Human-AI Negotiation with Bayesian Visualization

arXiv:2603.22766v1 Announce Type: cross Abstract: As AI systems increasingly mediate negotiations, understanding how the number of negotiated issues impacts hum

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Arithmetic to Logic: The Resilience of Logic and Lookup-Based Neural Networks Under Parameter Bit-Flips

arXiv:2603.22770v1 Announce Type: cross Abstract: The deployment of deep neural networks (DNNs) in safety-critical edge environments necessitates robustness aga

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

KARMA: Knowledge-Action Regularized Multimodal Alignment for Personalized Search at Taobao

arXiv:2603.22779v1 Announce Type: cross Abstract: Large Language Models (LLMs) are equipped with profound semantic knowledge, making them a natural choice for i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding

arXiv:2603.22796v1 Announce Type: cross Abstract: Embodied agents for creative tasks like photography must bridge the semantic gap between high-level language c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

When AI Shows Its Work, Is It Actually Working? Step-Level Evaluation Reveals Frontier Language Models Frequently Bypass Their Own Reasoning

arXiv:2603.22816v1 Announce Type: cross Abstract: Language models increasingly "show their work" by writing step-by-step reasoning before answering. But are the

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

URA-Net: Uncertainty-Integrated Anomaly Perception and Restoration Attention Network for Unsupervised Anomaly Detection

arXiv:2603.22840v1 Announce Type: cross Abstract: Unsupervised anomaly detection plays a pivotal role in industrial defect inspection and medical image analysis

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Agent Audit: A Security Analysis System for LLM Agent Applications

arXiv:2603.22853v1 Announce Type: cross Abstract: What should a developer inspect before deploying an LLM agent: the model, the tool code, the deployment config

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Coordinate System Problem in Persistent Structural Memory for Neural Architectures

arXiv:2603.22858v1 Announce Type: cross Abstract: We introduce the Dual-View Pheromone Pathway Network (DPPN), an architecture that routes sparse attention thro

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Agent-Sentry: Bounding LLM Agents via Execution Provenance

arXiv:2603.22868v1 Announce Type: cross Abstract: Agentic computing systems, which autonomously spawn new functionalities based on natural language instructions

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models

arXiv:2603.22876v1 Announce Type: cross Abstract: Learning a generalist control policy for dexterous manipulation typically relies on large-scale datasets. Give

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling

arXiv:2603.22911v1 Announce Type: cross Abstract: Due to the great saving of computation and memory overhead, token compression has become a research hot-spot f

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From the AI Act to a European AI Agency: Completing the Union's Regulatory Architecture

arXiv:2603.22912v1 Announce Type: cross Abstract: As artificial intelligence (AI) technologies continue to advance, effective risk assessment, regulation, and o