Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,932

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,459 Reads 5,473

Showing 5,473 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

FedFG: Privacy-Preserving and Robust Federated Learning via Flow-Matching Generation

arXiv:2603.27986v1 Announce Type: cross Abstract: Federated learning (FL) enables distributed clients to collaboratively train a global model using local privat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Beyond Dataset Distillation: Lossless Dataset Concentration via Diffusion-Assisted Distribution Alignment

arXiv:2603.27987v1 Announce Type: cross Abstract: The high cost and accessibility problem associated with large datasets hinder the development of large-scale v

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

ViviDoc: Generating Interactive Documents through Human-Agent Collaboration

arXiv:2603.27991v1 Announce Type: cross Abstract: Interactive documents help readers engage with complex ideas through dynamic visualization, interactive animat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Kill-Chain Canaries: Stage-Level Tracking of Prompt Injection Across Attack Surfaces and Model Safety Tiers

arXiv:2603.28013v1 Announce Type: cross Abstract: We present a stage-decomposed analysis of prompt injection attacks against five frontier LLM agents. Prior wor

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Synonymix: Unified Group Personas for Generative Simulations

arXiv:2603.28066v1 Announce Type: cross Abstract: Generative agent simulations operate at two scales: individual personas for character interaction, and populat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

MolmoPoint: Better Pointing for VLMs with Grounding Tokens

arXiv:2603.28069v1 Announce Type: cross Abstract: Grounding has become a fundamental capability of vision-language models (VLMs). Most existing VLMs point by ge

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions

arXiv:2603.28086v1 Announce Type: cross Abstract: Voice design from natural language aims to generate speaker timbres directly from free-form textual descriptio

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models

arXiv:2603.28103v1 Announce Type: cross Abstract: Parliamentary proceedings represent a rich yet challenging resource for computational analysis, particularly w

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Q-DIVER: Integrated Quantum Transfer Learning and Differentiable Quantum Architecture Search with EEG Data

arXiv:2603.28122v1 Announce Type: cross Abstract: Integrating quantum circuits into deep learning pipelines remains challenging due to heuristic design limitati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Does Claude's Constitution Have a Culture?

arXiv:2603.28123v1 Announce Type: cross Abstract: Constitutional AI (CAI) aligns language models with explicitly stated normative principles, offering a transpa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

arXiv:2603.28130v1 Announce Type: cross Abstract: We introduce Multilingual Document Parsing Benchmark, the first benchmark for multilingual digital and photogr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

RecycleLoRA: Rank-Revealing QR-Based Dual-LoRA Subspace Adaptation for Domain Generalized Semantic Segmentation

arXiv:2603.28142v1 Announce Type: cross Abstract: Domain Generalized Semantic Segmentation (DGSS) aims to maintain robust performance across unseen target domai

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Evaluating Privilege Usage of Agents on Real-World Tools

arXiv:2603.28166v1 Announce Type: cross Abstract: Equipping LLM agents with real-world tools can substantially improve productivity. However, granting agents au

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

ERPO: Token-Level Entropy-Regulated Policy Optimization for Large Reasoning Models

arXiv:2603.28204v1 Announce Type: cross Abstract: Reinforcement learning from verifiable rewards (RLVR) has significantly advanced the reasoning capabilities of

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

DiffAttn: Diffusion-Based Drivers' Visual Attention Prediction with LLM-Enhanced Semantic Reasoning

arXiv:2603.28251v1 Announce Type: cross Abstract: Drivers' visual attention provides critical cues for anticipating latent hazards and directly shapes decision-

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries

arXiv:2603.28258v1 Announce Type: cross Abstract: Categorical perception (CP) -- enhanced discriminability at category boundaries -- is among the most studied p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights

arXiv:2603.28263v1 Announce Type: cross Abstract: Large Language Models (LLMs) remain heavily centered on English, with limited performance in low-resource lang

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Pre-Deployment Complexity Estimation for Federated Perception Systems

arXiv:2603.28282v1 Announce Type: cross Abstract: Edge AI systems increasingly rely on federated learning to train perception models in distributed, privacy-pre

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

FI-KAN: Fractal Interpolation Kolmogorov-Arnold Networks

arXiv:2603.28288v1 Announce Type: cross Abstract: Kolmogorov-Arnold Networks (KAN) employ B-spline bases on a fixed grid, providing no intrinsic multi-scale dec

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

NeiGAD: Augmenting Graph Anomaly Detection via Spectral Neighbor Information

arXiv:2603.28300v1 Announce Type: cross Abstract: Graph anomaly detection (GAD) aims to identify irregular nodes or structures in attributed graphs. Neighbor in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Building evidence-based knowledge graphs from full-text literature for disease-specific biomedical reasoning

arXiv:2603.28325v1 Announce Type: cross Abstract: Biomedical knowledge resources often either preserve evidence as unstructured text or compress it into flat tr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Integrating Multimodal Large Language Model Knowledge into Amodal Completion

arXiv:2603.28333v1 Announce Type: cross Abstract: With the widespread adoption of autonomous vehicles and robotics, amodal completion, which reconstructs the oc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Crossing the NL/PL Divide: Information Flow Analysis Across the NL/PL Boundary in LLM-Integrated Code

arXiv:2603.28345v1 Announce Type: cross Abstract: LLM API calls are becoming a ubiquitous program construct, yet they create a boundary that no existing program

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Coherent Without Grounding, Grounded Without Success: Observability and Epistemic Failure

arXiv:2603.28371v1 Announce Type: cross Abstract: When an agent can articulate why something works, we typically take this as evidence of genuine understanding.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Membership Inference Attacks against Large Audio Language Models

arXiv:2603.28378v1 Announce Type: cross Abstract: We present the first systematic Membership Inference Attack (MIA) evaluation of Large Audio Language Models (L

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Critic-Free Deep Reinforcement Learning for Maritime Coverage Path Planning on Irregular Hexagonal Grids

arXiv:2603.28385v1 Announce Type: cross Abstract: Maritime surveillance missions, such as search and rescue and environmental monitoring, rely on the efficient

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation

arXiv:2603.28405v1 Announce Type: cross Abstract: Diffusion Transformers (DiT) have established a new state-of-the-art in high-fidelity image synthesis; however

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Evolutionary Discovery of Reinforcement Learning Algorithms via Large Language Models

arXiv:2603.28416v1 Announce Type: cross Abstract: Reinforcement learning algorithms are defined by their learning update rules, which are typically hand-designe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Spectral Higher-Order Neural Networks

arXiv:2603.28420v1 Announce Type: cross Abstract: Neural networks are fundamental tools of modern machine learning. The standard paradigm assumes binary interac

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

FeDMRA: Federated Incremental Learning with Dynamic Memory Replay Allocation

arXiv:2603.28455v1 Announce Type: cross Abstract: In federated healthcare systems, Federated Class-Incremental Learning (FCIL) has emerged as a key paradigm, en

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention

arXiv:2603.28458v1 Announce Type: cross Abstract: Token-level sparse attention mechanisms, exemplified by DeepSeek Sparse Attention (DSA), achieve fine-grained

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Courtroom-Style Multi-Agent Debate with Progressive RAG and Role-Switching for Controversial Claim Verification

arXiv:2603.28488v1 Announce Type: cross Abstract: Large language models (LLMs) remain unreliable for high-stakes claim verification due to hallucinations and sh

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Next-Token Prediction and Regret Minimization

arXiv:2603.28499v1 Announce Type: cross Abstract: We consider the question of how to employ next-token prediction algorithms in adversarial online decision-maki

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

The Unreasonable Effectiveness of Scaling Laws in AI

arXiv:2603.28507v1 Announce Type: cross Abstract: Classical AI scaling laws, especially for pre-training, describe how training loss decreases with compute in a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Hydra: Unifying Document Retrieval and Generation in a Single Vision-Language Model

arXiv:2603.28554v1 Announce Type: cross Abstract: Visual document understanding typically requires separate retrieval and generation models, doubling memory and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Domain-Invariant Prompt Learning for Vision-Language Models

arXiv:2603.28555v1 Announce Type: cross Abstract: Large pre-trained vision-language models like CLIP have transformed computer vision by aligning images and tex

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Fine-Tuning Large Language Models for Cooperative Tactical Deconfliction of Small Unmanned Aerial Systems

arXiv:2603.28561v1 Announce Type: cross Abstract: The growing deployment of small Unmanned Aerial Systems (sUASs) in low-altitude airspaces has increased the ne

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

CirrusBench: Evaluating LLM-based Agents Beyond Correctness in Real-World Cloud Service Environments

arXiv:2603.28569v1 Announce Type: cross Abstract: The increasing agentic capabilities of Large Language Models (LLMs) have enabled their deployment in real-worl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Learning Partial Action Replacement in Offline MARL

arXiv:2603.28573v1 Announce Type: cross Abstract: Offline multi-agent reinforcement learning (MARL) faces a critical challenge: the joint action space grows exp

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

ChemCLIP: Bridging Organic and Inorganic Anticancer Compounds Through Contrastive Learning

arXiv:2603.28575v1 Announce Type: cross Abstract: The discovery of anticancer therapeutics has traditionally treated organic small molecules and metal-based coo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Moving Beyond Review: Applying Language Models to Planning and Translation in Reflection

arXiv:2603.28596v1 Announce Type: cross Abstract: Reflective writing is known to support the development of students' metacognitive skills, yet learners often s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning

arXiv:2603.28610v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) achieve stronger visual understanding by scaling input fidelity, yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Trust-Aware Routing for Distributed Generative AI Inference at the Edge

arXiv:2603.28622v1 Announce Type: cross Abstract: Emerging deployments of Generative AI increasingly execute inference across decentralized and heterogeneous ed

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

AMIGO: Agentic Multi-Image Grounding Oracle Benchmark

arXiv:2603.28662v1 Announce Type: cross Abstract: Agentic vision-language models increasingly act through extended interactions, but most evaluations still focu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding

arXiv:2603.28696v1 Announce Type: cross Abstract: Long video understanding remains challenging for Multi-modal Large Language Models (MLLMs) due to high memory

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Stepwise Credit Assignment for GRPO on Flow-Matching Models

arXiv:2603.28718v1 Announce Type: cross Abstract: Flow-GRPO successfully applies reinforcement learning to flow models, but uses uniform credit assignment acros

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining

arXiv:2603.28737v1 Announce Type: cross Abstract: We introduce ParaSpeechCLAP, a dual-encoder contrastive model that maps speech and text style captions into a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers

arXiv:2603.28762v1 Announce Type: cross Abstract: Modern Text-to-Image (T2I) diffusion models have achieved remarkable semantic alignment, yet they often suffer