Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,473 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
FedFG: Privacy-Preserving and Robust Federated Learning via Flow-Matching Generation
arXiv:2603.27986v1 Announce Type: cross Abstract: Federated learning (FL) enables distributed clients to collaboratively train a global model using local privat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Beyond Dataset Distillation: Lossless Dataset Concentration via Diffusion-Assisted Distribution Alignment
arXiv:2603.27987v1 Announce Type: cross Abstract: The high cost and accessibility problem associated with large datasets hinder the development of large-scale v
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
ViviDoc: Generating Interactive Documents through Human-Agent Collaboration
arXiv:2603.27991v1 Announce Type: cross Abstract: Interactive documents help readers engage with complex ideas through dynamic visualization, interactive animat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Kill-Chain Canaries: Stage-Level Tracking of Prompt Injection Across Attack Surfaces and Model Safety Tiers
arXiv:2603.28013v1 Announce Type: cross Abstract: We present a stage-decomposed analysis of prompt injection attacks against five frontier LLM agents. Prior wor
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Synonymix: Unified Group Personas for Generative Simulations
arXiv:2603.28066v1 Announce Type: cross Abstract: Generative agent simulations operate at two scales: individual personas for character interaction, and populat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
MolmoPoint: Better Pointing for VLMs with Grounding Tokens
arXiv:2603.28069v1 Announce Type: cross Abstract: Grounding has become a fundamental capability of vision-language models (VLMs). Most existing VLMs point by ge
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions
arXiv:2603.28086v1 Announce Type: cross Abstract: Voice design from natural language aims to generate speaker timbres directly from free-form textual descriptio
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models
arXiv:2603.28103v1 Announce Type: cross Abstract: Parliamentary proceedings represent a rich yet challenging resource for computational analysis, particularly w
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Q-DIVER: Integrated Quantum Transfer Learning and Differentiable Quantum Architecture Search with EEG Data
arXiv:2603.28122v1 Announce Type: cross Abstract: Integrating quantum circuits into deep learning pipelines remains challenging due to heuristic design limitati
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Does Claude's Constitution Have a Culture?
arXiv:2603.28123v1 Announce Type: cross Abstract: Constitutional AI (CAI) aligns language models with explicitly stated normative principles, offering a transpa
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios
arXiv:2603.28130v1 Announce Type: cross Abstract: We introduce Multilingual Document Parsing Benchmark, the first benchmark for multilingual digital and photogr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
RecycleLoRA: Rank-Revealing QR-Based Dual-LoRA Subspace Adaptation for Domain Generalized Semantic Segmentation
arXiv:2603.28142v1 Announce Type: cross Abstract: Domain Generalized Semantic Segmentation (DGSS) aims to maintain robust performance across unseen target domai
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Evaluating Privilege Usage of Agents on Real-World Tools
arXiv:2603.28166v1 Announce Type: cross Abstract: Equipping LLM agents with real-world tools can substantially improve productivity. However, granting agents au
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
ERPO: Token-Level Entropy-Regulated Policy Optimization for Large Reasoning Models
arXiv:2603.28204v1 Announce Type: cross Abstract: Reinforcement learning from verifiable rewards (RLVR) has significantly advanced the reasoning capabilities of
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
DiffAttn: Diffusion-Based Drivers' Visual Attention Prediction with LLM-Enhanced Semantic Reasoning
arXiv:2603.28251v1 Announce Type: cross Abstract: Drivers' visual attention provides critical cues for anticipating latent hazards and directly shapes decision-
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries
arXiv:2603.28258v1 Announce Type: cross Abstract: Categorical perception (CP) -- enhanced discriminability at category boundaries -- is among the most studied p
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights
arXiv:2603.28263v1 Announce Type: cross Abstract: Large Language Models (LLMs) remain heavily centered on English, with limited performance in low-resource lang
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Pre-Deployment Complexity Estimation for Federated Perception Systems
arXiv:2603.28282v1 Announce Type: cross Abstract: Edge AI systems increasingly rely on federated learning to train perception models in distributed, privacy-pre
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
FI-KAN: Fractal Interpolation Kolmogorov-Arnold Networks
arXiv:2603.28288v1 Announce Type: cross Abstract: Kolmogorov-Arnold Networks (KAN) employ B-spline bases on a fixed grid, providing no intrinsic multi-scale dec
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
NeiGAD: Augmenting Graph Anomaly Detection via Spectral Neighbor Information
arXiv:2603.28300v1 Announce Type: cross Abstract: Graph anomaly detection (GAD) aims to identify irregular nodes or structures in attributed graphs. Neighbor in
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Building evidence-based knowledge graphs from full-text literature for disease-specific biomedical reasoning
arXiv:2603.28325v1 Announce Type: cross Abstract: Biomedical knowledge resources often either preserve evidence as unstructured text or compress it into flat tr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Integrating Multimodal Large Language Model Knowledge into Amodal Completion
arXiv:2603.28333v1 Announce Type: cross Abstract: With the widespread adoption of autonomous vehicles and robotics, amodal completion, which reconstructs the oc
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Crossing the NL/PL Divide: Information Flow Analysis Across the NL/PL Boundary in LLM-Integrated Code
arXiv:2603.28345v1 Announce Type: cross Abstract: LLM API calls are becoming a ubiquitous program construct, yet they create a boundary that no existing program
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Coherent Without Grounding, Grounded Without Success: Observability and Epistemic Failure
arXiv:2603.28371v1 Announce Type: cross Abstract: When an agent can articulate why something works, we typically take this as evidence of genuine understanding.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Membership Inference Attacks against Large Audio Language Models
arXiv:2603.28378v1 Announce Type: cross Abstract: We present the first systematic Membership Inference Attack (MIA) evaluation of Large Audio Language Models (L
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Critic-Free Deep Reinforcement Learning for Maritime Coverage Path Planning on Irregular Hexagonal Grids
arXiv:2603.28385v1 Announce Type: cross Abstract: Maritime surveillance missions, such as search and rescue and environmental monitoring, rely on the efficient
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation
arXiv:2603.28405v1 Announce Type: cross Abstract: Diffusion Transformers (DiT) have established a new state-of-the-art in high-fidelity image synthesis; however
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Evolutionary Discovery of Reinforcement Learning Algorithms via Large Language Models
arXiv:2603.28416v1 Announce Type: cross Abstract: Reinforcement learning algorithms are defined by their learning update rules, which are typically hand-designe
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Spectral Higher-Order Neural Networks
arXiv:2603.28420v1 Announce Type: cross Abstract: Neural networks are fundamental tools of modern machine learning. The standard paradigm assumes binary interac
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
FeDMRA: Federated Incremental Learning with Dynamic Memory Replay Allocation
arXiv:2603.28455v1 Announce Type: cross Abstract: In federated healthcare systems, Federated Class-Incremental Learning (FCIL) has emerged as a key paradigm, en
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention
arXiv:2603.28458v1 Announce Type: cross Abstract: Token-level sparse attention mechanisms, exemplified by DeepSeek Sparse Attention (DSA), achieve fine-grained
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Courtroom-Style Multi-Agent Debate with Progressive RAG and Role-Switching for Controversial Claim Verification
arXiv:2603.28488v1 Announce Type: cross Abstract: Large language models (LLMs) remain unreliable for high-stakes claim verification due to hallucinations and sh
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Next-Token Prediction and Regret Minimization
arXiv:2603.28499v1 Announce Type: cross Abstract: We consider the question of how to employ next-token prediction algorithms in adversarial online decision-maki
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
The Unreasonable Effectiveness of Scaling Laws in AI
arXiv:2603.28507v1 Announce Type: cross Abstract: Classical AI scaling laws, especially for pre-training, describe how training loss decreases with compute in a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Hydra: Unifying Document Retrieval and Generation in a Single Vision-Language Model
arXiv:2603.28554v1 Announce Type: cross Abstract: Visual document understanding typically requires separate retrieval and generation models, doubling memory and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Domain-Invariant Prompt Learning for Vision-Language Models
arXiv:2603.28555v1 Announce Type: cross Abstract: Large pre-trained vision-language models like CLIP have transformed computer vision by aligning images and tex
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Fine-Tuning Large Language Models for Cooperative Tactical Deconfliction of Small Unmanned Aerial Systems
arXiv:2603.28561v1 Announce Type: cross Abstract: The growing deployment of small Unmanned Aerial Systems (sUASs) in low-altitude airspaces has increased the ne
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
CirrusBench: Evaluating LLM-based Agents Beyond Correctness in Real-World Cloud Service Environments
arXiv:2603.28569v1 Announce Type: cross Abstract: The increasing agentic capabilities of Large Language Models (LLMs) have enabled their deployment in real-worl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Learning Partial Action Replacement in Offline MARL
arXiv:2603.28573v1 Announce Type: cross Abstract: Offline multi-agent reinforcement learning (MARL) faces a critical challenge: the joint action space grows exp
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
ChemCLIP: Bridging Organic and Inorganic Anticancer Compounds Through Contrastive Learning
arXiv:2603.28575v1 Announce Type: cross Abstract: The discovery of anticancer therapeutics has traditionally treated organic small molecules and metal-based coo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Moving Beyond Review: Applying Language Models to Planning and Translation in Reflection
arXiv:2603.28596v1 Announce Type: cross Abstract: Reflective writing is known to support the development of students' metacognitive skills, yet learners often s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning
arXiv:2603.28610v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) achieve stronger visual understanding by scaling input fidelity, yet
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Trust-Aware Routing for Distributed Generative AI Inference at the Edge
arXiv:2603.28622v1 Announce Type: cross Abstract: Emerging deployments of Generative AI increasingly execute inference across decentralized and heterogeneous ed
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
AMIGO: Agentic Multi-Image Grounding Oracle Benchmark
arXiv:2603.28662v1 Announce Type: cross Abstract: Agentic vision-language models increasingly act through extended interactions, but most evaluations still focu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding
arXiv:2603.28696v1 Announce Type: cross Abstract: Long video understanding remains challenging for Multi-modal Large Language Models (MLLMs) due to high memory
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Stepwise Credit Assignment for GRPO on Flow-Matching Models
arXiv:2603.28718v1 Announce Type: cross Abstract: Flow-GRPO successfully applies reinforcement learning to flow models, but uses uniform credit assignment acros
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining
arXiv:2603.28737v1 Announce Type: cross Abstract: We introduce ParaSpeechCLAP, a dual-encoder contrastive model that maps speech and text style captions into a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers
arXiv:2603.28762v1 Announce Type: cross Abstract: Modern Text-to-Image (T2I) diffusion models have achieved remarkable semantic alignment, yet they often suffer
DeepCamp AI