Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,236 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Skilled AI Agents for Embedded and IoT Systems Development
arXiv:2603.19583v1 Announce Type: cross Abstract: Large language models (LLMs) and agentic systems have shown promise for automated software development, but ap
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
ARMOR: Adaptive Resilience Against Model Poisoning Attacks in Continual Federated Learning for Mobile Indoor Localization
arXiv:2603.19594v1 Announce Type: cross Abstract: Indoor localization has become increasingly essential for applications ranging from asset tracking to deliveri
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
FB-CLIP: Fine-Grained Zero-Shot Anomaly Detection with Foreground-Background Disentanglement
arXiv:2603.19608v1 Announce Type: cross Abstract: Fine-grained anomaly detection is crucial in industrial and medical applications, but labeled anomalies are of
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
CAF-Score: Calibrating CLAP with LALMs for Reference-free Audio Captioning Evaluation
arXiv:2603.19615v1 Announce Type: cross Abstract: While Large Audio-Language Models (LALMs) have advanced audio captioning, robust evaluation remains difficult.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
DeepStock: Reinforcement Learning with Policy Regularizations for Inventory Management
arXiv:2603.19621v1 Announce Type: cross Abstract: Deep Reinforcement Learning (DRL) provides a general-purpose methodology for training inventory policies that
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
MetaCues: Enabling Critical Engagement with Generative AI for Information Seeking and Sensemaking
arXiv:2603.19634v1 Announce Type: cross Abstract: Generative AI (GenAI) search tools are increasingly used for information seeking, yet their design tends to en
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework
arXiv:2603.19643v1 Announce Type: cross Abstract: Despite the rapid advancement of Virtual Try-On (VTON) and Try-Off (VTOFF) technologies, existing VTON methods
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
PolicySim: An LLM-Based Agent Social Simulation Sandbox for Proactive Policy Optimization
arXiv:2603.19649v1 Announce Type: cross Abstract: Social platforms serve as central hubs for information exchange, where user behaviors and platform interventio
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
The Residual Stream Is All You Need: On the Redundancy of the KV Cache in Transformer Inference
arXiv:2603.19664v1 Announce Type: cross Abstract: The key-value (KV) cache is widely treated as essential state in transformer inference, and a large body of wo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
ATHENA: Adaptive Test-Time Steering for Improving Count Fidelity in Diffusion Models
arXiv:2603.19676v1 Announce Type: cross Abstract: Text-to-image diffusion models achieve high visual fidelity but surprisingly exhibit systematic failures in nu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems
arXiv:2603.19677v1 Announce Type: cross Abstract: Large language model (LLM)-based multi-agent systems (MAS) have demonstrated exceptional capabilities in solvi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
AIGQ: An End-to-End Hybrid Generative Architecture for E-commerce Query Recommendation
arXiv:2603.19710v1 Announce Type: cross Abstract: Pre-search query recommendation, widely known as HintQ on Taobao's homepage, plays a vital role in intent capt
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Enhancing Alignment for Unified Multimodal Models via Semantically-Grounded Supervision
arXiv:2603.19807v1 Announce Type: cross Abstract: Unified Multimodal Models (UMMs) have emerged as a promising paradigm that integrates multimodal understanding
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Semantic Delta: An Interpretable Signal Differentiating Human and LLMs Dialogue
arXiv:2603.19849v1 Announce Type: cross Abstract: Do LLMs talk like us? This question intrigues a multitude of scholar and it is relevant in many fields, from e
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Failure Modes for Deep Learning-Based Online Mapping: How to Measure and Address Them
arXiv:2603.19852v1 Announce Type: cross Abstract: Deep learning-based online mapping has emerged as a cornerstone of autonomous driving, yet these models freque
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time
arXiv:2603.19880v1 Announce Type: cross Abstract: Test-Time Reinforcement Learning (TTRL) enables Large Language Models (LLMs) to enhance reasoning capabilities
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Integrating Meta-Features with Knowledge Graph Embeddings for Meta-Learning
arXiv:2603.19888v1 Announce Type: cross Abstract: The vast collection of machine learning records available on the web presents a significant opportunity for me
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery
arXiv:2603.19918v1 Announce Type: cross Abstract: Generalized Category Discovery (GCD) seeks to uncover novel categories in unlabeled data while preserving reco
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Trojan's Whisper: Stealthy Manipulation of OpenClaw through Injected Bootstrapped Guidance
arXiv:2603.19974v1 Announce Type: cross Abstract: Autonomous coding agents are increasingly integrated into software development workflows, offering capabilitie
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Promoting Critical Thinking With Domain-Specific Generative AI Provocations
arXiv:2603.19975v1 Announce Type: cross Abstract: The evidence on the effects of generative AI (GenAI) on critical thinking is mixed, with studies suggesting bo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
X-World: Controllable Ego-Centric Multi-Camera World Models for Scalable End-to-End Driving
arXiv:2603.19979v1 Announce Type: cross Abstract: Scalable and reliable evaluation is increasingly critical in the end-to-end era of autonomous driving, where v
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States
arXiv:2603.19987v1 Announce Type: cross Abstract: Reinforcement learning (RL) has become a standard paradigm for post-training and aligning Large Language Model
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Physics-Informed Long-Range Coulomb Correction for Machine-learning Hamiltonians
arXiv:2603.20007v1 Announce Type: cross Abstract: Machine-learning electronic Hamiltonians achieve orders-of-magnitude speedups over density-functional theory,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Detached Skip-Links and $R$-Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR
arXiv:2603.20020v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) excel at high-level reasoning yet fail on OCR tasks where fine-graine
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families
arXiv:2603.20042v1 Announce Type: cross Abstract: Large language models (LLMs) have driven substantial advances in speech language models (SpeechLMs), yielding
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries
arXiv:2603.20062v1 Announce Type: cross Abstract: When a traveler asks an AI search engine to recommend a hotel, which sources get cited -- and does query frami
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Fine-tuning Timeseries Predictors Using Reinforcement Learning
arXiv:2603.20063v1 Announce Type: cross Abstract: This chapter presents three major reinforcement learning algorithms used for fine-tuning financial forecasters
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Agentic Harness for Real-World Compilers
arXiv:2603.20075v1 Announce Type: cross Abstract: Compilers are critical to modern computing, yet fixing compiler bugs is difficult. While recent large language
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
LLM-Enhanced Semantic Data Integration of Electronic Component Qualifications in the Aerospace Domain
arXiv:2603.20094v1 Announce Type: cross Abstract: Large manufacturing companies face challenges in information retrieval due to data silos maintained by differe
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models
arXiv:2603.20100v1 Announce Type: cross Abstract: Direct Preference Optimization (DPO) is widely used after supervised fine-tuning (SFT) to align language model
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Spectral Alignment in Forward-Backward Representations via Temporal Abstraction
arXiv:2603.20103v1 Announce Type: cross Abstract: Forward-backward (FB) representations provide a powerful framework for learning the successor representation (
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Var-JEPA: A Variational Formulation of the Joint-Embedding Predictive Architecture -- Bridging Predictive and Generative Self-Supervised Learning
arXiv:2603.20111v1 Announce Type: cross Abstract: The Joint-Embedding Predictive Architecture (JEPA) is often seen as a non-generative alternative to likelihood
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech
arXiv:2603.20112v1 Announce Type: cross Abstract: Personalizing Automatic Speech Recognition (ASR) for non-normative speech remains challenging because data col
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning
arXiv:2603.20116v1 Announce Type: cross Abstract: Conventional fine-tuning on domain-specific datasets can inadvertently alter a model's pretrained multimodal p
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models
arXiv:2603.20122v1 Announce Type: cross Abstract: Large Language Models (LLMs) have been widely deployed, especially through free Web-based applications that ex
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models
arXiv:2603.20161v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across diverse tasks. However, the trut
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning
arXiv:2603.20164v1 Announce Type: cross Abstract: Conventional robot social behavior generation has been limited in flexibility and autonomy, relying on predefi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation
arXiv:2603.20172v1 Announce Type: cross Abstract: Recent work on chain-of-thought (CoT) faithfulness reports single aggregate numbers (e.g., DeepSeek-R1 acknowl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
AI Agents Can Already Autonomously Perform Experimental High Energy Physics
arXiv:2603.20179v1 Announce Type: cross Abstract: Large language model-based AI agents are now able to autonomously execute substantial portions of a high energ
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Adaptive Greedy Frame Selection for Long Video Understanding
arXiv:2603.20180v1 Announce Type: cross Abstract: Large vision--language models (VLMs) are increasingly applied to long-video question answering, yet inference
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Improving Generalization on Cybersecurity Tasks with Multi-Modal Contrastive Learning
arXiv:2603.20181v1 Announce Type: cross Abstract: The use of ML in cybersecurity has long been impaired by generalization issues: Models that work well in contr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking
arXiv:2603.20185v1 Announce Type: cross Abstract: Video agentic models have advanced challenging video-language tasks. However, most agentic approaches still he
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
HPS: Hard Preference Sampling for Human Preference Alignment
arXiv:2502.14400v5 Announce Type: replace Abstract: Aligning Large Language Model (LLM) responses with human preferences is vital for building safe and controll
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives
arXiv:2505.15693v3 Announce Type: replace Abstract: Recent advances in reinforcement learning (RL) have renewed interest in reward design for shaping agent beha
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Preference-Driven Multi-Objective Combinatorial Optimization with Conditional Computation
arXiv:2506.08898v4 Announce Type: replace Abstract: Recent deep reinforcement learning methods have achieved remarkable success in solving multi-objective combi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Improved Generalized Planning with LLMs through Strategy Refinement and Reflection
arXiv:2508.13876v2 Announce Type: replace Abstract: LLMs have recently been used to generate Python programs representing generalized plans in PDDL planning, i.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Evaluation-Aware Reinforcement Learning
arXiv:2509.19464v3 Announce Type: replace Abstract: Policy evaluation is a core component of many reinforcement learning (RL) algorithms and a critical tool for
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark
arXiv:2509.24897v2 Announce Type: replace Abstract: The integration of visual understanding and generation into unified multimodal models represents a significa
DeepCamp AI