Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,252 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech
arXiv:2603.20112v1 Announce Type: cross Abstract: Personalizing Automatic Speech Recognition (ASR) for non-normative speech remains challenging because data col
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning
arXiv:2603.20116v1 Announce Type: cross Abstract: Conventional fine-tuning on domain-specific datasets can inadvertently alter a model's pretrained multimodal p
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models
arXiv:2603.20122v1 Announce Type: cross Abstract: Large Language Models (LLMs) have been widely deployed, especially through free Web-based applications that ex
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models
arXiv:2603.20161v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across diverse tasks. However, the trut
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning
arXiv:2603.20164v1 Announce Type: cross Abstract: Conventional robot social behavior generation has been limited in flexibility and autonomy, relying on predefi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation
arXiv:2603.20172v1 Announce Type: cross Abstract: Recent work on chain-of-thought (CoT) faithfulness reports single aggregate numbers (e.g., DeepSeek-R1 acknowl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
AI Agents Can Already Autonomously Perform Experimental High Energy Physics
arXiv:2603.20179v1 Announce Type: cross Abstract: Large language model-based AI agents are now able to autonomously execute substantial portions of a high energ
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Adaptive Greedy Frame Selection for Long Video Understanding
arXiv:2603.20180v1 Announce Type: cross Abstract: Large vision--language models (VLMs) are increasingly applied to long-video question answering, yet inference
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Improving Generalization on Cybersecurity Tasks with Multi-Modal Contrastive Learning
arXiv:2603.20181v1 Announce Type: cross Abstract: The use of ML in cybersecurity has long been impaired by generalization issues: Models that work well in contr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking
arXiv:2603.20185v1 Announce Type: cross Abstract: Video agentic models have advanced challenging video-language tasks. However, most agentic approaches still he
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
HPS: Hard Preference Sampling for Human Preference Alignment
arXiv:2502.14400v5 Announce Type: replace Abstract: Aligning Large Language Model (LLM) responses with human preferences is vital for building safe and controll
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives
arXiv:2505.15693v3 Announce Type: replace Abstract: Recent advances in reinforcement learning (RL) have renewed interest in reward design for shaping agent beha
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Preference-Driven Multi-Objective Combinatorial Optimization with Conditional Computation
arXiv:2506.08898v4 Announce Type: replace Abstract: Recent deep reinforcement learning methods have achieved remarkable success in solving multi-objective combi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Improved Generalized Planning with LLMs through Strategy Refinement and Reflection
arXiv:2508.13876v2 Announce Type: replace Abstract: LLMs have recently been used to generate Python programs representing generalized plans in PDDL planning, i.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Evaluation-Aware Reinforcement Learning
arXiv:2509.19464v3 Announce Type: replace Abstract: Policy evaluation is a core component of many reinforcement learning (RL) algorithms and a critical tool for
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark
arXiv:2509.24897v2 Announce Type: replace Abstract: The integration of visual understanding and generation into unified multimodal models represents a significa
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
PDDL Axioms Are Equivalent to Least Fixed Point Logic (Extended Version)
arXiv:2510.14412v2 Announce Type: replace Abstract: Axioms are a feature of the Planning Domain Definition Language PDDL that can be considered as a generalizat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
DAPS++: Rethinking Diffusion Inverse Problems with Decoupled Posterior Annealing
arXiv:2511.17038v2 Announce Type: replace Abstract: From a Bayesian perspective, score-based diffusion solves inverse problems through joint inference, embeddin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
On Sample-Efficient Generalized Planning via Learned Transition Models
arXiv:2602.23148v3 Announce Type: replace Abstract: Generalized planning studies the construction of solution strategies that generalize across families of plan
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
A Unified Framework to Quantify Cultural Intelligence of AI
arXiv:2603.01211v2 Announce Type: replace Abstract: As generative AI technologies are increasingly being launched across the globe, assessing their competence t
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Generative AI-assisted Participatory Modeling in Socio-Environmental Planning under Deep Uncertainty
arXiv:2603.17021v2 Announce Type: replace Abstract: Socio-environmental planning under deep uncertainty requires researchers to identify and conceptualize probl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
DEAF: A Benchmark for Diagnostic Evaluation of Acoustic Faithfulness in Audio Language Models
arXiv:2603.18048v2 Announce Type: replace Abstract: Recent Audio Multimodal Large Language Models (Audio MLLMs) demonstrate impressive performance on speech ben
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Evaluating Game Difficulty in Tetris Block Puzzle
arXiv:2603.18994v2 Announce Type: replace Abstract: Tetris Block Puzzle is a single player stochastic puzzle in which a player places blocks on an 8 x 8 grid to
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge
arXiv:2310.11703v3 Announce Type: replace-cross Abstract: As high-dimensional vector data increasingly surpasses the processing capabilities of traditional data
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
LISAA: A Framework for Large Language Model Information Security Awareness Assessment
arXiv:2411.13207v3 Announce Type: replace-cross Abstract: The popularity of large language models (LLMs) continues to grow, and LLM-based assistants have become
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Understanding and Optimizing Multi-Stage AI Inference Pipelines
arXiv:2504.09775v5 Announce Type: replace-cross Abstract: The rapid evolution of Large Language Models (LLMs) has driven the need for increasingly sophisticated
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
HALO: Hierarchical Reinforcement Learning for Large-Scale Adaptive Traffic Signal Control
arXiv:2506.14391v3 Announce Type: replace-cross Abstract: Adaptive traffic signal control (ATSC) is essential for mitigating urban congestion in modern smart ci
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for Alzheimer's and Dementia Caregivers
arXiv:2506.15047v2 Announce Type: replace-cross Abstract: Family caregivers of individuals with Alzheimer's Disease and Related Dementia (AD/ADRD) face signific
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
World4RL: Diffusion World Models for Policy Refinement with Reinforcement Learning for Robotic Manipulation
arXiv:2509.19080v2 Announce Type: replace-cross Abstract: Robotic manipulation policies are commonly initialized through imitation learning, but their performan
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Responsible AI Technical Report
arXiv:2509.20057v4 Announce Type: replace-cross Abstract: KT developed a Responsible AI (RAI) assessment methodology and risk mitigation technologies to ensure
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning
arXiv:2509.24773v4 Announce Type: replace-cross Abstract: Video-conditioned audio generation, including Video-to-Sound (V2S) and Visual Text-to-Speech (VisualTT
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
FinReflectKG -- EvalBench: Benchmarking Financial KG with Multi-Dimensional Evaluation
arXiv:2510.05710v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly being used to extract structured knowledge from unstruct
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
CARES: Context-Aware Resolution Selector for VLMs
arXiv:2510.19496v2 Announce Type: replace-cross Abstract: Large vision-language models (VLMs) commonly process images at native or high resolution to remain eff
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Rep2Text: Decoding Full Text from a Single LLM Token Representation
arXiv:2511.06571v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have achieved remarkable progress across diverse tasks, yet their interna
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter
arXiv:2511.16665v3 Announce Type: replace-cross Abstract: The emergence of Large Language Models (LLMs) with strong reasoning capabilities marks a significant m
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
The Phish, The Spam, and The Valid: Generating Feature-Rich Emails for Benchmarking LLMs
arXiv:2511.21448v5 Announce Type: replace-cross Abstract: In this paper, we introduce a metadata-enriched generation framework (PhishFuzzer) that seeds real ema
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Dementia-R1: Reinforced Pretraining and Reasoning from Unstructured Clinical Notes for Real-World Dementia Prognosis
arXiv:2601.03018v2 Announce Type: replace-cross Abstract: While Large Language Models (LLMs) have shown strong performance on clinical text understanding, they
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
A Multi-Perspective Benchmark and Moderation Model for Evaluating Safety and Adversarial Robustness
arXiv:2601.03273v2 Announce Type: replace-cross Abstract: As large language models (LLMs) become deeply embedded in daily life, the urgent need for safer modera
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
StealthRL: Reinforcement Learning Paraphrase Attacks for Multi-Detector Evasion of AI-Text Detectors
arXiv:2602.08934v2 Announce Type: replace-cross Abstract: AI-text detectors face a critical robustness challenge: adversarial paraphrasing attacks that preserve
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
LHAW: Controllable Underspecification for Long-Horizon Tasks
arXiv:2602.10525v2 Announce Type: replace-cross Abstract: Long-horizon workflow agents that operate effectively over extended periods are essential for truly au
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
The Art of Efficient Reasoning: Data, Reward, and Optimization
arXiv:2602.20945v3 Announce Type: replace-cross Abstract: Large Language Models (LLMs) consistently benefit from scaled Chain-of-Thought (CoT) reasoning, but al
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
On the Structural Non-Preservation of Epistemic Behaviour under Policy Transformation
arXiv:2602.21424v2 Announce Type: replace-cross Abstract: Reinforcement learning (RL) agents under partial observability often condition actions on internally a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
CIRCUS: Circuit Consensus under Uncertainty via Stability Ensembles
arXiv:2603.00523v2 Announce Type: replace-cross Abstract: Every mechanistic circuit carries an invisible asterisk: it reflects not just the model's computation,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
From Intuition to Investigation: A Tool-Augmented Reasoning MLLM Framework for Generalizable Face Anti-Spoofing
arXiv:2603.01038v2 Announce Type: replace-cross Abstract: Face recognition remains vulnerable to presentation attacks, calling for robust Face Anti-Spoofing (FA
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
arXiv:2603.12180v2 Announce Type: replace-cross Abstract: Multimodal agents offer a promising path to automating complex document-intensive workflows. Yet, a cr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Prompt Injection as Role Confusion
arXiv:2603.12277v2 Announce Type: replace-cross Abstract: Language models remain vulnerable to prompt injection attacks despite extensive safety training. We tr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems
arXiv:2603.15727v2 Announce Type: replace-cross Abstract: Autonomous LLM-based agents increasingly operate as long-running processes forming densely interconnec
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
FEAT: A Linear-Complexity Foundation Model for Extremely Large Structured Data
arXiv:2603.16513v2 Announce Type: replace-cross Abstract: Structured data is foundational to healthcare, finance, e-commerce, and scientific data management. La
DeepCamp AI