Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,164 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Neural Network Conversion of Machine Learning Pipelines
arXiv:2603.25699v1 Announce Type: cross Abstract: Transfer learning and knowledge distillation have recently gained a lot of attention in the deep learning commu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
arXiv:2603.25716v1 Announce Type: cross Abstract: Video world models have shown immense potential in simulating the physical world, yet existing memory mechanis
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Natural-Language Agent Harnesses
arXiv:2603.25723v1 Announce Type: cross Abstract: Agent performance increasingly depends on harness engineering, yet harness design is usually buried in
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving
arXiv:2603.25740v1 Announce Type: cross Abstract: Human driving behavior is inherently personal, which is shaped by long-term habits and influenced by short-ter
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Working Paper: Active Causal Structure Learning with Latent Variables: Towards Learning to Detour in Autonomous Robots
arXiv:2410.20894v3 Announce Type: replace Abstract: Artificial General Intelligence (AGI) Agents and Robots must be able to cope with ever-changing environments
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Semi-Strongly solved: a New Definition Leading Computer to Perfect Gameplay
arXiv:2411.01029v2 Announce Type: replace Abstract: Strong solving of perfect-information games certifies optimal play from every reachable position, but the re
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Research on environment perception and behavior prediction of intelligent UAV based on semantic communication
arXiv:2501.04480v2 Announce Type: replace Abstract: The convergence of drone delivery systems, virtual worlds, and blockchain has transformed logistics and supp
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Concepts Learned Visually by Infants Can Contribute to Visual Learning and Understanding in AI Models
arXiv:2503.03361v3 Announce Type: replace Abstract: Early in development, infants learn to extract surprisingly complex aspects of visual scenes. This early lea
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving
arXiv:2504.15780v3 Announce Type: replace Abstract: Geometric problem solving (GPS) requires precise multimodal understanding and rigorous, step-by-step logical
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Interactive Query Answering on Knowledge Graphs with Soft Entity Constraints
arXiv:2508.13663v4 Announce Type: replace Abstract: Methods for query answering over incomplete knowledge graphs retrieve entities that are likely to be
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Do Language Models Follow Occam's Razor? An Evaluation of Parsimony in Inductive and Abductive Reasoning
arXiv:2509.03345v2 Announce Type: replace Abstract: Non-deductive reasoning, encompassing inductive and abductive reasoning, is essential in addressing complex
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning
arXiv:2509.23768v2 Announce Type: replace Abstract: The chemical reaction recommendation is to select proper reaction condition parameters for chemical reaction
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Planned Diffusion
arXiv:2510.18087v2 Announce Type: replace Abstract: Most large language models are autoregressive: they generate tokens one at a time. Discrete diffusion langua
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Analysing Environmental Efficiency in AI for X-Ray Diagnosis
arXiv:2511.07436v2 Announce Type: replace Abstract: The integration of AI tools into medical applications has aimed to improve the efficiency of diagnosis. The
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
XGrammar-2: Efficient Dynamic Structured Generation Engine for Agentic LLMs
arXiv:2601.04426v2 Announce Type: replace Abstract: Modern LLM agents increasingly rely on dynamic structured generation, such as tool calling and response prot
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback
arXiv:2603.08561v4 Announce Type: replace Abstract: Standard reinforcement learning (RL) for large language model (LLM) agents typically optimizes extrinsic rew
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Consequentialist Objectives and Catastrophe
arXiv:2603.15017v2 Announce Type: replace Abstract: Because human preferences are too complex to codify, AIs operate with misspecified objectives. Optimizing su
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Characterizing Linear Alignment Across Language Models
arXiv:2603.18908v3 Announce Type: replace Abstract: Language models increasingly appear to learn similar representations, despite differences in training object
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Man and machine: artificial intelligence and judicial decision making
arXiv:2603.19042v2 Announce Type: replace Abstract: The integration of artificial intelligence (AI) technologies into judicial decision-making, particularly in
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers
arXiv:2401.11605v2 Announce Type: replace-cross Abstract: We present the Hourglass Diffusion Transformer (HDiT), an image generative model that exhibits linear
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
The Future of AI-Driven Software Engineering
arXiv:2406.07737v2 Announce Type: replace-cross Abstract: A paradigm shift is underway in Software Engineering, with AI systems such as LLMs playing an increasi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
CodeRefine: A Pipeline for Enhancing LLM-Generated Code Implementations of Research Papers
arXiv:2408.13366v2 Announce Type: replace-cross Abstract: This paper presents CodeRefine, a novel framework for automatically transforming research paper method
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
LLMs know their vulnerabilities: Uncover Safety Gaps through Natural Distribution Shifts
arXiv:2410.10700v3 Announce Type: replace-cross Abstract: Safety concerns in large language models (LLMs) have gained significant attention due to their exposur
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
LLM4AD: Large Language Models for Autonomous Driving -- Concept, Review, Benchmark, Experiments, and Future Trends
arXiv:2410.15281v5 Announce Type: replace-cross Abstract: With the broader adoption and highly successful development of Large Language Models (LLMs), there has
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
The Limits of Inference Scaling Through Resampling
arXiv:2411.17501v3 Announce Type: replace-cross Abstract: Recent research has generated hope that inference scaling, such as resampling solutions until they pas
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Physics-Informed Evolution: An Evolutionary Framework for Solving Quantum Control Problems Involving the Schrödinger Equation
arXiv:2502.05228v3 Announce Type: replace-cross Abstract: Physics-informed Neural Networks (PINNs) show that embedding physical laws directly into the learning
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
The LLM Bottleneck: Why Open-Source Vision LLMs Struggle with Hierarchical Visual Recognition
arXiv:2505.24840v2 Announce Type: replace-cross Abstract: This paper reveals that many open-source large language models (LLMs) lack hierarchical knowledge abou
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents
arXiv:2506.12104v3 Announce Type: replace-cross Abstract: Large Language Models (LLMs) are increasingly central to agentic systems due to their strong reasoning
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Instruction Following by Principled Boosting Attention of Large Language Models
arXiv:2506.13734v3 Announce Type: replace-cross Abstract: Large language models' behavior is often shaped by instructions such as system prompts, refusal bounda
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
BMFM-RNA: whole-cell expression decoding improves transcriptomic foundation models
arXiv:2506.14861v2 Announce Type: replace-cross Abstract: Transcriptomic foundation models pretrained with masked language modeling can achieve low pretraining
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning
arXiv:2507.19737v2 Announce Type: replace-cross Abstract: The vulnerability of cities has increased with urbanization and climate change, making it more importa
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
CodeNER: Code Prompting for Named Entity Recognition
arXiv:2507.20423v4 Announce Type: replace-cross Abstract: Recent studies have explored various approaches for treating candidate named entity spans as both sour
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Hierarchical Adaptive networks with Task vectors for Test-Time Adaptation
arXiv:2508.09223v2 Announce Type: replace-cross Abstract: Test-time adaptation allows pretrained models to adjust to incoming data streams, addressing distribut
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Mapping the Course for Prompt-based Structured Prediction
arXiv:2508.15090v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have demonstrated strong performance in a wide range of language tasks wi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
The Information Dynamics of Generative Diffusion
arXiv:2508.19897v4 Announce Type: replace-cross Abstract: Generative diffusion models have emerged as a powerful class of models in machine learning, yet a unif
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
End-to-End Low-Level Neural Control of an Industrial-Grade 6D Magnetic Levitation System
arXiv:2509.01388v2 Announce Type: replace-cross Abstract: Magnetic levitation is poised to revolutionize industrial automation by integrating flexible in-machin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
GeoResponder: Towards Building Geospatial LLMs for Time-Critical Disaster Response
arXiv:2509.19354v3 Announce Type: replace-cross Abstract: LLMs excel at linguistic tasks but lack the inner geospatial capabilities needed for time-critical dis
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models
arXiv:2509.24296v2 Announce Type: replace-cross Abstract: The rapid advancement of Diffusion Large Language Models (dLLMs) introduces unprecedented vulnerabilit
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation
arXiv:2510.24821v3 Announce Type: replace-cross Abstract: We propose Ming-Flash-Omni, an upgraded version of Ming-Omni, built upon a sparser Mixture-of-Experts
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Foundry: Distilling 3D Foundation Models for the Edge
arXiv:2511.20721v2 Announce Type: replace-cross Abstract: Foundation models pre-trained with self-supervised learning (SSL) on large-scale datasets have become
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
A cross-species neural foundation model for end-to-end speech decoding
arXiv:2511.21740v4 Announce Type: replace-cross Abstract: Speech brain-computer interfaces (BCIs) aim to restore communication for people with paralysis by tran
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Epistemic Bias Injection: Biasing LLMs via Selective Context Retrieval
arXiv:2512.00804v2 Announce Type: replace-cross Abstract: When answering user queries, LLMs often retrieve knowledge from external sources stored in retrieval-a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
SWAA: Sliding Window Attention Adaptation for Efficient and Quality Preserving Long Context Processing
arXiv:2512.10411v5 Announce Type: replace-cross Abstract: The quadratic complexity of self-attention in Transformer-based LLMs renders long context inference pr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
arXiv:2512.14698v2 Announce Type: replace-cross Abstract: This paper does not introduce a novel method but instead establishes a straightforward, incremental, y
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Context Matters: Peer-Aware Student Behavioral Engagement Measurement via VLM Action Parsing and LLM Sequence Classification
arXiv:2601.06394v2 Announce Type: replace-cross Abstract: Understanding student behavior in the classroom is essential to improve both pedagogical quality and s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts
arXiv:2601.08881v2 Announce Type: replace-cross Abstract: Unified image generation and editing models suffer from severe task interference in dense diffusion tr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Information Access of the Oppressed: A Problem-Posing Framework for Envisioning Emancipatory Information Access Platforms
arXiv:2601.09600v2 Announce Type: replace-cross Abstract: Online information access (IA) platforms are targets of authoritarian capture. We explore the question
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4w ago
Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia
arXiv:2602.18455v2 Announce Type: replace-cross Abstract: Search engines increasingly display LLM-generated answers shown above organic links, shifting search f
DeepCamp AI