Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,455 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Multiplicative learning from observation-prediction ratios
arXiv:2503.10144v2 Announce Type: replace-cross Abstract: Additive parameter updates, as used in gradient descent and its adaptive extensions, underpin most mod
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Automating quantum feature map design via large language models
arXiv:2504.07396v2 Announce Type: replace-cross Abstract: Quantum feature maps are a key component of quantum machine learning, encoding classical data into qua
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Leakage and Interpretability in Concept-Based Models
arXiv:2504.14094v3 Announce Type: replace-cross Abstract: Concept-based Models aim to improve interpretability by predicting high-level intermediate concepts, r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
GAIA: A Foundation Model for Operational Atmospheric Dynamics
arXiv:2505.18179v3 Announce Type: replace-cross Abstract: We introduce GAIA (Geospatial Artificial Intelligence for Atmospheres), a hybrid self-supervised geosp
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Generalizable Heuristic Generation Through LLMs with Meta-Optimization
arXiv:2505.20881v2 Announce Type: replace-cross Abstract: Heuristic design with large language models (LLMs) has emerged as a promising approach for tackling co
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
CyberGym: Evaluating AI Agents' Real-World Cybersecurity Capabilities at Scale
arXiv:2506.02548v3 Announce Type: replace-cross Abstract: AI agents have significant potential to reshape cybersecurity, making a thorough assessment of their c
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Learning The Minimum Action Distance
arXiv:2506.09276v3 Announce Type: replace-cross Abstract: This paper presents a state representation framework for Markov decision processes (MDPs) that can be
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
RedTopic: Toward Topic-Diverse Red Teaming of Large Language Models
arXiv:2507.00026v2 Announce Type: replace-cross Abstract: As large language models (LLMs) are increasingly deployed as black-box components in real-world applic
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Graph Structure Learning with Privacy Guarantees for Open Graph Data
arXiv:2507.19116v3 Announce Type: replace-cross Abstract: Publishing open graph data while preserving individual privacy remains challenging when data publisher
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
From Product Hilbert Spaces to the Generalized Koopman Operator and the Nonlinear Fundamental Lemma
arXiv:2508.07494v2 Announce Type: replace-cross Abstract: The generalization of the Koopman operator to systems with control input and the derivation of a nonli
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
From Context to Intent: Reasoning-Guided Function-Level Code Completion
arXiv:2508.09537v2 Announce Type: replace-cross Abstract: The growing capabilities of Large Language Models (LLMs) have led to their widespread adoption for fun
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
DreamAudio: Customized Text-to-Audio Generation with Diffusion Models
arXiv:2509.06027v2 Announce Type: replace-cross Abstract: With the development of large-scale diffusion-based and language-modeling-based generative models, imp
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
MARS: toward more efficient multi-agent collaboration for LLM reasoning
arXiv:2509.20502v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have achieved impressive results in natural language understanding, yet t
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
VL-KnG: Persistent Spatiotemporal Knowledge Graphs from Egocentric Video for Embodied Scene Understanding
arXiv:2510.01483v2 Announce Type: replace-cross Abstract: Vision-language models (VLMs) demonstrate strong image-level scene understanding but often lack persis
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Generating Findings for Jaw Cysts in Dental Panoramic Radiographs Using a GPT-Based VLM: A Preliminary Study on Building a Two-Stage Self-Correction Loop with Structured Output (SLSO) Framework
arXiv:2510.02001v4 Announce Type: replace-cross Abstract: Vision-language models (VLMs) such as GPT (Generative Pre-Trained Transformer) have shown potential fo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents
arXiv:2510.14967v2 Announce Type: replace-cross Abstract: Large language model (LLM)-based agents are increasingly trained with reinforcement learning (RL) to e
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents
arXiv:2510.15994v2 Announce Type: replace-cross Abstract: The Model Context Protocol (MCP) standardizes how large language model (LLM) agents discover, describe
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
GUIrilla: A Scalable Framework for Automated Desktop UI Exploration
arXiv:2510.16051v2 Announce Type: replace-cross Abstract: The performance and generalization of foundation models for interactive systems critically depend on t
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Gaze-VLM: Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding
arXiv:2510.21356v2 Announce Type: replace-cross Abstract: Eye gaze offers valuable cues about attention, short-term intent, and future actions, making it a powe
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Quantifying Systemic Vulnerability in the Foundation Model Industry
arXiv:2510.23421v2 Announce Type: replace-cross Abstract: The foundation model industry exhibits unprecedented concentration in critical inputs: semiconductors,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs
arXiv:2511.05919v3 Announce Type: replace-cross Abstract: LLMs are now an integral part of information retrieval. As such, their role as question answering chat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Understanding
arXiv:2511.12449v2 Announce Type: replace-cross Abstract: Recent Multimodal Large Language Models (MLLMs) have significantly advanced e-commerce product underst
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
HUMORCHAIN: Theory-Guided Multi-Stage Reasoning for Interpretable Multimodal Humor Generation
arXiv:2511.21732v2 Announce Type: replace-cross Abstract: Humor, as both a creative human activity and a social binding mechanism, has long posed a major challe
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Masking Matters: Unlocking the Spatial Reasoning Capabilities of LLMs for 3D Scene-Language Understanding
arXiv:2512.02487v2 Announce Type: replace-cross Abstract: Recent advances in 3D scene-language understanding have leveraged Large Language Models (LLMs) for 3D
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles
arXiv:2512.03454v3 Announce Type: replace-cross Abstract: Interpreting natural-language commands to localize target objects is critical for autonomous driving (
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Arc Gradient Descent: A Geometrically Motivated Gradient Descent-based Optimiser with Phase-Aware, User-Controlled Step Dynamics (proof-of-concept)
arXiv:2512.06737v3 Announce Type: replace-cross Abstract: The paper presents the formulation, implementation, and evaluation of the ArcGD optimiser. The evaluat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Metaphor-based Jailbreak Attacks on Text-to-Image Models
arXiv:2512.10766v2 Announce Type: replace-cross Abstract: Text-to-image (T2I) models commonly incorporate defense mechanisms to prevent the generation of sensit
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Schrödinger's Navigator: Imagining an Ensemble of Futures for Zero-Shot Object Navigation
arXiv:2512.21201v2 Announce Type: replace-cross Abstract: Zero-shot object navigation (ZSON) requires robots to locate target objects in unseen environments wit
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
AI-Generated Code Is Not Reproducible (Yet): An Empirical Study of Dependency Gaps in LLM-Based Coding Agents
arXiv:2512.22387v3 Announce Type: replace-cross Abstract: The rise of Large Language Models (LLMs) as coding agents promises to accelerate software development,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
VLM-CAD: VLM-Optimized Collaborative Agent Design Workflow for Analog Circuit Sizing
arXiv:2601.07315v4 Announce Type: replace-cross Abstract: Vision Language Models (VLMs) have demonstrated remarkable potential in multimodal reasoning, yet they
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Hierarchical Long Video Understanding with Audiovisual Entity Cohesion and Agentic Search
arXiv:2601.13719v2 Announce Type: replace-cross Abstract: Long video understanding presents significant challenges for vision-language models due to extremely l
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Representational Homomorphism Predicts and Improves Compositional Generalization In Transformer Language Model
arXiv:2601.18858v2 Announce Type: replace-cross Abstract: Compositional generalization-the ability to interpret novel combinations of familiar components-remain
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models
arXiv:2601.22060v3 Announce Type: replace-cross Abstract: Multimodal large language models (MLLMs) have achieved remarkable success across a broad range of visi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Residual Decoding: Mitigating Hallucinations in Large Vision-Language Models via History-Aware Residual Guidance
arXiv:2602.01047v3 Announce Type: replace-cross Abstract: Large Vision-Language Models (LVLMs) can reason from image-text inputs and perform well in various mul
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
FlyPrompt: Brain-Inspired Random-Expanded Routing with Temporal-Ensemble Experts for General Continual Learning
arXiv:2602.01976v3 Announce Type: replace-cross Abstract: General continual learning (GCL) challenges intelligent systems to learn from single-pass, non-station
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Behavioral Consistency Validation for LLM Agents: An Analysis of Trading-Style Switching through Stock-Market Simulation
arXiv:2602.07023v2 Announce Type: replace-cross Abstract: Recent works have increasingly applied Large Language Models (LLMs) as agents in financial stock marke
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Energy-Aware Reinforcement Learning for Robotic Manipulation of Articulated Components in Infrastructure Operation and Maintenance
arXiv:2602.12288v3 Announce Type: replace-cross Abstract: With the growth of intelligent civil infrastructure and smart cities, operation and maintenance (O&M)
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
KDFlow: A User-Friendly and Efficient Knowledge Distillation Framework for Large Language Models
arXiv:2603.01875v2 Announce Type: replace-cross Abstract: Knowledge distillation (KD) is an essential technique to compress large language models (LLMs) into sm
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
From Conflict to Consensus: Boosting Medical Reasoning via Multi-Round Agentic RAG
arXiv:2603.03292v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) exhibit high reasoning capacity in medical question-answering, but their
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
When Sensors Fail: Temporal Sequence Models for Robust PPO under Sensor Drift
arXiv:2603.04648v2 Announce Type: replace-cross Abstract: Real-world reinforcement learning systems must operate under distributional drift in their observation
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Human Presence Detection via Wi-Fi Range-Filtered Doppler Spectrum on Commodity Laptops
arXiv:2603.10845v2 Announce Type: replace-cross Abstract: Human Presence Detection (HPD) is key to enable intelligent power management and security features in
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL
arXiv:2603.13606v2 Announce Type: replace-cross Abstract: Mixture-of-Experts (MoE) architectures have become essential for scaling large language models, drivin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
SARE: Sample-wise Adaptive Reasoning for Training-free Fine-grained Visual Recognition
arXiv:2603.17729v2 Announce Type: replace-cross Abstract: Recent advances in Large Vision-Language Models (LVLMs) have enabled training-free Fine-Grained Visual
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
EVA: Aligning Video World Models with Executable Robot Actions via Inverse Dynamics Rewards
arXiv:2603.17808v2 Announce Type: replace-cross Abstract: Video generative models are increasingly used as world models for robotics, where a model generates a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Elastic Weight Consolidation Done Right for Continual Learning
arXiv:2603.18596v2 Announce Type: replace-cross Abstract: Weight regularization methods in continual learning (CL) alleviate catastrophic forgetting by assessin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Mi:dm K 2.5 Pro
arXiv:2603.18788v2 Announce Type: replace-cross Abstract: The evolving LLM landscape requires capabilities beyond simple text generation, prioritizing multi-ste
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Children's Intelligence Tests Pose Challenges for MLLMs? KidGym: A 2D Grid-Based Reasoning Benchmark for MLLMs
arXiv:2603.20209v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) combine the linguistic strengths of LLMs with the ability to
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
CRoCoDiL: Continuous and Robust Conditioned Diffusion for Language
arXiv:2603.20210v2 Announce Type: replace-cross Abstract: Masked Diffusion Models (MDMs) provide an efficient non-causal alternative to autoregressive generatio
DeepCamp AI