Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,148 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Before We Trust Them: Decision-Making Failures in Navigation of Foundation Models
arXiv:2601.05529v4 Announce Type: replace Abstract: High success rates on navigation-related tasks do not necessarily translate into reliable decision making by
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation
arXiv:2601.08323v3 Announce Type: replace Abstract: Equipping agents with memory is essential for solving real-world long-horizon problems. However, most existi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
See, Symbolize, Act: Grounding VLMs with Spatial Representations for Better Gameplay
arXiv:2603.11601v2 Announce Type: replace Abstract: Vision-Language Models (VLMs) excel at describing visual scenes, yet struggle to translate perception into p
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Governance-Aware Vector Subscriptions for Multi-Agent Knowledge Ecosystems
arXiv:2603.20833v2 Announce Type: replace Abstract: As AI agent ecosystems grow, agents need mechanisms to monitor relevant knowledge in real time. Semantic pub
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly
arXiv:2405.00181v3 Announce Type: replace-cross Abstract: Video anomaly understanding (VAU) aims to automatically comprehend unusual occurrences in videos, ther
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
CGRA4ML: A Hardware/Software Framework to Implement Neural Networks for Scientific Edge Computing
arXiv:2408.15561v4 Announce Type: replace-cross Abstract: The scientific community increasingly relies on machine learning (ML) for near-sensor processing, leve
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
INSIGHT: Enhancing Autonomous Driving Safety through Vision-Language Models on Context-Aware Hazard Detection and Edge Case Evaluation
arXiv:2502.00262v4 Announce Type: replace-cross Abstract: Autonomous driving systems face significant challenges in handling unpredictable edge-case scenarios,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation
arXiv:2505.20353v3 Announce Type: replace-cross Abstract: Diffusion Transformers (DiT) are powerful generative models but remain computationally intensive due t
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
StreamDiT: Real-Time Streaming Text-to-Video Generation
arXiv:2507.03745v4 Announce Type: replace-cross Abstract: Recently, great progress has been achieved in text-to-video (T2V) generation by scaling transformer-ba
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
PepThink-R1: LLM for Interpretable Cyclic Peptide Optimization with CoT SFT and Reinforcement Learning
arXiv:2508.14765v3 Announce Type: replace-cross Abstract: Designing therapeutic peptides with tailored properties is hindered by the vastness of sequence space,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Attention-Aligned Reasoning for Large Language Models
arXiv:2510.03223v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) tend to generate a long reasoning chain when solving complex tasks. Howev
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
GUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI Grounding
arXiv:2511.00810v3 Announce Type: replace-cross Abstract: Graphical user interface (GUI) grounding is a key capability for computer-use agents, mapping natural-
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Route Experts by Sequence, not by Token
arXiv:2511.06494v2 Announce Type: replace-cross Abstract: Mixture-of-Experts (MoE) architectures scale large language models (LLMs) by activating only a subset
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Any4D: Open-Prompt 4D Generation from Natural Language and Images
arXiv:2511.18746v2 Announce Type: replace-cross Abstract: While video-generation-based embodied world models have gained increasing attention, their reliance on
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Aligning LLMs with Biomedical Knowledge using Balanced Fine-Tuning
arXiv:2511.21075v2 Announce Type: replace-cross Abstract: Aligning Large Language Models (LLMs) with biomedical knowledge requires understanding both concepts a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos
arXiv:2512.01707v2 Announce Type: replace-cross Abstract: Streaming video understanding requires models not only to process temporally incoming frames, but also
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning
arXiv:2512.02425v2 Announce Type: replace-cross Abstract: Recent advances in video large language models have demonstrated strong capabilities in understanding
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models
arXiv:2512.13607v2 Announce Type: replace-cross Abstract: Building general-purpose reasoning models with reinforcement learning (RL) entails substantial cross-d
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations
arXiv:2512.14080v2 Announce Type: replace-cross Abstract: Mixture of Experts (MoE) models have emerged as the de facto architecture for scaling up language mode
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Dual-objective Language Models: Training Efficiency Without Overfitting
arXiv:2512.14549v3 Announce Type: replace-cross Abstract: This paper combines autoregressive and masked-diffusion training objectives without any architectural
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
MRG-R1: Reinforcement Learning for Clinically Aligned Medical Report Generation
arXiv:2512.16145v2 Announce Type: replace-cross Abstract: Medical report generation aims to automatically produce radiology-style reports from medical images, s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Hearing to Translate: The Effectiveness of Speech Modality Integration into LLMs
arXiv:2512.16378v3 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) expand beyond text, integrating speech as a native modality has given
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
The Dual-State Architecture for Reliable LLM Agents
arXiv:2512.20660v2 Announce Type: replace-cross Abstract: Large Language Models deployed as code generation agents exhibit stochastic behavior incompatible with
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?
arXiv:2601.13227v2 Announce Type: replace-cross Abstract: RAG systems are increasingly evaluated and optimized using LLM judges, an approach that is rapidly bec
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
NRR-Phi: Text-to-State Mapping for Ambiguity Preservation in LLM Inference
arXiv:2601.19933v5 Announce Type: replace-cross Abstract: Large language models exhibit a systematic tendency toward early semantic commitment: given ambiguous
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Values from Casual Conversations
arXiv:2601.22440v2 Announce Type: replace-cross Abstract: Does AI understand human values? While this remains an open philosophical question, we take a pragmati
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
EDU-CIRCUIT-HW: Evaluating Multimodal Large Language Models on Real-World University-Level STEM Student Handwritten Solutions
arXiv:2602.00095v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) hold significant promise for revolutionizing traditional educ
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
PISCO: Precise Video Instance Insertion with Sparse Control
arXiv:2602.08277v2 Announce Type: replace-cross Abstract: The landscape of AI video generation is undergoing a pivotal shift: moving beyond general generation -
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
SWE Context Bench: A Benchmark for Context Learning in Coding
arXiv:2602.08316v2 Announce Type: replace-cross Abstract: Large language models are increasingly used as programming agents for repository level software engine
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
arXiv:2602.09678v2 Announce Type: replace-cross Abstract: Since 1887, administrative law has navigated a "capability-accountability trap": technological change
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
The Effective Depth Paradox: Evaluating the Relationship between Architectural Topology and Trainability in Deep CNNs
arXiv:2602.13298v2 Announce Type: replace-cross Abstract: This paper investigates the relationship between convolutional neural network (CNN) and image recognit
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference
arXiv:2602.18846v2 Announce Type: replace-cross Abstract: Vision-language models (VLMs) have achieved remarkable multimodal understanding and reasoning capabili
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
PedaCo-Gen: Scaffolding Pedagogical Agency in Human-AI Collaborative Video Authoring
arXiv:2602.19623v2 Announce Type: replace-cross Abstract: While advancements in Text-to-Video (T2V) generative AI offer a promising path toward democratizing co
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Golden Layers and Where to Find Them: Improved Knowledge Editing for Large Language Models Via Layer Gradient Analysis
arXiv:2602.20207v2 Announce Type: replace-cross Abstract: Knowledge editing in Large Language Models (LLMs) aims to update the model's prediction for a specific
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization
arXiv:2603.14267v3 Announce Type: replace-cross Abstract: Video dubbing has broad applications in filmmaking, multimedia creation, and assistive speech technolo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation
arXiv:2603.15159v4 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have shown strong potential for code generation, yet they remain limited
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
MLLM-based Textual Explanations for Face Comparison
arXiv:2603.16629v3 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) have recently been proposed as a means to generate natural-la
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Modernizing Amdahl's Law: How AI Scaling Laws Shape Computer Architecture
arXiv:2603.20654v2 Announce Type: replace-cross Abstract: Classical Amdahl's Law assumes a fixed decomposition between serial and parallel work and homogeneous
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning
arXiv:2603.21440v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) demonstrate impressive natural language capabilities but often struggle w

Forbes Innovation
🧠 Large Language Models
⚡ AI Lesson
3w ago
Where Digital And Robot-Based AI Agents Now Prevail
A company pursuing 'aggressive modeling scenarios' with AI can anticipate 10% growth,

Forbes Innovation
🧠 Large Language Models
⚡ AI Lesson
3w ago
AI Inference Takes Center Stage At KubeCon Europe 2026
KubeCon Europe 2026 made AI inference its central focus with major CNCF donations including llm-d, Nvidia's GPU DRA driver and a growing AI conformance program.
Techpoint Africa
🧠 Large Language Models
⚡ AI Lesson
3w ago
After dropping out of the university, this Nigerian lady built an AI shopping assistant for Nigerians
In this edition of After Hours, we follow Amina Asu-Beks and how she built an AI-shopping assistant without a technical background or a completed university deg
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
3w ago
I Built Rosetta: An AI Agent That Turns a Notion Row Into a Personalized Onboarding Experience
New hires don't fail because they're unqualified. They fail because the context is scattered, the answers are buried, and the first week is chaos. I've seen it
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
3w ago
ARC-AGI-3 Proves AI Still Can't Replace Human Judgment - And That's the Point
Every few months, something drops that cuts through the AI hype and forces the conversation back to reality. This week, that something was ARC-AGI-3. The result
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
3w ago
terminals were never meant for coding agents
Last week I had 3 agents running. Claude Code in one terminal, Codex in another, OpenCode in a third. I looked away for maybe 10 minutes to read a PR. When I ca
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
3w ago
I Tested GPT-5.4 vs Claude Opus 4.6 vs Gemini 3.1 Pro on 5 Real Coding Tasks
Why I Ran This Test I use all three models daily for coding. But I've never put them head-to-head on the exact same tasks. So I designed 5 real-world coding cha
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
3w ago
The Tiny AI Emotion Engine That Makes Your Companion Feel Alive (Meet DiEmo for LivinGrimoire)
🔥 The Tiny AI Emotion Engine That Makes Your Companion Feel Alive (Meet DiEmo for LivinGrimoire) Most AI companions feel either too robotic… or too clingy. Wha
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
3w ago
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
The AI landscape is experiencing unprecedented growth and transformation. This post delves into the key developments shaping the future of artificial intelligen
DeepCamp AI