Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,555

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,407 Reads 5,148

Showing 5,148 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Before We Trust Them: Decision-Making Failures in Navigation of Foundation Models

arXiv:2601.05529v4 Announce Type: replace Abstract: High success rates on navigation-related tasks do not necessarily translate into reliable decision making by

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

arXiv:2601.08323v3 Announce Type: replace Abstract: Equipping agents with memory is essential for solving real-world long-horizon problems. However, most existi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

See, Symbolize, Act: Grounding VLMs with Spatial Representations for Better Gameplay

arXiv:2603.11601v2 Announce Type: replace Abstract: Vision-Language Models (VLMs) excel at describing visual scenes, yet struggle to translate perception into p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Governance-Aware Vector Subscriptions for Multi-Agent Knowledge Ecosystems

arXiv:2603.20833v2 Announce Type: replace Abstract: As AI agent ecosystems grow, agents need mechanisms to monitor relevant knowledge in real time. Semantic pub

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly

arXiv:2405.00181v3 Announce Type: replace-cross Abstract: Video anomaly understanding (VAU) aims to automatically comprehend unusual occurrences in videos, ther

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

CGRA4ML: A Hardware/Software Framework to Implement Neural Networks for Scientific Edge Computing

arXiv:2408.15561v4 Announce Type: replace-cross Abstract: The scientific community increasingly relies on machine learning (ML) for near-sensor processing, leve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

INSIGHT: Enhancing Autonomous Driving Safety through Vision-Language Models on Context-Aware Hazard Detection and Edge Case Evaluation

arXiv:2502.00262v4 Announce Type: replace-cross Abstract: Autonomous driving systems face significant challenges in handling unpredictable edge-case scenarios,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation

arXiv:2505.20353v3 Announce Type: replace-cross Abstract: Diffusion Transformers (DiT) are powerful generative models but remain computationally intensive due t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

StreamDiT: Real-Time Streaming Text-to-Video Generation

arXiv:2507.03745v4 Announce Type: replace-cross Abstract: Recently, great progress has been achieved in text-to-video (T2V) generation by scaling transformer-ba

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

PepThink-R1: LLM for Interpretable Cyclic Peptide Optimization with CoT SFT and Reinforcement Learning

arXiv:2508.14765v3 Announce Type: replace-cross Abstract: Designing therapeutic peptides with tailored properties is hindered by the vastness of sequence space,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Attention-Aligned Reasoning for Large Language Models

arXiv:2510.03223v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) tend to generate a long reasoning chain when solving complex tasks. Howev

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

GUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI Grounding

arXiv:2511.00810v3 Announce Type: replace-cross Abstract: Graphical user interface (GUI) grounding is a key capability for computer-use agents, mapping natural-

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Route Experts by Sequence, not by Token

arXiv:2511.06494v2 Announce Type: replace-cross Abstract: Mixture-of-Experts (MoE) architectures scale large language models (LLMs) by activating only a subset

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Any4D: Open-Prompt 4D Generation from Natural Language and Images

arXiv:2511.18746v2 Announce Type: replace-cross Abstract: While video-generation-based embodied world models have gained increasing attention, their reliance on

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Aligning LLMs with Biomedical Knowledge using Balanced Fine-Tuning

arXiv:2511.21075v2 Announce Type: replace-cross Abstract: Aligning Large Language Models (LLMs) with biomedical knowledge requires understanding both concepts a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos

arXiv:2512.01707v2 Announce Type: replace-cross Abstract: Streaming video understanding requires models not only to process temporally incoming frames, but also

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning

arXiv:2512.02425v2 Announce Type: replace-cross Abstract: Recent advances in video large language models have demonstrated strong capabilities in understanding

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

arXiv:2512.13607v2 Announce Type: replace-cross Abstract: Building general-purpose reasoning models with reinforcement learning (RL) entails substantial cross-d

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations

arXiv:2512.14080v2 Announce Type: replace-cross Abstract: Mixture of Experts (MoE) models have emerged as the de facto architecture for scaling up language mode

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Dual-objective Language Models: Training Efficiency Without Overfitting

arXiv:2512.14549v3 Announce Type: replace-cross Abstract: This paper combines autoregressive and masked-diffusion training objectives without any architectural

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

MRG-R1: Reinforcement Learning for Clinically Aligned Medical Report Generation

arXiv:2512.16145v2 Announce Type: replace-cross Abstract: Medical report generation aims to automatically produce radiology-style reports from medical images, s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Hearing to Translate: The Effectiveness of Speech Modality Integration into LLMs

arXiv:2512.16378v3 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) expand beyond text, integrating speech as a native modality has given

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

The Dual-State Architecture for Reliable LLM Agents

arXiv:2512.20660v2 Announce Type: replace-cross Abstract: Large Language Models deployed as code generation agents exhibit stochastic behavior incompatible with

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?

arXiv:2601.13227v2 Announce Type: replace-cross Abstract: RAG systems are increasingly evaluated and optimized using LLM judges, an approach that is rapidly bec

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

NRR-Phi: Text-to-State Mapping for Ambiguity Preservation in LLM Inference

arXiv:2601.19933v5 Announce Type: replace-cross Abstract: Large language models exhibit a systematic tendency toward early semantic commitment: given ambiguous

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Values from Casual Conversations

arXiv:2601.22440v2 Announce Type: replace-cross Abstract: Does AI understand human values? While this remains an open philosophical question, we take a pragmati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

EDU-CIRCUIT-HW: Evaluating Multimodal Large Language Models on Real-World University-Level STEM Student Handwritten Solutions

arXiv:2602.00095v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) hold significant promise for revolutionizing traditional educ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

PISCO: Precise Video Instance Insertion with Sparse Control

arXiv:2602.08277v2 Announce Type: replace-cross Abstract: The landscape of AI video generation is undergoing a pivotal shift: moving beyond general generation -

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

SWE Context Bench: A Benchmark for Context Learning in Coding

arXiv:2602.08316v2 Announce Type: replace-cross Abstract: Large language models are increasingly used as programming agents for repository level software engine

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap

arXiv:2602.09678v2 Announce Type: replace-cross Abstract: Since 1887, administrative law has navigated a "capability-accountability trap": technological change

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

The Effective Depth Paradox: Evaluating the Relationship between Architectural Topology and Trainability in Deep CNNs

arXiv:2602.13298v2 Announce Type: replace-cross Abstract: This paper investigates the relationship between convolutional neural network (CNN) and image recognit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference

arXiv:2602.18846v2 Announce Type: replace-cross Abstract: Vision-language models (VLMs) have achieved remarkable multimodal understanding and reasoning capabili

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

PedaCo-Gen: Scaffolding Pedagogical Agency in Human-AI Collaborative Video Authoring

arXiv:2602.19623v2 Announce Type: replace-cross Abstract: While advancements in Text-to-Video (T2V) generative AI offer a promising path toward democratizing co

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Golden Layers and Where to Find Them: Improved Knowledge Editing for Large Language Models Via Layer Gradient Analysis

arXiv:2602.20207v2 Announce Type: replace-cross Abstract: Knowledge editing in Large Language Models (LLMs) aims to update the model's prediction for a specific

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization

arXiv:2603.14267v3 Announce Type: replace-cross Abstract: Video dubbing has broad applications in filmmaking, multimedia creation, and assistive speech technolo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

arXiv:2603.15159v4 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have shown strong potential for code generation, yet they remain limited

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

MLLM-based Textual Explanations for Face Comparison

arXiv:2603.16629v3 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) have recently been proposed as a means to generate natural-la

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Modernizing Amdahl's Law: How AI Scaling Laws Shape Computer Architecture

arXiv:2603.20654v2 Announce Type: replace-cross Abstract: Classical Amdahl's Law assumes a fixed decomposition between serial and parallel work and homogeneous

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning

arXiv:2603.21440v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) demonstrate impressive natural language capabilities but often struggle w

Where Digital And Robot-Based AI Agents Now Prevail

Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 3w ago

Where Digital And Robot-Based AI Agents Now Prevail

A company pursuing 'aggressive modeling scenarios' with AI can anticipate 10% growth,

AI Inference Takes Center Stage At KubeCon Europe 2026

Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 3w ago

AI Inference Takes Center Stage At KubeCon Europe 2026

KubeCon Europe 2026 made AI inference its central focus with major CNCF donations including llm-d, Nvidia's GPU DRA driver and a growing AI conformance program.

Techpoint Africa 🧠 Large Language Models ⚡ AI Lesson 3w ago

After dropping out of the university, this Nigerian lady built an AI shopping assistant for Nigerians

In this edition of After Hours, we follow Amina Asu-Beks and how she built an AI-shopping assistant without a technical background or a completed university deg

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

I Built Rosetta: An AI Agent That Turns a Notion Row Into a Personalized Onboarding Experience

New hires don't fail because they're unqualified. They fail because the context is scattered, the answers are buried, and the first week is chaos. I've seen it

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

ARC-AGI-3 Proves AI Still Can't Replace Human Judgment - And That's the Point

Every few months, something drops that cuts through the AI hype and forces the conversation back to reality. This week, that something was ARC-AGI-3. The result

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

terminals were never meant for coding agents

Last week I had 3 agents running. Claude Code in one terminal, Codex in another, OpenCode in a third. I looked away for maybe 10 minutes to read a PR. When I ca

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

I Tested GPT-5.4 vs Claude Opus 4.6 vs Gemini 3.1 Pro on 5 Real Coding Tasks

Why I Ran This Test I use all three models daily for coding. But I've never put them head-to-head on the exact same tasks. So I designed 5 real-world coding cha

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

The Tiny AI Emotion Engine That Makes Your Companion Feel Alive (Meet DiEmo for LivinGrimoire)

🔥 The Tiny AI Emotion Engine That Makes Your Companion Feel Alive (Meet DiEmo for LivinGrimoire) Most AI companions feel either too robotic… or too clingy. Wha

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

The AI landscape is experiencing unprecedented growth and transformation. This post delves into the key developments shaping the future of artificial intelligen