Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,949

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,460 Reads 5,489

Showing 5,489 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Generating Findings for Jaw Cysts in Dental Panoramic Radiographs Using a GPT-Based VLM: A Preliminary Study on Building a Two-Stage Self-Correction Loop with Structured Output (SLSO) Framework

arXiv:2510.02001v4 Announce Type: replace-cross Abstract: Vision-language models (VLMs) such as GPT (Generative Pre-Trained Transformer) have shown potential fo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents

arXiv:2510.14967v2 Announce Type: replace-cross Abstract: Large language model (LLM)-based agents are increasingly trained with reinforcement learning (RL) to e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents

arXiv:2510.15994v2 Announce Type: replace-cross Abstract: The Model Context Protocol (MCP) standardizes how large language model (LLM) agents discover, describe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

GUIrilla: A Scalable Framework for Automated Desktop UI Exploration

arXiv:2510.16051v2 Announce Type: replace-cross Abstract: The performance and generalization of foundation models for interactive systems critically depend on t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Gaze-VLM:Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding

arXiv:2510.21356v2 Announce Type: replace-cross Abstract: Eye gaze offers valuable cues about attention, short-term intent, and future actions, making it a powe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Quantifying Systemic Vulnerability in the Foundation Model Industry

arXiv:2510.23421v2 Announce Type: replace-cross Abstract: The foundation model industry exhibits unprecedented concentration in critical inputs: semiconductors,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs

arXiv:2511.05919v3 Announce Type: replace-cross Abstract: LLMs are now an integral part of information retrieval. As such, their role as question answering chat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Understanding

arXiv:2511.12449v2 Announce Type: replace-cross Abstract: Recent Multimodal Large Language Models (MLLMs) have significantly advanced e-commerce product underst

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

HUMORCHAIN: Theory-Guided Multi-Stage Reasoning for Interpretable Multimodal Humor Generation

arXiv:2511.21732v2 Announce Type: replace-cross Abstract: Humor, as both a creative human activity and a social binding mechanism, has long posed a major challe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Masking Matters: Unlocking the Spatial Reasoning Capabilities of LLMs for 3D Scene-Language Understanding

arXiv:2512.02487v2 Announce Type: replace-cross Abstract: Recent advances in 3D scene-language understanding have leveraged Large Language Models (LLMs) for 3D

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles

arXiv:2512.03454v3 Announce Type: replace-cross Abstract: Interpreting natural-language commands to localize target objects is critical for autonomous driving (

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Arc Gradient Descent: A Geometrically Motivated Gradient Descent-based Optimiser with Phase-Aware, User-Controlled Step Dynamics (proof-of-concept)

arXiv:2512.06737v3 Announce Type: replace-cross Abstract: The paper presents the formulation, implementation, and evaluation of the ArcGD optimiser. The evaluat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Metaphor-based Jailbreak Attacks on Text-to-Image Models

arXiv:2512.10766v2 Announce Type: replace-cross Abstract: Text-to-image (T2I) models commonly incorporate defense mechanisms to prevent the generation of sensit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Schr\"odinger's Navigator: Imagining an Ensemble of Futures for Zero-Shot Object Navigation

arXiv:2512.21201v2 Announce Type: replace-cross Abstract: Zero-shot object navigation (ZSON) requires robots to locate target objects in unseen environments wit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

AI-Generated Code Is Not Reproducible (Yet): An Empirical Study of Dependency Gaps in LLM-Based Coding Agents

arXiv:2512.22387v3 Announce Type: replace-cross Abstract: The rise of Large Language Models (LLMs) as coding agents promises to accelerate software development,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

VLM-CAD: VLM-Optimized Collaborative Agent Design Workflow for Analog Circuit Sizing

arXiv:2601.07315v4 Announce Type: replace-cross Abstract: Vision Language Models (VLMs) have demonstrated remarkable potential in multimodal reasoning, yet they

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Hierarchical Long Video Understanding with Audiovisual Entity Cohesion and Agentic Search

arXiv:2601.13719v2 Announce Type: replace-cross Abstract: Long video understanding presents significant challenges for vision-language models due to extremely l

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Representational Homomorphism Predicts and Improves Compositional Generalization In Transformer Language Model

arXiv:2601.18858v2 Announce Type: replace-cross Abstract: Compositional generalization-the ability to interpret novel combinations of familiar components-remain

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

arXiv:2601.22060v3 Announce Type: replace-cross Abstract: Multimodal large language models (MLLMs) have achieved remarkable success across a broad range of visi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Residual Decoding: Mitigating Hallucinations in Large Vision-Language Models via History-Aware Residual Guidance

arXiv:2602.01047v3 Announce Type: replace-cross Abstract: Large Vision-Language Models (LVLMs) can reason from image-text inputs and perform well in various mul

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

FlyPrompt: Brain-Inspired Random-Expanded Routing with Temporal-Ensemble Experts for General Continual Learning

arXiv:2602.01976v3 Announce Type: replace-cross Abstract: General continual learning (GCL) challenges intelligent systems to learn from single-pass, non-station

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Behavioral Consistency Validation for LLM Agents: An Analysis of Trading-Style Switching through Stock-Market Simulation

arXiv:2602.07023v2 Announce Type: replace-cross Abstract: Recent works have increasingly applied Large Language Models (LLMs) as agents in financial stock marke

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Energy-Aware Reinforcement Learning for Robotic Manipulation of Articulated Components in Infrastructure Operation and Maintenance

arXiv:2602.12288v3 Announce Type: replace-cross Abstract: With the growth of intelligent civil infrastructure and smart cities, operation and maintenance (O&M)

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

KDFlow: A User-Friendly and Efficient Knowledge Distillation Framework for Large Language Models

arXiv:2603.01875v2 Announce Type: replace-cross Abstract: Knowledge distillation (KD) is an essential technique to compress large language models (LLMs) into sm

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Conflict to Consensus: Boosting Medical Reasoning via Multi-Round Agentic RAG

arXiv:2603.03292v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) exhibit high reasoning capacity in medical question-answering, but their

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

When Sensors Fail: Temporal Sequence Models for Robust PPO under Sensor Drift

arXiv:2603.04648v2 Announce Type: replace-cross Abstract: Real-world reinforcement learning systems must operate under distributional drift in their observation

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Human Presence Detection via Wi-Fi Range-Filtered Doppler Spectrum on Commodity Laptops

arXiv:2603.10845v2 Announce Type: replace-cross Abstract: Human Presence Detection (HPD) is key to enable intelligent power management and security features in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL

arXiv:2603.13606v2 Announce Type: replace-cross Abstract: Mixture-of-Experts (MoE) architectures have become essential for scaling large language models, drivin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

SARE: Sample-wise Adaptive Reasoning for Training-free Fine-grained Visual Recognition

arXiv:2603.17729v2 Announce Type: replace-cross Abstract: Recent advances in Large Vision-Language Models (LVLMs) have enabled training-free Fine-Grained Visual

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

EVA: Aligning Video World Models with Executable Robot Actions via Inverse Dynamics Rewards

arXiv:2603.17808v2 Announce Type: replace-cross Abstract: Video generative models are increasingly used as world models for robotics, where a model generates a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Elastic Weight Consolidation Done Right for Continual Learning

arXiv:2603.18596v2 Announce Type: replace-cross Abstract: Weight regularization methods in continual learning (CL) alleviate catastrophic forgetting by assessin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Mi:dm K 2.5 Pro

arXiv:2603.18788v2 Announce Type: replace-cross Abstract: The evolving LLM landscape requires capabilities beyond simple text generation, prioritizing multi-ste

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Children's Intelligence Tests Pose Challenges for MLLMs? KidGym: A 2D Grid-Based Reasoning Benchmark for MLLMs

arXiv:2603.20209v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) combine the linguistic strengths of LLMs with the ability to

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CRoCoDiL: Continuous and Robust Conditioned Diffusion for Language

arXiv:2603.20210v2 Announce Type: replace-cross Abstract: Masked Diffusion Models (MDMs) provide an efficient non-causal alternative to autoregressive generatio

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

An Industrial-Scale Retrieval-Augmented Generation Framework for Requirements Engineering: Empirical Evaluation with Automotive Manufacturing Data

arXiv:2603.20534v2 Announce Type: replace-cross Abstract: Requirements engineering in Industry 4.0 faces critical challenges with heterogeneous, unstructured do

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning

arXiv:2603.20586v2 Announce Type: replace-cross Abstract: As long-context language modeling becomes increasingly important, the cost of maintaining and attendin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

arXiv:2603.21289v2 Announce Type: replace-cross Abstract: Recent progress in multimodal large language models has led to strong performance on reasoning tasks,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DeepXplain: XAI-Guided Autonomous Defense Against Multi-Stage APT Campaigns

arXiv:2603.21296v2 Announce Type: replace-cross Abstract: Advanced Persistent Threats (APTs) are stealthy, multi-stage attacks that require adaptive and timely

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

LLM-Powered Workflow Optimization for Multidisciplinary Software Development: An Automotive Industry Case Study

arXiv:2603.21439v2 Announce Type: replace-cross Abstract: Multidisciplinary Software Development (MSD) requires domain experts and developers to collaborate acr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT

arXiv:2603.21606v2 Announce Type: replace-cross Abstract: Current language model training commonly applies multi-task Supervised Fine-Tuning (SFT) using a homog

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Uncertainty-guided Compositional Alignment with Part-to-Whole Semantic Representativeness in Hyperbolic Vision-Language Models

arXiv:2603.22042v2 Announce Type: replace-cross Abstract: While Vision-Language Models (VLMs) have achieved remarkable performance, their Euclidean embeddings r

OpenAI Discontinues AI Video Gen App Sora

Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago

OpenAI Discontinues AI Video Gen App Sora

OpenAI has quietly shut down Sora, its short-form AI video app that promised to let anyone create viral videos from text prompts, after just six months.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Introducing the OpenAI Safety Bug Bounty program

OpenAI launches a Safety Bug Bounty program to identify AI abuse and safety risks, including agentic vulnerabilities, prompt injection, and data exfiltration.

AI-Native Subdomains Make AI-Ready Websites Without Technical Overhaul

Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago

AI-Native Subdomains Make AI-Ready Websites Without Technical Overhaul

AI agents struggle with modern, content heavy websites. It's slow and expensive to crawl. The markdown standard makes your business discoverable to AI without r

Pentagon’s ‘Attempt to Cripple’ Anthropic Is Troubling, Judge Says

Wired AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Pentagon’s ‘Attempt to Cripple’ Anthropic Is Troubling, Judge Says

During a hearing Tuesday, a district court judge questioned the Department of Defense’s motivations for labeling the Claude AI developer a supply-chain risk.

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Stop Guessing Your API Costs: Track LLM Tokens in Real Time

If you're building with LLMs in 2026, you already know the pain: API costs can spiral fast, and most of the time you have no idea how many tokens you're actuall

TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Anthropic hands Claude Code more control, but keeps it on a leash

Anthropic’s new auto mode for Claude Code lets AI execute tasks with fewer approvals, reflecting a broader shift toward more autonomous tools that balance speed

OpenAI open-sources teen safety policies for developers amid mounting lawsuits over ChatGPT deaths

The Next Web AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago

OpenAI open-sources teen safety policies for developers amid mounting lawsuits over ChatGPT deaths

OpenAI has spent the past year fielding lawsuits from the families of young people who died after extended interactions with ChatGPT. Now it is trying to give t