Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,949
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,489 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Generating Findings for Jaw Cysts in Dental Panoramic Radiographs Using a GPT-Based VLM: A Preliminary Study on Building a Two-Stage Self-Correction Loop with Structured Output (SLSO) Framework
arXiv:2510.02001v4 Announce Type: replace-cross Abstract: Vision-language models (VLMs) such as GPT (Generative Pre-Trained Transformer) have shown potential fo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents
arXiv:2510.14967v2 Announce Type: replace-cross Abstract: Large language model (LLM)-based agents are increasingly trained with reinforcement learning (RL) to e
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents
arXiv:2510.15994v2 Announce Type: replace-cross Abstract: The Model Context Protocol (MCP) standardizes how large language model (LLM) agents discover, describe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
GUIrilla: A Scalable Framework for Automated Desktop UI Exploration
arXiv:2510.16051v2 Announce Type: replace-cross Abstract: The performance and generalization of foundation models for interactive systems critically depend on t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Gaze-VLM:Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding
arXiv:2510.21356v2 Announce Type: replace-cross Abstract: Eye gaze offers valuable cues about attention, short-term intent, and future actions, making it a powe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Quantifying Systemic Vulnerability in the Foundation Model Industry
arXiv:2510.23421v2 Announce Type: replace-cross Abstract: The foundation model industry exhibits unprecedented concentration in critical inputs: semiconductors,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs
arXiv:2511.05919v3 Announce Type: replace-cross Abstract: LLMs are now an integral part of information retrieval. As such, their role as question answering chat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Understanding
arXiv:2511.12449v2 Announce Type: replace-cross Abstract: Recent Multimodal Large Language Models (MLLMs) have significantly advanced e-commerce product underst
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
HUMORCHAIN: Theory-Guided Multi-Stage Reasoning for Interpretable Multimodal Humor Generation
arXiv:2511.21732v2 Announce Type: replace-cross Abstract: Humor, as both a creative human activity and a social binding mechanism, has long posed a major challe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Masking Matters: Unlocking the Spatial Reasoning Capabilities of LLMs for 3D Scene-Language Understanding
arXiv:2512.02487v2 Announce Type: replace-cross Abstract: Recent advances in 3D scene-language understanding have leveraged Large Language Models (LLMs) for 3D
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles
arXiv:2512.03454v3 Announce Type: replace-cross Abstract: Interpreting natural-language commands to localize target objects is critical for autonomous driving (
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Arc Gradient Descent: A Geometrically Motivated Gradient Descent-based Optimiser with Phase-Aware, User-Controlled Step Dynamics (proof-of-concept)
arXiv:2512.06737v3 Announce Type: replace-cross Abstract: The paper presents the formulation, implementation, and evaluation of the ArcGD optimiser. The evaluat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Metaphor-based Jailbreak Attacks on Text-to-Image Models
arXiv:2512.10766v2 Announce Type: replace-cross Abstract: Text-to-image (T2I) models commonly incorporate defense mechanisms to prevent the generation of sensit
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Schr\"odinger's Navigator: Imagining an Ensemble of Futures for Zero-Shot Object Navigation
arXiv:2512.21201v2 Announce Type: replace-cross Abstract: Zero-shot object navigation (ZSON) requires robots to locate target objects in unseen environments wit
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
AI-Generated Code Is Not Reproducible (Yet): An Empirical Study of Dependency Gaps in LLM-Based Coding Agents
arXiv:2512.22387v3 Announce Type: replace-cross Abstract: The rise of Large Language Models (LLMs) as coding agents promises to accelerate software development,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
VLM-CAD: VLM-Optimized Collaborative Agent Design Workflow for Analog Circuit Sizing
arXiv:2601.07315v4 Announce Type: replace-cross Abstract: Vision Language Models (VLMs) have demonstrated remarkable potential in multimodal reasoning, yet they
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Hierarchical Long Video Understanding with Audiovisual Entity Cohesion and Agentic Search
arXiv:2601.13719v2 Announce Type: replace-cross Abstract: Long video understanding presents significant challenges for vision-language models due to extremely l
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Representational Homomorphism Predicts and Improves Compositional Generalization In Transformer Language Model
arXiv:2601.18858v2 Announce Type: replace-cross Abstract: Compositional generalization-the ability to interpret novel combinations of familiar components-remain
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models
arXiv:2601.22060v3 Announce Type: replace-cross Abstract: Multimodal large language models (MLLMs) have achieved remarkable success across a broad range of visi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Residual Decoding: Mitigating Hallucinations in Large Vision-Language Models via History-Aware Residual Guidance
arXiv:2602.01047v3 Announce Type: replace-cross Abstract: Large Vision-Language Models (LVLMs) can reason from image-text inputs and perform well in various mul
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
FlyPrompt: Brain-Inspired Random-Expanded Routing with Temporal-Ensemble Experts for General Continual Learning
arXiv:2602.01976v3 Announce Type: replace-cross Abstract: General continual learning (GCL) challenges intelligent systems to learn from single-pass, non-station
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Behavioral Consistency Validation for LLM Agents: An Analysis of Trading-Style Switching through Stock-Market Simulation
arXiv:2602.07023v2 Announce Type: replace-cross Abstract: Recent works have increasingly applied Large Language Models (LLMs) as agents in financial stock marke
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Energy-Aware Reinforcement Learning for Robotic Manipulation of Articulated Components in Infrastructure Operation and Maintenance
arXiv:2602.12288v3 Announce Type: replace-cross Abstract: With the growth of intelligent civil infrastructure and smart cities, operation and maintenance (O&M)
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
KDFlow: A User-Friendly and Efficient Knowledge Distillation Framework for Large Language Models
arXiv:2603.01875v2 Announce Type: replace-cross Abstract: Knowledge distillation (KD) is an essential technique to compress large language models (LLMs) into sm
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
From Conflict to Consensus: Boosting Medical Reasoning via Multi-Round Agentic RAG
arXiv:2603.03292v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) exhibit high reasoning capacity in medical question-answering, but their
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
When Sensors Fail: Temporal Sequence Models for Robust PPO under Sensor Drift
arXiv:2603.04648v2 Announce Type: replace-cross Abstract: Real-world reinforcement learning systems must operate under distributional drift in their observation
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Human Presence Detection via Wi-Fi Range-Filtered Doppler Spectrum on Commodity Laptops
arXiv:2603.10845v2 Announce Type: replace-cross Abstract: Human Presence Detection (HPD) is key to enable intelligent power management and security features in
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL
arXiv:2603.13606v2 Announce Type: replace-cross Abstract: Mixture-of-Experts (MoE) architectures have become essential for scaling large language models, drivin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
SARE: Sample-wise Adaptive Reasoning for Training-free Fine-grained Visual Recognition
arXiv:2603.17729v2 Announce Type: replace-cross Abstract: Recent advances in Large Vision-Language Models (LVLMs) have enabled training-free Fine-Grained Visual
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
EVA: Aligning Video World Models with Executable Robot Actions via Inverse Dynamics Rewards
arXiv:2603.17808v2 Announce Type: replace-cross Abstract: Video generative models are increasingly used as world models for robotics, where a model generates a
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Elastic Weight Consolidation Done Right for Continual Learning
arXiv:2603.18596v2 Announce Type: replace-cross Abstract: Weight regularization methods in continual learning (CL) alleviate catastrophic forgetting by assessin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Mi:dm K 2.5 Pro
arXiv:2603.18788v2 Announce Type: replace-cross Abstract: The evolving LLM landscape requires capabilities beyond simple text generation, prioritizing multi-ste
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Children's Intelligence Tests Pose Challenges for MLLMs? KidGym: A 2D Grid-Based Reasoning Benchmark for MLLMs
arXiv:2603.20209v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) combine the linguistic strengths of LLMs with the ability to
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
CRoCoDiL: Continuous and Robust Conditioned Diffusion for Language
arXiv:2603.20210v2 Announce Type: replace-cross Abstract: Masked Diffusion Models (MDMs) provide an efficient non-causal alternative to autoregressive generatio
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
An Industrial-Scale Retrieval-Augmented Generation Framework for Requirements Engineering: Empirical Evaluation with Automotive Manufacturing Data
arXiv:2603.20534v2 Announce Type: replace-cross Abstract: Requirements engineering in Industry 4.0 faces critical challenges with heterogeneous, unstructured do
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning
arXiv:2603.20586v2 Announce Type: replace-cross Abstract: As long-context language modeling becomes increasingly important, the cost of maintaining and attendin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning
arXiv:2603.21289v2 Announce Type: replace-cross Abstract: Recent progress in multimodal large language models has led to strong performance on reasoning tasks,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
DeepXplain: XAI-Guided Autonomous Defense Against Multi-Stage APT Campaigns
arXiv:2603.21296v2 Announce Type: replace-cross Abstract: Advanced Persistent Threats (APTs) are stealthy, multi-stage attacks that require adaptive and timely
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
LLM-Powered Workflow Optimization for Multidisciplinary Software Development: An Automotive Industry Case Study
arXiv:2603.21439v2 Announce Type: replace-cross Abstract: Multidisciplinary Software Development (MSD) requires domain experts and developers to collaborate acr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT
arXiv:2603.21606v2 Announce Type: replace-cross Abstract: Current language model training commonly applies multi-task Supervised Fine-Tuning (SFT) using a homog
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Uncertainty-guided Compositional Alignment with Part-to-Whole Semantic Representativeness in Hyperbolic Vision-Language Models
arXiv:2603.22042v2 Announce Type: replace-cross Abstract: While Vision-Language Models (VLMs) have achieved remarkable performance, their Euclidean embeddings r
OpenAI Discontinues AI Video Gen App Sora
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
OpenAI Discontinues AI Video Gen App Sora
OpenAI has quietly shut down Sora, its short-form AI video app that promised to let anyone create viral videos from text prompts, after just six months.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Introducing the OpenAI Safety Bug Bounty program
OpenAI launches a Safety Bug Bounty program to identify AI abuse and safety risks, including agentic vulnerabilities, prompt injection, and data exfiltration.
AI-Native Subdomains Make AI-Ready Websites Without Technical Overhaul
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
AI-Native Subdomains Make AI-Ready Websites Without Technical Overhaul
AI agents struggle with modern, content heavy websites. It's slow and expensive to crawl. The markdown standard makes your business discoverable to AI without r
Pentagon’s ‘Attempt to Cripple’ Anthropic Is Troubling, Judge Says
Wired AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Pentagon’s ‘Attempt to Cripple’ Anthropic Is Troubling, Judge Says
During a hearing Tuesday, a district court judge questioned the Department of Defense’s motivations for labeling the Claude AI developer a supply-chain risk.
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Stop Guessing Your API Costs: Track LLM Tokens in Real Time
If you're building with LLMs in 2026, you already know the pain: API costs can spiral fast, and most of the time you have no idea how many tokens you're actuall
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Anthropic hands Claude Code more control, but keeps it on a leash
Anthropic’s new auto mode for Claude Code lets AI execute tasks with fewer approvals, reflecting a broader shift toward more autonomous tools that balance speed
OpenAI open-sources teen safety policies for developers amid mounting lawsuits over ChatGPT deaths
The Next Web AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
OpenAI open-sources teen safety policies for developers amid mounting lawsuits over ChatGPT deaths
OpenAI has spent the past year fielding lawsuits from the families of young people who died after extended interactions with ChatGPT. Now it is trying to give t