Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,443

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,387 Reads 5,056

Showing 5,056 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

MemMachine: A Ground-Truth-Preserving Memory System for Personalized AI Agents

arXiv:2604.04853v1 Announce Type: new Abstract: Large Language Model (LLM) agents require persistent memory to maintain personalization, factual continuity, and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

arXiv:2604.04898v1 Announce Type: new Abstract: Proprietary AI systems have recently demonstrated impressive capabilities on complex proof-based problems, with

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LLMs-Healthcare : Current Applications and Challenges of Large Language Models in various Medical Specialties

arXiv:2311.12882v3 Announce Type: cross Abstract: We aim to present a comprehensive overview of the latest advancements in utilizing Large Language Models (LLMs

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

From Concept to Practice: an Automated LLM-aided UVM Machine for RTL Verification

arXiv:2504.19959v3 Announce Type: cross Abstract: Verification presents a major bottleneck in Integrated Circuit (IC) development, consuming nearly 70% of the t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

The Persuasion Paradox: When LLM Explanations Fail to Improve Human-AI Team Performance

arXiv:2604.03237v1 Announce Type: cross Abstract: While natural-language explanations from large language models (LLMs) are widely adopted to improve transparen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Scaling DPPs for RAG: Density Meets Diversity

arXiv:2604.03240v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) enhances Large Language Models (LLMs) by grounding generation in external

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Classifying Problem and Solution Framing in Congressional Social Media

arXiv:2604.03247v1 Announce Type: cross Abstract: Policy setting in the USA according to the ``Garbage Can'' model differentiates between ``problem'' and ``solu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

BLK-Assist: A Methodological Framework for Artist-Led Co-Creation with Generative AI Models

arXiv:2604.03249v1 Announce Type: cross Abstract: This paper presents BLK-Assist, a modular framework for artist-specific fine-tuning of diffusion models using

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Robust LLM Performance Certification via Constrained Maximum Likelihood Estimation

arXiv:2604.03257v1 Announce Type: cross Abstract: The ability to rigorously estimate the failure rates of large language models (LLMs) is a prerequisite for the

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

SoLA: Leveraging Soft Activation Sparsity and Low-Rank Decomposition for Large Language Model Compression

arXiv:2604.03258v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated impressive capabilities across various tasks, but the billion-s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Why Attend to Everything? Focus is the Key

arXiv:2604.03260v1 Announce Type: cross Abstract: We introduce Focus, a method that learns which token pairs matter rather than approximating all of them. Learn

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LPC-SM: Local Predictive Coding and Sparse Memory for Long-Context Language Modeling

arXiv:2604.03263v1 Announce Type: cross Abstract: Most current long-context language models still rely on attention to handle both local interaction and long-ra

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Impact of geophysical fields on Deep Learning-based Lagrangian drift simulations

arXiv:2604.03292v1 Announce Type: cross Abstract: We assess the influence of different Eulerian geophysical input fields on Lagrangian drift simulations using D

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Scaling Teams or Scaling Time? Memory Enabled Lifelong Learning in LLM Multi-Agent Systems

arXiv:2604.03295v1 Announce Type: cross Abstract: Large language model (LLM) multi-agent systems can scale along two distinct dimensions: by increasing the numb

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

3D-IDE: 3D Implicit Depth Emergent

arXiv:2604.03296v1 Announce Type: cross Abstract: Leveraging 3D information within Multimodal Large Language Models (MLLMs) has recently shown significant advan

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

XAttnRes: Cross-Stage Attention Residuals for Medical Image Segmentation

arXiv:2604.03297v1 Announce Type: cross Abstract: In the field of Large Language Models (LLMs), Attention Residuals have recently demonstrated that learned, sel

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Embedding-Only Uplink for Onboard Retrieval Under Shift in Remote Sensing

arXiv:2604.03301v1 Announce Type: cross Abstract: Downlink bottlenecks motivate onboard systems that prioritize hazards without transmitting raw pixels. We stud

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Beyond Static Vision: Scene Dynamic Field Unlocks Intuitive Physics Understanding in Multi-modal Large Language Models

arXiv:2604.03302v1 Announce Type: cross Abstract: While Multimodal Large Language Models (MLLMs) have demonstrated impressive capabilities in image and video un

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Generative Chemical Language Models for Energetic Materials Discovery

arXiv:2604.03304v1 Announce Type: cross Abstract: The discovery of new energetic materials remains a pressing challenge hindered by limited availability of high

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

V-Reflection: Transforming MLLMs from Passive Observers to Active Interrogators

arXiv:2604.03307v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) have achieved remarkable success, yet they remain prone to perception

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

VitaTouch: Property-Aware Vision-Tactile-Language Model for Robotic Quality Inspection in Manufacturing

arXiv:2604.03322v1 Announce Type: cross Abstract: Quality inspection in smart manufacturing requires identifying intrinsic material and surface properties beyon

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

CoLoRSMamba: Conditional LoRA-Steered Mamba for Supervised Multimodal Violence Detection

arXiv:2604.03329v1 Announce Type: cross Abstract: Violence detection benefits from audio, but real-world soundscapes can be noisy or weakly related to the visib

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

The Ideation Bottleneck: Decomposing the Quality Gap Between AI-Generated and Human Economics Research

arXiv:2604.03338v1 Announce Type: cross Abstract: Autonomous AI systems can now generate complete economics research papers, but they substantially underperform

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Learning Additively Compositional Latent Actions for Embodied AI

arXiv:2604.03340v1 Announce Type: cross Abstract: Latent action learning infers pseudo-action labels from visual transitions, providing an approach to leverage

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Towards Intelligent Energy Security: A Unified Spatio-Temporal and Graph Learning Framework for Scalable Electricity Theft Detection in Smart Grids

arXiv:2604.03344v1 Announce Type: cross Abstract: Electricity theft and non-technical losses (NTLs) remain critical challenges in modern smart grids, causing si

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

CresOWLve: Benchmarking Creative Problem-Solving Over Real-World Knowledge

arXiv:2604.03374v1 Announce Type: cross Abstract: Creative problem-solving requires combining multiple cognitive abilities, including logical reasoning, lateral

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Can LLMs Reason About Attention? Towards Zero-Shot Analysis of Multimodal Classroom Behavior

arXiv:2604.03401v1 Announce Type: cross Abstract: Understanding student engagement usually requires time-consuming manual observation or invasive recording that

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Zero-Shot Quantization via Weight-Space Arithmetic

arXiv:2604.03420v1 Announce Type: cross Abstract: We show that robustness to post-training quantization (PTQ) is a transferable direction in weight space. We ca

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Inference-Path Optimization via Circuit Duplication in Frozen Visual Transformers for Marine Species Classification

arXiv:2604.03428v1 Announce Type: cross Abstract: Automated underwater species classification is constrained by annotation cost and environmental variation that

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

MetaSAEs: Joint Training with a Decomposability Penalty Produces More Atomic Sparse Autoencoder Latents

arXiv:2604.03436v1 Announce Type: cross Abstract: Sparse autoencoders (SAEs) are increasingly used for safety-relevant applications including alignment detectio

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Measuring LLM Trust Allocation Across Conflicting Software Artifacts

arXiv:2604.03447v1 Announce Type: cross Abstract: LLM-based software engineering assistants fail not only by producing incorrect outputs, but also by allocating

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Vocabulary Dropout for Curriculum Diversity in LLM Co-Evolution

arXiv:2604.03472v1 Announce Type: cross Abstract: Co-evolutionary self-play, where one language model generates problems and another solves them, promises auton

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Evolutionary Search for Automated Design of Uncertainty Quantification Methods

arXiv:2604.03473v1 Announce Type: cross Abstract: Uncertainty quantification (UQ) methods for large language models are predominantly designed by hand based on

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Large Language Models Align with the Human Brain during Creative Thinking

arXiv:2604.03480v1 Announce Type: cross Abstract: Creative thinking is a fundamental aspect of human cognition, and divergent thinking-the capacity to generate

ArXiv cs.AI 🧠 Large Language Models 📄 Paper 2w ago

Sim2Real-AD: A Modular Sim-to-Real Framework for Deploying VLM-Guided Reinforcement Learning in Real-World Autonomous Driving

arXiv:2604.03497v1 Announce Type: cross Abstract: Deploying reinforcement learning policies trained in simulation to real autonomous vehicles remains a fundamen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Incentives shape how humans co-create with generative AI

arXiv:2604.03529v1 Announce Type: cross Abstract: Generative AI is quickly becoming an integral part of people's everyday workflows. Early evidence has shown th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LangFIR: Discovering Sparse Language-Specific Features from Monolingual Data for Language Steering

arXiv:2604.03532v1 Announce Type: cross Abstract: Large language models (LLMs) show strong multilingual capabilities, yet reliably controlling the language of t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Focus Matters: Phase-Aware Suppression for Hallucination in Vision-Language Models

arXiv:2604.03556v1 Announce Type: cross Abstract: Large Vision-Language Models (LVLMs) have achieved impressive progress in multimodal reasoning, yet they remai

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Unveiling Language Routing Isolation in Multilingual MoE Models for Interpretable Subnetwork Adaptation

arXiv:2604.03592v1 Announce Type: cross Abstract: Mixture-of-Experts (MoE) models exhibit striking performance disparities across languages, yet the internal me

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Neural Global Optimization via Iterative Refinement from Noisy Samples

arXiv:2604.03614v1 Announce Type: cross Abstract: Global optimization of black-box functions from noisy samples is a fundamental challenge in machine learning a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Toward Executable Repository-Level Code Generation via Environment Alignment

arXiv:2604.03622v1 Announce Type: cross Abstract: Large language models (LLMs) have achieved strong performance on code generation, but existing methods still s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Persistent Cross-Attempt State Optimization for Repository-Level Code Generation

arXiv:2604.03632v1 Announce Type: cross Abstract: Large language models (LLMs) have achieved substantial progress in repository-level code generation. However,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

A Generative Foundation Model for Multimodal Histopathology

arXiv:2604.03635v1 Announce Type: cross Abstract: Accurate diagnosis and treatment of complex diseases require integrating histological, molecular, and clinical

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Delayed Homomorphic Reinforcement Learning for Environments with Delayed Feedback

arXiv:2604.03641v1 Announce Type: cross Abstract: Reinforcement learning in real-world systems is often accompanied by delayed feedback, which breaks the Markov

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Stabilizing Unsupervised Self-Evolution of MLLMs via Continuous Softened Retracing reSampling

arXiv:2604.03647v1 Announce Type: cross Abstract: In the unsupervised self-evolution of Multimodal Large Language Models, the quality of feedback signals during

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

AI Appeals Processor: A Deep Learning Approach to Automated Classification of Citizen Appeals in Government Services

arXiv:2604.03672v1 Announce Type: cross Abstract: Government agencies worldwide face growing volumes of citizen appeals, with electronic submissions increasing

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Unlocking Prompt Infilling Capability for Diffusion Language Models

arXiv:2604.03677v1 Announce Type: cross Abstract: Masked diffusion language models (dLMs) generate text through bidirectional denoising, yet this capability rem

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LightThinker++: From Reasoning Compression to Memory Management

arXiv:2604.03679v1 Announce Type: cross Abstract: Large language models (LLMs) excel at complex reasoning, yet their efficiency is limited by the surging cognit