Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,932

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,459 Reads 5,473

Showing 5,473 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

AFSS: Artifact-Focused Self-Synthesis for Mitigating Bias in Audio Deepfake Detection

arXiv:2603.26856v1 Announce Type: cross Abstract: The rapid advancement of generative models has enabled highly realistic audio deepfakes, yet current detectors

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Strategic Candidacy in Generative AI Arenas

arXiv:2603.26891v1 Announce Type: cross Abstract: AI arenas, which rank generative models from pairwise preferences of users, are a popular method for measuring

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Magic Words or Methodical Work? Challenging Conventional Wisdom in LLM-Based Political Text Annotation

arXiv:2603.26898v1 Announce Type: cross Abstract: Political scientists are rapidly adopting large language models (LLMs) for text annotation, yet the sensitivit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Are LLMs Good For Quantum Software, Architecture, and System Design?

arXiv:2603.26904v1 Announce Type: cross Abstract: Quantum computers promise massive computational speedup for problems in many critical domains, such as physics

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Mimetic Alignment with ASPECT: Evaluation of AI-inferred Personal Profiles

arXiv:2603.26922v1 Announce Type: cross Abstract: AI agents that communicate on behalf of individuals need to capture how each person actually communicates, yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

AutoSiMP: Autonomous Topology Optimization from Natural Language via LLM-Driven Problem Configuration and Adaptive Solver Control

arXiv:2603.27000v1 Announce Type: cross Abstract: We present AutoSiMP, an autonomous pipeline that transforms a natural-language structural problem description

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

TAPS: Task Aware Proposal Distributions for Speculative Sampling

arXiv:2603.27027v1 Announce Type: cross Abstract: Speculative decoding accelerates autoregressive generation by letting a lightweight draft model propose future

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Unsupervised Behavioral Compression: Learning Low-Dimensional Policy Manifolds through State-Occupancy Matching

arXiv:2603.27044v1 Announce Type: cross Abstract: Deep Reinforcement Learning (DRL) is widely recognized as sample-inefficient, a limitation attributable in par

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Persona-Based Simulation of Human Opinion at Population Scale

arXiv:2603.27056v1 Announce Type: cross Abstract: What does it mean to model a person, not merely to predict isolated responses, preferences, or behaviors, but

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Debiasing Large Language Models toward Social Factors in Online Behavior Analytics through Prompt Knowledge Tuning

arXiv:2603.27057v1 Announce Type: cross Abstract: Attribution theory explains how individuals interpret and attribute others' behavior in a social context by em

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

arXiv:2603.27064v1 Announce Type: cross Abstract: Understanding charts requires models to jointly reason over geometric visual patterns, structured numerical da

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Voice-based debate with an AI adversary is associated with increased divergent ideation

arXiv:2603.27073v1 Announce Type: cross Abstract: Concerns that interacting with generative AI homogenizes human cognition are largely based on evidence from te

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Sovereign Context Protocol: An Open Attribution Layer for Human-Generated Content in the Age of Large Language Models

arXiv:2603.27094v1 Announce Type: cross Abstract: Large Language Models (LLMs) consume vast quantities of human-generated content for both training and real-tim

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Bayesian-Symbolic Integration for Uncertainty-Aware Parking Prediction

arXiv:2603.27119v1 Announce Type: cross Abstract: Accurate parking availability prediction is critical for intelligent transportation systems, but real-world de

ArXiv cs.AI 🧠 Large Language Models 📄 Paper 4w ago

A Tight Expressivity Hierarchy for GNN-Based Entity Resolution in Master Data Management

arXiv:2603.27154v1 Announce Type: cross Abstract: Entity resolution -- identifying database records that refer to the same real-world entity -- is naturally mod

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

GSR-GNN: Training Acceleration and Memory-Saving Framework of Deep GNNs on Circuit Graph

arXiv:2603.27156v1 Announce Type: cross Abstract: Graph Neural Networks (GNNs) show strong promise for circuit analysis, but scaling to modern large-scale circu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

EuraGovExam: A Multilingual Multimodal Benchmark from Real-World Civil Service Exams

arXiv:2603.27223v1 Announce Type: cross Abstract: We present EuraGovExam, a multilingual and multimodal benchmark sourced from real-world civil service examinat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Diagnosing and Repairing Unsafe Channels in Vision-Language Models via Causal Discovery and Dual-Modal Safety Subspace Projection

arXiv:2603.27240v1 Announce Type: cross Abstract: Large Vision-Language Models (LVLMs) have achieved impressive performance across multimodal understanding and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Zero-shot Vision-Language Reranking for Cross-View Geolocalization

arXiv:2603.27251v1 Announce Type: cross Abstract: Cross-view geolocalization (CVGL) systems, while effective at retrieving a list of relevant candidates (high R

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Amalgam: Hybrid LLM-PGM Synthesis Algorithm for Accuracy and Realism

arXiv:2603.27254v1 Announce Type: cross Abstract: To generate synthetic datasets, e.g., in domains such as healthcare, the literature proposes approaches of two

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

From Foundation ECG Models to NISQ Learners: Distilling ECGFounder into a VQC Student

arXiv:2603.27269v1 Announce Type: cross Abstract: Foundation models have recently improved electrocardiogram (ECG) representation learning, but their deployment

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Codebase-Memory: Tree-Sitter-Based Knowledge Graphs for LLM Code Exploration via MCP

arXiv:2603.27277v1 Announce Type: cross Abstract: Large Language Model (LLM) coding agents typically explore codebases through repeated file-reading and grep-se

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

GUIDE: Guided Updates for In-context Decision Evolution in LLM-Driven Spacecraft Operations

arXiv:2603.27306v1 Announce Type: cross Abstract: Large language models (LLMs) have been proposed as supervisory agents for spacecraft operations, but existing

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Culturally Adaptive Explainable LLM Assessment for Multilingual Information Disorder: A Human-in-the-Loop Approach

arXiv:2603.27356v1 Announce Type: cross Abstract: Recognizing information disorder is difficult because judgments about manipulation depend on cultural and ling

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Conditional Factuality Controlled LLMs with Generalization Certificates via Conformal Sampling

arXiv:2603.27403v1 Announce Type: cross Abstract: Large language models (LLMs) need reliable test-time control of hallucinations. Existing conformal methods for

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

The Geometry of Harmful Intent: Training-Free Anomaly Detection via Angular Deviation in LLM Residual Streams

arXiv:2603.27412v1 Announce Type: cross Abstract: We present LatentBiopsy, a training-free method for detecting harmful prompts by analysing the geometry of res

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

CarbonEdge: Carbon-Aware Deep Learning Inference Framework for Sustainable Edge Computing

arXiv:2603.27420v1 Announce Type: cross Abstract: Deep learning applications at the network edge lead to a significant growth in AI-related carbon emissions, pr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Improving Attributed Long-form Question Answering with Intent Awareness

arXiv:2603.27435v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly being used to generate comprehensive, knowledge-intensive report

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Multi-Agent Dialectical Refinement for Enhanced Argument Classification

arXiv:2603.27451v1 Announce Type: cross Abstract: Argument Mining (AM) is a foundational technology for automated writing evaluation, yet traditional supervised

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

arXiv:2603.27460v1 Announce Type: cross Abstract: Foundation models have demonstrated remarkable success across diverse domains and tasks, primarily due to the

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

TurboAngle: Near-Lossless KV Cache Compression via Uniform Angle Quantization

arXiv:2603.27467v1 Announce Type: cross Abstract: We compress KV cache entries by quantizing angles in the Fast Walsh-Hadamard domain, where a random diagonal r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models

arXiv:2603.27481v1 Announce Type: cross Abstract: Multimodal Continual Instruction Tuning aims to continually enhance Large Vision Language Models (LVLMs) by le

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Difference Feedback: Generating Multimodal Process-Level Supervision for VLM Reinforcement Learning

arXiv:2603.27482v1 Announce Type: cross Abstract: Vision--language models (VLMs) are increasingly aligned via Group Relative Policy Optimization (GRPO)-style tr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Copilot-Assisted Second-Thought Framework for Brain-to-Robot Hand Motion Decoding

arXiv:2603.27492v1 Announce Type: cross Abstract: Motor kinematics prediction (MKP) from electroencephalography (EEG) is an important research area for developi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs

arXiv:2603.27494v1 Announce Type: cross Abstract: To enhance the perception and reasoning capabilities of multimodal large language models in complex visual sce

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Toward Reliable Evaluation of LLM-Based Financial Multi-Agent Systems: Taxonomy, Coordination Primacy, and Cost Awareness

arXiv:2603.27539v1 Announce Type: cross Abstract: Multi-agent systems based on large language models (LLMs) for financial trading have grown rapidly since 2023,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding

arXiv:2603.27593v1 Announce Type: cross Abstract: Recent progress in video large language models (Video-LLMs) has enabled strong offline reasoning over long and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Expert Streaming: Accelerating Low-Batch MoE Inference via Multi-chiplet Architecture and Dynamic Expert Trajectory Scheduling

arXiv:2603.27624v1 Announce Type: cross Abstract: Mixture-of-Experts is a promising approach for edge AI with low-batch inference. Yet, on-device deployments of

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Umwelt Engineering: Designing the Cognitive Worlds of Linguistic Agents

arXiv:2603.27626v1 Announce Type: cross Abstract: I propose Umwelt engineering -- the deliberate design of the linguistic cognitive environment -- as a third la

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

EvA: An Evidence-First Audio Understanding Paradigm for LALMs

arXiv:2603.27667v1 Announce Type: cross Abstract: Large Audio Language Models (LALMs) still struggle in complex acoustic scenes because they often fail to prese

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

LVRPO: Language-Visual Alignment with GRPO for Multimodal Understanding and Generation

arXiv:2603.27693v1 Announce Type: cross Abstract: Unified multimodal pretraining has emerged as a promising paradigm for jointly modeling language and vision wi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

RAP: Retrieve, Adapt, and Prompt-Fit for Training-Free Few-Shot Medical Image Segmentation

arXiv:2603.27705v1 Announce Type: cross Abstract: Few-shot medical image segmentation (FSMIS) has achieved notable progress, yet most existing methods mainly re

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

KVSculpt: KV Cache Compression as Distillation

arXiv:2603.27819v1 Announce Type: cross Abstract: KV cache compression is critical for efficient long-context LLM inference. Approaches that reduce the per-pair

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Kernel Dynamics under Path Entropy Maximization

arXiv:2603.27880v1 Announce Type: cross Abstract: We propose a variational framework in which the kernel function k : X x X -> R, interpreted as the foundationa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

ITQ3_S: High-Fidelity 3-bit LLM Inference via Interleaved Ternary Quantization with Rotation-Domain Smoothing

arXiv:2603.27914v1 Announce Type: cross Abstract: We present \textbf{ITQ3\_S} (Interleaved Ternary Quantization -- Specialized), a novel 3-bit weight quantizati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Adversarial Attacks on Multimodal Large Language Models: A Comprehensive Survey

arXiv:2603.27918v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) integrate information from multiple modalities such as text, images,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

JaWildText: A Benchmark for Vision-Language Models on Japanese Scene Text Understanding

arXiv:2603.27942v1 Announce Type: cross Abstract: Japanese scene text poses challenges that multilingual benchmarks often fail to capture, including mixed scrip

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

CDH-Bench: A Commonsense-Driven Hallucination Benchmark for Evaluating Visual Fidelity in Vision-Language Models

arXiv:2603.27982v1 Announce Type: cross Abstract: Vision-language models (VLMs) achieve strong performance on many benchmarks, yet a basic reliability question