Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,674

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,438 Reads 5,236

Showing 5,236 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Probing to Refine: Reinforcement Distillation of LLMs via Explanatory Inversion

arXiv:2603.19266v1 Announce Type: cross Abstract: Distilling robust reasoning capabilities from large language models (LLMs) into smaller, computationally effic

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Full-Stack Domain Enhancement for Combustion LLMs: Construction and Optimization

arXiv:2603.19268v1 Announce Type: cross Abstract: Large language models (LLMs) in the direction of task adaptation and capability enhancement for professional f

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Human-Centered Workflow for Using Large Language Models in Content Analysis

arXiv:2603.19271v1 Announce Type: cross Abstract: While many researchers use Large Language Models (LLMs) through chat-based access, their real potential lies i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Transformers are Stateless Differentiable Neural Computers

arXiv:2603.19272v1 Announce Type: cross Abstract: Differentiable Neural Computers (DNCs) were introduced as recurrent architectures equipped with an addressable

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CURE: A Multimodal Benchmark for Clinical Understanding and Retrieval Evaluation

arXiv:2603.19274v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) demonstrate considerable potential in clinical diagnostics, a domain

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Improving Automatic Summarization of Radiology Reports through Mid-Training of Large Language Models

arXiv:2603.19275v1 Announce Type: cross Abstract: Automatic summarization of radiology reports is an essential application to reduce the burden on physicians. P

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Flat to Structural: Enhancing Automated Short Answer Grading with GraphRAG

arXiv:2603.19276v1 Announce Type: cross Abstract: Automated short answer grading (ASAG) is critical for scaling educational assessment, yet large language model

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

HypeLoRA: Hyper-Network-Generated LoRA Adapters for Calibrated Language Model Fine-Tuning

arXiv:2603.19278v1 Announce Type: cross Abstract: Modern Transformer-based models frequently suffer from miscalibration, producing overconfident predictions tha

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Feature-Based Models to Generative AI: Validity Evidence for Constructed Response Scoring

arXiv:2603.19280v1 Announce Type: cross Abstract: The rapid advancements in large language models and generative artificial intelligence (AI) capabilities are m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

URAG: A Benchmark for Uncertainty Quantification in Retrieval-Augmented Large Language Models

arXiv:2603.19281v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) has emerged as a widely adopted approach for enhancing LLMs in scenarios

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Framing Effects in Independent-Agent Large Language Models: A Cross-Family Behavioral Analysis

arXiv:2603.19282v1 Announce Type: cross Abstract: In many real-world applications, large language models (LLMs) operate as independent agents without interactio

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CDEoH: Category-Driven Automatic Algorithm Design With Large Language Models

arXiv:2603.19284v1 Announce Type: cross Abstract: With the rapid advancement of large language models (LLMs), LLM-based heuristic search methods have demonstrat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Generalized Stock Price Prediction for Multiple Stocks Combined with News Fusion

arXiv:2603.19286v1 Announce Type: cross Abstract: Predicting stock prices presents challenges in financial forecasting. While traditional approaches such as ARI

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Speculating Experts Accelerates Inference for Mixture-of-Experts

arXiv:2603.19289v1 Announce Type: cross Abstract: Mixture-of-Experts (MoE) models have gained popularity as a means of scaling the capacity of large language mo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Neural Dynamics Self-Attention for Spiking Transformers

arXiv:2603.19290v1 Announce Type: cross Abstract: Integrating Spiking Neural Networks (SNNs) with Transformer architectures offers a promising pathway to balanc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

LLM-MRD: LLM-Guided Multi-View Reasoning Distillation for Fake News Detection

arXiv:2603.19293v1 Announce Type: cross Abstract: Multimodal fake news detection is crucial for mitigating societal disinformation. Existing approaches attempt

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Maximizing mutual information between user-contexts and responses improve LLM personalization with no additional data

arXiv:2603.19294v1 Announce Type: cross Abstract: While post-training has successfully improved large language models (LLMs) across a variety of domains, these

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Parameter-Efficient Token Embedding Editing for Clinical Class-Level Unlearning

arXiv:2603.19302v1 Announce Type: cross Abstract: Machine unlearning is increasingly important for clinical language models, where privacy regulations and insti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Agreement Between Large Language Models, Human Reviewers, and Authors in Evaluating STROBE Checklists for Observational Studies in Rheumatology

arXiv:2603.19303v1 Announce Type: cross Abstract: Introduction: Evaluating compliance with the Strengthening the Reporting of Observational Studies in Epidemiol

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

VERDICT: Verifiable Evolving Reasoning with Directive-Informed Collegial Teams for Legal Judgment Prediction

arXiv:2603.19306v1 Announce Type: cross Abstract: Legal Judgment Prediction (LJP) predicts applicable law articles, charges, and penalty terms from case facts.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MemReward: Graph-Based Experience Memory for LLM Reward Prediction with Limited Labels

arXiv:2603.19310v1 Announce Type: cross Abstract: Training large language models (LLMs) for complex reasoning via reinforcement learning requires reward labels

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

arXiv:2603.19312v1 Announce Type: cross Abstract: Joint Embedding Predictive Architectures (JEPAs) offer a compelling framework for learning world models in com

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Memory-Driven Role-Playing: Evaluation and Enhancement of Persona Knowledge Utilization in LLMs

arXiv:2603.19313v1 Announce Type: cross Abstract: A core challenge for faithful LLM role-playing is sustaining consistent characterization throughout long, open

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Ternary Gamma Semirings: From Neural Implementation to Categorical Foundations

arXiv:2603.19317v1 Announce Type: cross Abstract: This paper establishes a theoretical framework connecting neural network learning with abstract algebraic stru

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Prompt-tuning with Attribute Guidance for Low-resource Entity Matching

arXiv:2603.19321v1 Announce Type: cross Abstract: Entity Matching (EM) is an important task that determines the logical relationship between two entities, such

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A General Deep Learning Framework for Wireless Resource Allocation under Discrete Constraints

arXiv:2603.19322v1 Announce Type: cross Abstract: While deep learning (DL)-based methods have achieved remarkable success in continuous wireless resource alloca

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Goedel-Code-Prover: Hierarchical Proof Search for Open State-of-the-Art Code Verification

arXiv:2603.19329v1 Announce Type: cross Abstract: Large language models (LLMs) can generate plausible code but offer limited guarantees of correctness. Formally

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

POET: Power-Oriented Evolutionary Tuning for LLM-Based RTL PPA Optimization

arXiv:2603.19333v1 Announce Type: cross Abstract: Applying large language models (LLMs) to RTL code optimization for improved power, performance, and area (PPA)

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Do Post-Training Algorithms Actually Differ? A Controlled Study Across Model Scales Uncovers Scale-Dependent Ranking Inversions

arXiv:2603.19335v1 Announce Type: cross Abstract: Post-training alignment has produced dozens of competing algorithms -- DPO, SimPO, KTO, GRPO, and others -- ye

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Diffusion-Guided Semantic Consistency for Multimodal Heterogeneity

arXiv:2603.19337v1 Announce Type: cross Abstract: Federated learning (FL) is severely challenged by non-independent and identically distributed (non-IID) client

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Spectral Tempering for Embedding Compression in Dense Passage Retrieval

arXiv:2603.19339v1 Announce Type: cross Abstract: Dimensionality reduction is critical for deploying dense retrieval systems at scale, yet mainstream post-hoc m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Beyond Weighted Summation: Learnable Nonlinear Aggregation Functions for Robust Artificial Neurons

arXiv:2603.19344v1 Announce Type: cross Abstract: Weighted summation has remained the default input aggregation mechanism in artificial neurons since the earlie

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Scalable Prompt Routing via Fine-Grained Latent Task Discovery

arXiv:2603.19415v1 Announce Type: cross Abstract: Prompt routing dynamically selects the most appropriate large language model from a pool of candidates for eac

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Investigating In-Context Privacy Learning by Integrating User-Facing Privacy Tools into Conversational Agents

arXiv:2603.19416v1 Announce Type: cross Abstract: Supporting users in protecting sensitive information when using conversational agents (CAs) is crucial, as use

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Autonomy Tax: Defense Training Breaks LLM Agents

arXiv:2603.19423v1 Announce Type: cross Abstract: Large language model (LLM) agents increasingly rely on external tools (file operations, API calls, database tr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Is Evaluation Awareness Just Format Sensitivity? Limitations of Probe-Based Evidence under Controlled Prompt Structure

arXiv:2603.19426v1 Announce Type: cross Abstract: Prior work uses linear probes on benchmark prompts as evidence of evaluation awareness in large language model

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

TrustFlow: Topic-Aware Vector Reputation Propagation for Multi-Agent Ecosystems

arXiv:2603.19452v1 Announce Type: cross Abstract: We introduce TrustFlow, a reputation propagation algorithm that assigns each software agent a multi-dimensiona

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Global Convergence of Multiplicative Updates for the Matrix Mechanism: A Collaborative Proof with Gemini 3

arXiv:2603.19465v1 Announce Type: cross Abstract: We analyze a fixed-point iteration $v \leftarrow \phi(v)$ arising in the optimization of a regularized nuclear

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Framework for Formalizing LLM Agent Security

arXiv:2603.19469v1 Announce Type: cross Abstract: Security in LLM agents is inherently contextual. For example, the same action taken by an agent may represent

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

arXiv:2603.19470v1 Announce Type: cross Abstract: Off-policy problems such as policy staleness and training-inference mismatch, has become a major bottleneck fo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Linear Social Choice with Few Queries: A Moment-Based Approach

arXiv:2603.19510v1 Announce Type: cross Abstract: Most social choice rules assume access to full rankings, while current alignment practice -- despite aiming fo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Inducing Sustained Creativity and Diversity in Large Language Models

arXiv:2603.19519v1 Announce Type: cross Abstract: We address a not-widely-recognized subset of exploratory search, where a user sets out on a typically long "se

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Depictions of Depression in Generative AI Video Models: A Preliminary Study of OpenAI's Sora 2

arXiv:2603.19527v1 Announce Type: cross Abstract: Generative video models are increasingly capable of producing complex depictions of mental health experiences,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Plagiarism or Productivity? Students Moral Disengagement and Behavioral Intentions to Use ChatGPT in Academic Writing

arXiv:2603.19549v1 Announce Type: cross Abstract: This study examined how moral disengagement influences Filipino college students' intention to use ChatGPT in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Optimal Scalar Quantization for Matrix Multiplication: Closed-Form Density and Phase Transition

arXiv:2603.19559v1 Announce Type: cross Abstract: We study entrywise scalar quantization of two matrices prior to multiplication. Given $A\in R^{m\times k}$ and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

PFM-VEPAR: Prompting Foundation Models for RGB-Event Camera based Pedestrian Attribute Recognition

arXiv:2603.19565v1 Announce Type: cross Abstract: Event-based pedestrian attribute recognition (PAR) leverages motion cues to enhance RGB cameras in low-light a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

AI Psychosis: Does Conversational AI Amplify Delusion-Related Language?

arXiv:2603.19574v1 Announce Type: cross Abstract: Conversational AI systems are increasingly used for personal reflection and emotional disclosure, raising conc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Evolving Embodied Intelligence: Graph Neural Network--Driven Co-Design of Morphology and Control in Soft Robotics

arXiv:2603.19582v1 Announce Type: cross Abstract: The intelligent behavior of robots does not emerge solely from control systems, but from the tight coupling be