Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,597
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,182 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Dual-Criterion Curriculum Learning: Application to Temporal Data
arXiv:2603.23573v1 Announce Type: cross Abstract: Curriculum Learning (CL) is a meta-learning paradigm that trains a model by feeding the data instances increme
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning
arXiv:2603.23574v1 Announce Type: cross Abstract: Federated Learning (FL), as a popular distributed learning paradigm, has shown outstanding performance in impr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs
arXiv:2603.23575v1 Announce Type: cross Abstract: Today, large language models have demonstrated their strengths in various tasks ranging from reasoning, code g
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Wafer-Level Etch Spatial Profiling for Process Monitoring from Time-Series with Time-LLM
arXiv:2603.23576v1 Announce Type: cross Abstract: Understanding wafer-level spatial variations from in-situ process signals is essential for advanced plasma etc
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
LLMORPH: Automated Metamorphic Testing of Large Language Models
arXiv:2603.23611v1 Announce Type: cross Abstract: Automated testing is essential for evaluating and improving the reliability of Large Language Models (LLMs), y
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
LLMLOOP: Improving LLM-Generated Code and Tests through Automated Iterative Feedback Loops
arXiv:2603.23613v1 Announce Type: cross Abstract: Large Language Models (LLMs) are showing remarkable performance in generating source code, yet the generated c
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
A Theory of LLM Information Susceptibility
arXiv:2603.23626v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly deployed as optimization modules in agentic systems, yet the fun
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Swiss-Bench SBP-002: A Frontier Model Comparison on Swiss Legal and Regulatory Tasks
arXiv:2603.23646v1 Announce Type: cross Abstract: While recent work has benchmarked large language models on Swiss legal translation (Niklaus et al., 2025) and
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Probing Ethical Framework Representations in Large Language Models: Structure, Entanglement, and Methodological Challenges
arXiv:2603.23659v1 Announce Type: cross Abstract: When large language models make ethical judgments, do their internal representations distinguish between norma
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Prototype Fusion: A Training-Free Multi-Layer Approach to OOD Detection
arXiv:2603.23677v1 Announce Type: cross Abstract: Deep learning models are increasingly deployed in safety-critical applications, where reliable out-of-distribu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
PLACID: Privacy-preserving Large language models for Acronym Clinical Inference and Disambiguation
arXiv:2603.23678v1 Announce Type: cross Abstract: Large Language Models (LLMs) offer transformative solutions across many domains, but healthcare integration is
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Assessment Design in the AI Era: A Method for Identifying Items Functioning Differentially for Humans and Chatbots
arXiv:2603.23682v1 Announce Type: cross Abstract: The rapid adoption of large language models (LLMs) in education raises profound challenges for assessment desi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
The Diminishing Returns of Early-Exit Decoding in Modern LLMs
arXiv:2603.23701v1 Announce Type: cross Abstract: In Large Language Model (LLM) inference, early-exit refers to stopping computation at an intermediate layer on
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Self Paced Gaussian Contextual Reinforcement Learning
arXiv:2603.23755v1 Announce Type: cross Abstract: Curriculum learning improves reinforcement learning (RL) efficiency by sequencing tasks from simple to complex
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Human-in-the-Loop Pareto Optimization: Trade-off Characterization for Assist-as-Needed Training and Performance Evaluation
arXiv:2603.23777v1 Announce Type: cross Abstract: During human motor skill training and physical rehabilitation, there is an inherent trade-off between task dif
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models
arXiv:2603.23783v2 Announce Type: cross Abstract: Adapting large-scale foundation models to new domains with limited supervision remains a fundamental challenge
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
The Cognitive Firewall:Securing Browser Based AI Agents Against Indirect Prompt Injection Via Hybrid Edge Cloud Defense
arXiv:2603.23791v1 Announce Type: cross Abstract: Deploying large language models (LLMs) as autonomous browser agents exposes a significant attack surface in th
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Object Search in Partially-Known Environments via LLM-informed Model-based Planning and Prompt Selection
arXiv:2603.23800v1 Announce Type: cross Abstract: We present a novel LLM-informed model-based planning framework, and a novel prompt selection method, for objec
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Deep Neural Regression Collapse
arXiv:2603.23805v1 Announce Type: cross Abstract: Neural Collapse is a phenomenon that helps identify sparse and low rank structures in deep classifiers. Recent
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Willful Disobedience: Automatically Detecting Failures in Agentic Traces
arXiv:2603.23806v1 Announce Type: cross Abstract: AI agents are increasingly embedded in real software systems, where they execute multi-step workflows through
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Perturbation: A simple and efficient adversarial tracer for representation learning in language models
arXiv:2603.23821v1 Announce Type: cross Abstract: Linguistic representation learning in deep neural language models (LMs) has been studied for decades, for both
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Circuit Complexity of Hierarchical Knowledge Tracing and Implications for Log-Precision Transformers
arXiv:2603.23823v1 Announce Type: cross Abstract: Knowledge tracing models mastery over interconnected concepts, often organized by prerequisites. We analyze hi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
PoliticsBench: Benchmarking Political Values in Large Language Models with Multi-Turn Roleplay
arXiv:2603.23841v1 Announce Type: cross Abstract: While Large Language Models (LLMs) are increasingly used as primary sources of information, their potential fo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Generative AI User Experience: Developing Human--AI Epistemic Partnership
arXiv:2603.23863v1 Announce Type: cross Abstract: Generative AI (GenAI) has rapidly entered education, yet its user experience is often explained through adopti
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Can VLMs Reason Robustly? A Neuro-Symbolic Investigation
arXiv:2603.23867v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) have been applied to a wide range of reasoning tasks, yet it remains unclear whe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation
arXiv:2603.23871v1 Announce Type: cross Abstract: Large language models trained with reinforcement learning (RL) for mathematical reasoning face a fundamental c
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
The Luna Bound Propagator for Formal Analysis of Neural Networks
arXiv:2603.23878v1 Announce Type: cross Abstract: The parameterized CROWN analysis, a.k.a., alpha-CROWN, has emerged as a practically successful bound propagati
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Latent Bias Alignment for High-Fidelity Diffusion Inversion in Real-World Image Reconstruction and Manipulation
arXiv:2603.23903v1 Announce Type: cross Abstract: Recent research has shown that text-to-image diffusion models are capable of generating high-quality images gu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Self-Distillation for Multi-Token Prediction
arXiv:2603.23911v1 Announce Type: cross Abstract: As Large Language Models (LLMs) scale up, inference efficiency becomes a critical bottleneck. Multi-Token Pred
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
DecepGPT: Schema-Driven Deception Detection with Multicultural Datasets and Robust Multimodal Learning
arXiv:2603.23916v1 Announce Type: cross Abstract: Multimodal deception detection aims to identify deceptive behavior by analyzing audiovisual cues for forensics
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Policy-Guided Threat Hunting: An LLM enabled Framework with Splunk SOC Triage
arXiv:2603.23966v1 Announce Type: cross Abstract: With frequently evolving Advanced Persistent Threats (APTs) in cyberspace, traditional security solutions appr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More
arXiv:2603.23971v1 Announce Type: cross Abstract: Developers and consumers increasingly choose reasoning language models (RLMs) based on their listed API prices
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
From Untamed Black Box to Interpretable Pedagogical Orchestration: The Ensemble of Specialized LLMs Architecture for Adaptive Tutoring
arXiv:2603.23990v1 Announce Type: cross Abstract: Monolithic Large Language Models (LLMs) used in educational dialogue often behave as "black boxes," where peda
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Understanding the Challenges in Iterative Generative Optimization with LLMs
arXiv:2603.23994v1 Announce Type: cross Abstract: Generative optimization uses large language models (LLMs) to iteratively improve artifacts (such as code, work
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Schema on the Inside: A Two-Phase Fine-Tuning Method for High-Efficiency Text-to-SQL at Scale
arXiv:2603.24023v1 Announce Type: cross Abstract: Applying large, proprietary API-based language models to text-to-SQL tasks poses a significant industry challe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs
arXiv:2603.24034v1 Announce Type: cross Abstract: Contextual automatic speech recognition (ASR) with Speech-LLMs is typically trained with oracle conversation h
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Mitigating Object Hallucinations in LVLMs via Attention Imbalance Rectification
arXiv:2603.24058v1 Announce Type: cross Abstract: Object hallucination in Large Vision-Language Models (LVLMs) severely compromises their reliability in real-wo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm
arXiv:2603.24079v1 Announce Type: cross Abstract: Recently, multimodal large language models (MLLMs) have emerged as a unified paradigm for language and image g
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Knowledge-Guided Manipulation Using Multi-Task Reinforcement Learning
arXiv:2603.24083v1 Announce Type: cross Abstract: This paper introduces Knowledge Graph based Massively Multi-task Model-based Policy Optimization (KG-M3PO), a
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization
arXiv:2603.24093v1 Announce Type: cross Abstract: Recently, reinforcement learning~(RL) has become an important approach for improving the capabilities of large
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty Estimation
arXiv:2603.24124v1 Announce Type: cross Abstract: RLHF-aligned language models exhibit response homogenization: on TruthfulQA (n=790), 40-79% of questions produ
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare
arXiv:2603.24132v1 Announce Type: cross Abstract: Conversational artificial intelligence has the potential to assist users in preliminary medical consultations,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula
arXiv:2603.24202v1 Announce Type: cross Abstract: Reinforcement learning (RL) has emerged as a powerful paradigm for improving large language models beyond supe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Invisible Threats from Model Context Protocol: Generating Stealthy Injection Payload via Tree-based Adaptive Search
arXiv:2603.24203v1 Announce Type: cross Abstract: Recent advances in the Model Context Protocol (MCP) have enabled large language models (LLMs) to invoke extern
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Powerful Teachers Matter: Text-Guided Multi-view Knowledge Distillation with Visual Prior Enhancement
arXiv:2603.24208v1 Announce Type: cross Abstract: Knowledge distillation transfers knowledge from large teacher models to smaller students for efficient inferen
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Uncovering Memorization in Timeseries Imputation models: LBRM Membership Inference and its link to attribute Leakage
arXiv:2603.24213v1 Announce Type: cross Abstract: Deep learning models for time series imputation are now essential in fields such as healthcare, the Internet o
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias
arXiv:2603.24218v1 Announce Type: cross Abstract: Large Language Models (LLMs) enhanced with Retrieval-Augmented Generation (RAG) have achieved substantial impr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Environment-Grounded Multi-Agent Workflow for Autonomous Penetration Testing
arXiv:2603.24221v1 Announce Type: cross Abstract: The increasing complexity and interconnectivity of digital infrastructures make scalable and reliable security