Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,926
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,466 reads from curated sources

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 3w ago
How to Reduce OpenClaw and Agent Token Costs
Introduction When teams first deploy OpenClaw or custom AI agents, the immediate focus is on capability. Does the agent work? Can it execute the task? But withi
Yoast SEO Blog 🧠 Large Language Models ⚡ AI Lesson 3w ago
Introducing llms.txt to Shopify: Give AI a map to your best products
You’ve worked hard to build your product catalog. The last thing you want is AI tools like ChatGPT or Google Gemini describing your products inaccurately to pot
Search Engine Journal 🧠 Large Language Models ⚡ AI Lesson 3w ago
How To Identify Which LLM Is Actually Working For You [Webinar] via @sejournal, @hethr_campbell
Learn how different LLMs impact conversions in your industry. Do not miss our expert panel webinar for practical advice. The post How To Identify Which LLM Is A
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Multiverse: Language-Conditioned Multi-Game Level Blending via Shared Representation
arXiv:2603.26782v1 Announce Type: new Abstract: Text-to-level generation aims to translate natural language descriptions into structured game levels, enabling i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Concerning Uncertainty -- A Systematic Survey of Uncertainty-Aware XAI
arXiv:2603.26838v1 Announce Type: new Abstract: This paper surveys uncertainty-aware explainable artificial intelligence (UAXAI), examining how uncertainty is i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Neuro-Symbolic Learning for Predictive Process Monitoring via Two-Stage Logic Tensor Networks with Rule Pruning
arXiv:2603.26944v1 Announce Type: new Abstract: Predictive modeling on sequential event data is critical for fraud detection and healthcare monitoring. Existing
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Transparency as Architecture: Structural Compliance Gaps in EU AI Act Article 50 II
arXiv:2603.26983v1 Announce Type: new Abstract: Art. 50 II of the EU Artificial Intelligence Act mandates dual transparency for AI-generated content: outputs mu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
FormalProofBench: Can Models Write Graduate Level Math Proofs That Are Formally Verified?
arXiv:2603.26996v1 Announce Type: new Abstract: We present FormalProofBench, a private benchmark designed to evaluate whether AI models can produce formally ver
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
When Verification Hurts: Asymmetric Effects of Multi-Agent Feedback in Logic Proof Tutoring
arXiv:2603.27076v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used for automated tutoring, but their reliability in structured s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
The Price of Meaning: Why Every Semantic Memory System Forgets
arXiv:2603.27116v1 Announce Type: new Abstract: Every major AI memory system in production today organises information by meaning. That organisation enables gen
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
MediHive: A Decentralized Agent Collective for Medical Reasoning
arXiv:2603.27150v1 Announce Type: new Abstract: Large language models (LLMs) have revolutionized medical reasoning tasks, yet single-agent systems often falter
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
daVinci-LLM:Towards the Science of Pretraining
arXiv:2603.27164v1 Announce Type: new Abstract: The foundational pretraining phase determines a model's capability ceiling, as post-training struggles to overco
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Aligning LLMs with Graph Neural Solvers for Combinatorial Optimization
arXiv:2603.27169v1 Announce Type: new Abstract: Recent research has demonstrated the effectiveness of large language models (LLMs) in solving combinatorial opti
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Quantification of Credal Uncertainty: A Distance-Based Approach
arXiv:2603.27270v1 Announce Type: new Abstract: Credal sets, i.e., closed convex sets of probability measures, provide a natural framework to represent aleatori
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
TokenDance: Token-to-Token Music-to-Dance Generation with Bidirectional Mamba
arXiv:2603.27314v1 Announce Type: new Abstract: Music-to-dance generation has broad applications in virtual reality, dance education, and digital character anim
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
CounterMoral: Editing Morals in Language Models
arXiv:2603.27338v1 Announce Type: new Abstract: Recent advancements in language model technology have significantly enhanced the ability to edit factual informa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Beyond Completion: Probing Cumulative State Tracking to Predict LLM Agent Performance
arXiv:2603.27343v1 Announce Type: new Abstract: Task-completion rate is the standard proxy for LLM agent capability, but models with identical completion scores
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
LLM Readiness Harness: Evaluation, Observability, and CI Gates for LLM/RAG Applications
arXiv:2603.27355v1 Announce Type: new Abstract: We present a readiness harness for LLM and RAG applications that turns evaluation into a deployment decision wor
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Defend: Automated Rebuttals for Peer Review with Minimal Author Guidance
arXiv:2603.27360v1 Announce Type: new Abstract: Rebuttal generation is a critical component of the peer review process for scientific papers, enabling authors t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Heterogeneous Debate Engine: Identity-Grounded Cognitive Architecture for Resilient LLM-Based Ethical Tutoring
arXiv:2603.27404v1 Announce Type: new Abstract: Large Language Models (LLMs) are being increasingly used as autonomous agents in complex reasoning tasks, openin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Greedy Is a Strong Default: Agents as Iterative Optimizers
arXiv:2603.27415v1 Announce Type: new Abstract: Classical optimization algorithms--hill climbing, simulated annealing, population-based methods--generate candid
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
AstraAI: LLMs, Retrieval, and AST-Guided Assistance for HPC Codebases
arXiv:2603.27423v1 Announce Type: new Abstract: We present AstraAI, a command-line interface (CLI) coding framework for high-performance computing (HPC) softwar
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Dual-Stage LLM Framework for Scenario-Centric Semantic Interpretation in Driving Assistance
arXiv:2603.27536v1 Announce Type: new Abstract: Advanced Driver Assistance Systems (ADAS) increasingly rely on learning-based perception, yet safety-relevant fa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
DSevolve: Enabling Real-Time Adaptive Scheduling on Dynamic Shop Floor with LLM-Evolved Heuristic Portfolios
arXiv:2603.27628v1 Announce Type: new Abstract: In dynamic manufacturing environments, disruptions such as machine breakdowns and new order arrivals continuousl
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
SkyNet: Belief-Aware Planning for Partially-Observable Stochastic Games
arXiv:2603.27751v1 Announce Type: new Abstract: In 2019, Google DeepMind released MuZero, a model-based reinforcement learning method that achieves strong resul
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
CARV: A Diagnostic Benchmark for Compositional Analogical Reasoning in Multimodal LLMs
arXiv:2603.27958v1 Announce Type: new Abstract: Analogical reasoning tests a fundamental aspect of human cognition: mapping the relation from one pair of object
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
SARL: Label-Free Reinforcement Learning by Rewarding Reasoning Topology
arXiv:2603.27977v1 Announce Type: new Abstract: Reinforcement learning has become central to improving large reasoning models, but its success still relies heav
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
When Choices Become Priors: Contrastive Decoding for Scientific Figure Multiple-Choice QA
arXiv:2603.28026v1 Announce Type: new Abstract: Scientific figure multiple-choice question answering (MCQA) requires models to reason over diverse visual eviden
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Beyond the Answer: Decoding the Behavior of LLMs as Scientific Reasoners
arXiv:2603.28038v1 Announce Type: new Abstract: As Large Language Models (LLMs) achieve increasingly sophisticated performance on complex reasoning tasks, curre
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Meta-Harness: End-to-End Optimization of Model Harnesses
arXiv:2603.28052v1 Announce Type: new Abstract: The performance of large language model (LLM) systems depends not only on model weights, but also on their harne
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
SLOW: Strategic Logical-inference Open Workspace for Cognitive Adaptation in AI Tutoring
arXiv:2603.28062v1 Announce Type: new Abstract: While Large Language Models (LLMs) have demonstrated remarkable fluency in educational dialogues, most generativ
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reasoning
arXiv:2603.28135v1 Announce Type: new Abstract: Recent test-time reasoning methods improve performance by generating more candidate chains or searching over lar
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
PReD: An LLM-based Foundation Multimodal Model for Electromagnetic Perception, Recognition, and Decision
arXiv:2603.28183v1 Announce Type: new Abstract: Multimodal Large Language Models have demonstrated powerful cross-modal understanding and reasoning capabilities
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
EpiPersona: Persona Projection and Episode Coupling for Pluralistic Preference Modeling
arXiv:2603.28197v1 Announce Type: new Abstract: Pluralistic alignment is essential for adapting large language models (LLMs) to the diverse preferences of indiv
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Reasoning as Energy Minimization over Structured Latent Trajectories
arXiv:2603.28248v1 Announce Type: new Abstract: Single-shot neural decoders commit to answers without iterative refinement, while chain-of-thought methods intro
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Evaluating LLMs for Answering Student Questions in Introductory Programming Courses
arXiv:2603.28295v1 Announce Type: new Abstract: The rapid emergence of Large Language Models (LLMs) presents both opportunities and challenges for programming e
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
CoE: Collaborative Entropy for Uncertainty Quantification in Agentic Multi-LLM Systems
arXiv:2603.28360v1 Announce Type: new Abstract: Uncertainty estimation in multi-LLM systems remains largely single-model-centric: existing methods quantify unce
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Deep Research of Deep Research: From Transformer to Agent, From AI to AI for Science
arXiv:2603.28361v1 Announce Type: new Abstract: With the advancement of large language models (LLMs) in their knowledge base and reasoning capabilities, their i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
COvolve: Adversarial Co-Evolution of Large-Language-Model-Generated Policies and Environments via Two-Player Zero-Sum Game
arXiv:2603.28386v1 Announce Type: new Abstract: A central challenge in building continually improving agents is that training environments are typically static
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
The Scaffold Effect: How Prompt Framing Drives Apparent Multimodal Gains in Clinical VLM Evaluation
arXiv:2603.28387v1 Announce Type: new Abstract: Trustworthy clinical AI requires that performance gains reflect genuine evidence integration rather than surface
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
T-Norm Operators for EU AI Act Compliance Classification: An Empirical Comparison of Lukasiewicz, Product, and G\"odel Semantics in a Neuro-Symbolic Reasoning System
arXiv:2603.28558v1 Announce Type: new Abstract: We present a first comparative pilot study of three t-norm operators -- Lukasiewicz (T_L), Product (T_P), and G\
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models
arXiv:2603.28590v1 Announce Type: new Abstract: Large language models (LLMs) can generate chains of thought (CoTs) that are not always causally responsible for
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Seeing with You: Perception-Reasoning Coevolution for Multimodal Reasoning
arXiv:2603.28618v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) has substantially enhanced the reasoning capabilities of m
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
The Ultimate Tutorial for AI-driven Scale Development in Generative Psychometrics: Releasing AIGENIE from its Bottle
arXiv:2603.28643v1 Announce Type: new Abstract: Psychological scale development has traditionally required extensive expert involvement, iterative revision, and
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Not Search, But Scan: Benchmarking MLLMs on Scan-Oriented Academic Paper Reasoning
arXiv:2603.28651v1 Announce Type: new Abstract: With the rapid progress of multimodal large language models (MLLMs), AI already performs well at literature retr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Exploring Cultural Variations in Moral Judgments with Large Language Models
arXiv:2506.12433v2 Announce Type: cross Abstract: Large Language Models (LLMs) have shown strong performance across many tasks, but their ability to capture cul
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
SimulCost: A Cost-Aware Benchmark and Toolkit for Automating Physics Simulations with LLMs
arXiv:2603.20253v1 Announce Type: cross Abstract: Evaluating LLM agents for scientific tasks has focused on token costs while ignoring tool-use costs like simul
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
M-RAG: Making RAG Faster, Stronger, and More Efficient
arXiv:2603.26667v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) has become a widely adopted paradigm for enhancing the reliability of lar