Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,568 lessons

Skills in this topic

5 skills

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,408 Reads 5,160

Showing 5,160 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following

arXiv:2603.25133v1 Announce Type: new Abstract: Rubric-based evaluation has become a prevailing paradigm for evaluating instruction following in large language

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

UniAI-GraphRAG: Synergizing Ontology-Guided Extraction, Multi-Dimensional Clustering, and Dual-Channel Fusion for Robust Multi-Hop Reasoning

arXiv:2603.25152v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) systems face significant challenges in complex reasoning, multi-hop queries

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

arXiv:2603.25158v1 Announce Type: new Abstract: Equipping Large Language Model (LLM) agents with domain-specific skills is critical for tackling complex tasks.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

The Competence Shadow: Theory and Bounds of AI Assistance in Safety Engineering

arXiv:2603.25197v1 Announce Type: new Abstract: As AI assistants become integrated into safety engineering workflows for Physical AI systems, a critical questio

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Probabilistic Abstract Interpretation on Neural Networks via Grids Approximation

arXiv:2603.25266v1 Announce Type: new Abstract: Probabilistic abstract interpretation is a theory used to extract particular properties of a computer program wh

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

SliderQuant: Accurate Post-Training Quantization for LLMs

arXiv:2603.25284v1 Announce Type: new Abstract: In this paper, we address post-training quantization (PTQ) for large language models (LLMs) from an overlooked p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Evaluating Language Models for Harmful Manipulation

arXiv:2603.25326v1 Announce Type: new Abstract: Interest in the concept of AI-driven harmful manipulation is growing, yet current approaches to evaluating it ar

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Macroscopic Characteristics of Mixed Traffic Flow with Deep Reinforcement Learning Based Automated and Human-Driven Vehicles

arXiv:2603.25328v1 Announce Type: new Abstract: Automated Vehicle (AV) control in mixed traffic, where AVs coexist with human-driven vehicles, poses significant

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Agentic Trust Coordination for Federated Learning through Adaptive Thresholding and Autonomous Decision Making in Sustainable and Resilient Industrial Networks

arXiv:2603.25334v1 Announce Type: new Abstract: Distributed intelligence in industrial networks increasingly integrates sensing, communication, and computation

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

4OPS: Structural Difficulty Modeling in Integer Arithmetic Puzzles

arXiv:2603.25356v1 Announce Type: new Abstract: Arithmetic puzzle games provide a controlled setting for studying difficulty in mathematical reasoning tasks, a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models

arXiv:2603.25412v1 Announce Type: new Abstract: Large language models (LLMs) increasingly rely on explicit chain-of-thought (CoT) reasoning to solve complex tas

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation

arXiv:2603.25415v1 Announce Type: new Abstract: Semantic world models enable embodied agents to reason about objects, relations, and spatial context beyond pure

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Cross-Model Disagreement as a Label-Free Correctness Signal

arXiv:2603.25450v1 Announce Type: new Abstract: Detecting when a language model is wrong without ground truth labels is a fundamental challenge for safe deploym

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Retraining as Approximate Bayesian Inference

arXiv:2603.25480v1 Announce Type: new Abstract: Model retraining is usually treated as an ongoing maintenance task. But as Harrison Katz now argues, retraining

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

EcoThink: A Green Adaptive Inference Framework for Sustainable and Accessible Agents

arXiv:2603.25498v1 Announce Type: new Abstract: As the Web transitions from static retrieval to generative interaction, the escalating environmental footprint o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment Performance?

arXiv:2603.25633v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly used in math education not only as problem solvers but also as ass

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning

arXiv:2603.25720v1 Announce Type: new Abstract: Robust perception and reasoning require consistency across sensory modalities. Yet current multimodal models oft

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Back to Basics: Revisiting ASR in the Age of Voice Agents

arXiv:2603.25727v1 Announce Type: new Abstract: Automatic speech recognition (ASR) systems have achieved near-human accuracy on curated benchmarks, yet still fa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

History of generative Artificial Intelligence (AI) chatbots: past, present, and future development

arXiv:2402.05122v1 Announce Type: cross Abstract: This research provides an in-depth comprehensive review of the progress of chatbot technology over time, from

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Malicious LLM-Based Conversational AI Makes Users Reveal Personal Information

arXiv:2506.11680v1 Announce Type: cross Abstract: LLM-based Conversational AIs (CAIs), also known as GenAI chatbots, like ChatGPT, are increasingly used across

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Model2Kernel: Model-Aware Symbolic Execution For Safe CUDA Kernels

arXiv:2603.24595v1 Announce Type: cross Abstract: The widespread adoption of large language models (LLMs) has made GPU-accelerated inference a critical part of

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

X-OPD: Cross-Modal On-Policy Distillation for Capability Alignment in Speech LLMs

arXiv:2603.24596v1 Announce Type: cross Abstract: While the shift from cascaded dialogue systems to end-to-end (E2E) speech Large Language Models (LLMs) improve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

A Learnable SIM Paradigm: Fundamentals, Training Techniques, and Applications

arXiv:2603.24599v1 Announce Type: cross Abstract: Stacked intelligent metasurfaces (SIMs) represent a breakthrough in wireless hardware by comprising multilayer

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

FED-HARGPT: A Hybrid Centralized-Federated Approach of a Transformer-based Architecture for Human Context Recognition

arXiv:2603.24601v1 Announce Type: cross Abstract: The study explores a hybrid centralized-federated approach for Human Activity Recognition (HAR) using a Transf

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

MuViS: Multimodal Virtual Sensing Benchmark

arXiv:2603.24602v1 Announce Type: cross Abstract: Virtual sensing aims to infer hard-to-measure quantities from accessible measurements and is central to percep

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Causal AI For AMS Circuit Design: Interpretable Parameter Effects Analysis

arXiv:2603.24618v1 Announce Type: cross Abstract: Analog-mixed-signal (AMS) circuits are highly non-linear and operate on continuous real-world signals, making

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Sketch2Simulation: Automating Flowsheet Generation via Multi Agent Large Language Models

arXiv:2603.24629v1 Announce Type: cross Abstract: Converting process sketches into executable simulation models remains a major bottleneck in process systems en

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

TRAJEVAL: Decomposing Code Agent Trajectories for Fine-Grained Diagnosis

arXiv:2603.24631v1 Announce Type: cross Abstract: Code agents can autonomously resolve GitHub issues, yet when they fail, current evaluation provides no visibil

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Dual-Graph Multi-Agent Reinforcement Learning for Handover Optimization

arXiv:2603.24634v1 Announce Type: cross Abstract: HandOver (HO) control in cellular networks is governed by a set of HO control parameters that are traditionall

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

DyMRL: Dynamic Multispace Representation Learning for Multimodal Event Forecasting in Knowledge Graph

arXiv:2603.24636v1 Announce Type: cross Abstract: Accurate representation of multimodal knowledge is crucial for event forecasting in real-world scenarios. Howe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Experiential Reflective Learning for Self-Improving LLM Agents

arXiv:2603.24639v1 Announce Type: cross Abstract: Recent advances in large language models (LLMs) have enabled the development of autonomous agents capable of c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Scalable Object Relation Encoding for Better 3D Spatial Reasoning in Large Language Models

arXiv:2603.24721v1 Announce Type: cross Abstract: Spatial reasoning focuses on locating target objects based on spatial relations in 3D scenes, which plays a cr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Decentralized Task Scheduling in Distributed Systems: A Deep Reinforcement Learning Approach

arXiv:2603.24738v1 Announce Type: cross Abstract: Efficient task scheduling in large-scale distributed systems presents significant challenges due to dynamic wo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Grokking as a Falsifiable Finite-Size Transition

arXiv:2603.24746v1 Announce Type: cross Abstract: Grokking -- the delayed onset of generalization after early memorization -- is often described with phase-tran

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset

arXiv:2603.24772v1 Announce Type: cross Abstract: Clinical documentation is a critical factor for patient safety, diagnosis, and continuity of care. The adminis

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

From Untestable to Testable: Metamorphic Testing in the Age of LLMs

arXiv:2603.24774v1 Announce Type: cross Abstract: This article discusses the challenges of testing software systems with increasingly integrated AI and LLM func

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Dissecting Model Failures in Abdominal Aortic Aneurysm Segmentation through Explainability-Driven Analysis

arXiv:2603.24801v1 Announce Type: cross Abstract: Computed tomography image segmentation of complex abdominal aortic aneurysms (AAA) often fails because the mod

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

GoldiCLIP: The Goldilocks Approach for Balancing Explicit Supervision for Language-Image Pretraining

arXiv:2603.24804v1 Announce Type: cross Abstract: Until recently, the success of large-scale vision-language models (VLMs) has primarily relied on billion-sampl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

FODMP: Fast One-Step Diffusion of Movement Primitives Generation for Time-Dependent Robot Actions

arXiv:2603.24806v1 Announce Type: cross Abstract: Diffusion models are increasingly used for robot learning, but current designs face a clear trade-off. Action-

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Generative Adversarial Perturbations with Cross-paradigm Transferability on Localized Crowd Counting

arXiv:2603.24821v1 Announce Type: cross Abstract: State-of-the-art crowd counting and localization are primarily modeled using two paradigms: density maps and p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

arXiv:2603.24844v1 Announce Type: cross Abstract: Given a question, a language model (LM) implicitly encodes a distribution over possible answers. In practice,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

NeuroVLM-Bench: Evaluation of Vision-Enabled Large Language Models for Clinical Reasoning in Neurological Disorders

arXiv:2603.24846v1 Announce Type: cross Abstract: Recent advances in multimodal large language models enable new possibilities for image-based decision support.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective

arXiv:2603.24857v1 Announce Type: cross Abstract: As machine learning (ML) systems expand in both scale and functionality, the security landscape has become inc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

More Than "Means to an End": Supporting Reasoning with Transparently Designed AI Data Science Processes

arXiv:2603.24877v1 Announce Type: cross Abstract: Generative artificial intelligence (AI) tools can now help people perform complex data science tasks regardles

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Surrogates, Spikes, and Sparsity: Performance Analysis and Characterization of SNN Hyperparameters on Hardware

arXiv:2603.24891v1 Announce Type: cross Abstract: Spiking Neural Networks (SNNs) offer inherent advantages for low-power inference through sparse, event-driven

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Sovereign AI at the Front Door of Care: A Physically Unidirectional Architecture for Secure Clinical Intelligence

arXiv:2603.24898v1 Announce Type: cross Abstract: We present a Sovereign AI architecture for clinical triage in which all inference is performed on-device and i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Shaping the Future of Mathematics in the Age of AI

arXiv:2603.24914v1 Announce Type: cross Abstract: Artificial intelligence is transforming mathematics at a speed and scale that demand active engagement from th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago

Evaluating adaptive and generative AI-based feedback and recommendations in a knowledge-graph-integrated programming learning system

arXiv:2603.24940v1 Announce Type: cross Abstract: This paper introduces the design and development of a framework that integrates a large language model (LLM) w