Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,942

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,459 Reads 5,483

Showing 5,483 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

3DCity-LLM: Empowering Multi-modality Large Language Models for 3D City-scale Perception and Understanding

arXiv:2603.23447v1 Announce Type: cross Abstract: While multi-modality large language models excel in object-centric or indoor scenarios, scaling them to 3D cit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs

arXiv:2603.23481v1 Announce Type: cross Abstract: Video-Action Models (VAMs) have emerged as a promising framework for embodied intelligence, learning implicit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

ReqFusion: A Multi-Provider Framework for Automated PEGS Analysis Across Software Domains

arXiv:2603.23482v1 Announce Type: cross Abstract: Requirements engineering is a vital, yet labor-intensive, stage in the software development process. This arti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Failure of contextual invariance in gender inference with large language models

arXiv:2603.23485v1 Announce Type: cross Abstract: Standard evaluation practices assume that large language model (LLM) outputs are stable under contextually equ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions

arXiv:2603.23495v1 Announce Type: cross Abstract: Existing approaches for improving the efficiency of Large Vision-Language Models (LVLMs) are largely based on

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MedObvious: Exposing the Medical Moravec's Paradox in VLMs via Clinical Triage

arXiv:2603.23501v1 Announce Type: cross Abstract: Vision Language Models (VLMs) are increasingly used for tasks like medical report generation and visual questi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

An Accurate and Interpretable Framework for Trustworthy Process Monitoring

arXiv:2302.10426v3 Announce Type: replace Abstract: Trustworthy process monitoring seeks to build an accurate and interpretable monitoring framework, which is c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Towards Self-Evolving Benchmarks: Synthesizing Agent Trajectories via Test-Time Exploration under Validate-by-Reproduce Paradigm

arXiv:2510.00415v3 Announce Type: replace Abstract: Recent advances in large language models (LLMs) and agent system designs have empowered agents with unpreced

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions

arXiv:2510.05318v3 Announce Type: replace Abstract: Large language models (LLMs) have demonstrated remarkable performance on single-turn text-to-SQL tasks, but

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

BuilderBench: The Building Blocks of Intelligent Agents

arXiv:2510.06288v3 Announce Type: replace Abstract: Today's AI models learn primarily through mimicry and refining, so it is not surprising that they struggle t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Hybrid Stackelberg Game and Diffusion-based Auction for Two-tier Agentic AI Task Offloading in Internet of Agents

arXiv:2511.22076v2 Announce Type: replace Abstract: The Internet of Agents (IoA) is rapidly gaining prominence as a foundational architecture for interconnected

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DriveSafe: A Hierarchical Risk Taxonomy for Safety-Critical LLM-Based Driving Assistants

arXiv:2601.12138v3 Announce Type: replace Abstract: Large Language Models (LLMs) are increasingly integrated into vehicle-based digital assistants, where unsafe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents

arXiv:2602.02050v3 Announce Type: replace Abstract: Tool-using agents based on Large Language Models (LLMs) excel in tasks such as mathematical reasoning and mu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Obscure but Effective: Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search

arXiv:2602.22983v3 Announce Type: replace Abstract: As Large Language Models (LLMs) are increasingly used, their security risks have drawn increasing attention.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays

arXiv:2602.23276v2 Announce Type: replace Abstract: Chest X-ray plays a central role in thoracic diagnosis, and its interpretation inherently requires multi-ste

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Agentic AI-based Coverage Closure for Formal Verification

arXiv:2603.03147v2 Announce Type: replace Abstract: Coverage closure is a critical requirement in Integrated Chip (IC) development process and key metric for ve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Retrieval-Augmented Generation with Covariate Time Series

arXiv:2603.04951v2 Announce Type: replace Abstract: While RAG has greatly enhanced LLMs, extending this paradigm to Time-Series Foundation Models (TSFMs) remain

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Planning as Goal Recognition: Deriving Heuristics from Intention Models -- Extended Version

arXiv:2603.14824v2 Announce Type: replace Abstract: Classical planning aims to find a sequence of actions, a plan, that maps a starting state into one of the go

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Cascade-Aware Multi-Agent Routing: Spatio-Temporal Sidecars and Geometry-Switching

arXiv:2603.17112v2 Announce Type: replace Abstract: Advanced AI reasoning systems route tasks through dynamic execution graphs of specialized agents. We identif

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models

arXiv:2603.20670v2 Announce Type: replace Abstract: The rapid growth in the volume, variety, and velocity of geospatial data has created data ecosystems that ar

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A transformer architecture alteration to incentivise externalised reasoning

arXiv:2603.21376v2 Announce Type: replace Abstract: We propose a new architectural change, and post-training pipeline, for making LLMs more verbose reasoners by

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Cerebra: A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment

arXiv:2603.21597v2 Announce Type: replace Abstract: Modern clinical practice increasingly depends on reasoning over heterogeneous, evolving, and incomplete pati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Mapping the Challenges of HCI: An Application and Evaluation of ChatGPT for Mining Insights at Scale

arXiv:2306.05036v5 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly used for analytical tasks, yet their effectiveness in re

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching

arXiv:2406.01825v5 Announce Type: replace-cross Abstract: Machine learning (ML) models are increasingly deployed for virtual screening in drug discovery, where

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Almost Sure Convergence of Linear Temporal Difference Learning with Arbitrary Features

arXiv:2409.12135v3 Announce Type: replace-cross Abstract: Temporal difference (TD) learning with linear function approximation (linear TD) is a classic and powe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Dataset Distillation-based Hybrid Federated Learning on Non-IID Data

arXiv:2409.17517v3 Announce Type: replace-cross Abstract: In federated learning, the heterogeneity of client data has a great impact on the performance of model

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

LOGSAFE: Logic-Guided Verification for Trustworthy Federated Time-Series Learning

arXiv:2411.03231v3 Announce Type: replace-cross Abstract: This paper introduces LOGSAFE, a defense mechanism for federated learning in time series settings, par

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Streaming Attention Approximation via Discrepancy Theory

arXiv:2502.07861v3 Announce Type: replace-cross Abstract: Large language models (LLMs) have achieved impressive success, but their high memory requirements pres

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Multiplicative learning from observation-prediction ratios

arXiv:2503.10144v2 Announce Type: replace-cross Abstract: Additive parameter updates, as used in gradient descent and its adaptive extensions, underpin most mod

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Automating quantum feature map design via large language models

arXiv:2504.07396v2 Announce Type: replace-cross Abstract: Quantum feature maps are a key component of quantum machine learning, encoding classical data into qua

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Leakage and Interpretability in Concept-Based Models

arXiv:2504.14094v3 Announce Type: replace-cross Abstract: Concept-based Models aim to improve interpretability by predicting high-level intermediate concepts, r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

GAIA: A Foundation Model for Operational Atmospheric Dynamics

arXiv:2505.18179v3 Announce Type: replace-cross Abstract: We introduce GAIA (Geospatial Artificial Intelligence for Atmospheres), a hybrid self-supervised geosp

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Generalizable Heuristic Generation Through LLMs with Meta-Optimization

arXiv:2505.20881v2 Announce Type: replace-cross Abstract: Heuristic design with large language models (LLMs) has emerged as a promising approach for tackling co

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CyberGym: Evaluating AI Agents' Real-World Cybersecurity Capabilities at Scale

arXiv:2506.02548v3 Announce Type: replace-cross Abstract: AI agents have significant potential to reshape cybersecurity, making a thorough assessment of their c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Learning The Minimum Action Distance

arXiv:2506.09276v3 Announce Type: replace-cross Abstract: This paper presents a state representation framework for Markov decision processes (MDPs) that can be

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

RedTopic: Toward Topic-Diverse Red Teaming of Large Language Models

arXiv:2507.00026v2 Announce Type: replace-cross Abstract: As large language models (LLMs) are increasingly deployed as black-box components in real-world applic

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Graph Structure Learning with Privacy Guarantees for Open Graph Data

arXiv:2507.19116v3 Announce Type: replace-cross Abstract: Publishing open graph data while preserving individual privacy remains challenging when data publisher

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Product Hilbert Spaces to the Generalized Koopman Operator and the Nonlinear Fundamental Lemma

arXiv:2508.07494v2 Announce Type: replace-cross Abstract: The generalization of the Koopman operator to systems with control input and the derivation of a nonli

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Context to Intent: Reasoning-Guided Function-Level Code Completion

arXiv:2508.09537v2 Announce Type: replace-cross Abstract: The growing capabilities of Large Language Models (LLMs) have led to their widespread adoption for fun

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DreamAudio: Customized Text-to-Audio Generation with Diffusion Models

arXiv:2509.06027v2 Announce Type: replace-cross Abstract: With the development of large-scale diffusion-based and language-modeling-based generative models, imp

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MARS: toward more efficient multi-agent collaboration for LLM reasoning

arXiv:2509.20502v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have achieved impressive results in natural language understanding, yet t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

VL-KnG: Persistent Spatiotemporal Knowledge Graphs from Egocentric Video for Embodied Scene Understanding

arXiv:2510.01483v2 Announce Type: replace-cross Abstract: Vision-language models (VLMs) demonstrate strong image-level scene understanding but often lack persis

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Generating Findings for Jaw Cysts in Dental Panoramic Radiographs Using a GPT-Based VLM: A Preliminary Study on Building a Two-Stage Self-Correction Loop with Structured Output (SLSO) Framework

arXiv:2510.02001v4 Announce Type: replace-cross Abstract: Vision-language models (VLMs) such as GPT (Generative Pre-Trained Transformer) have shown potential fo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents

arXiv:2510.14967v2 Announce Type: replace-cross Abstract: Large language model (LLM)-based agents are increasingly trained with reinforcement learning (RL) to e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents

arXiv:2510.15994v2 Announce Type: replace-cross Abstract: The Model Context Protocol (MCP) standardizes how large language model (LLM) agents discover, describe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

GUIrilla: A Scalable Framework for Automated Desktop UI Exploration

arXiv:2510.16051v2 Announce Type: replace-cross Abstract: The performance and generalization of foundation models for interactive systems critically depend on t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Gaze-VLM:Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding

arXiv:2510.21356v2 Announce Type: replace-cross Abstract: Eye gaze offers valuable cues about attention, short-term intent, and future actions, making it a powe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Quantifying Systemic Vulnerability in the Foundation Model Industry

arXiv:2510.23421v2 Announce Type: replace-cross Abstract: The foundation model industry exhibits unprecedented concentration in critical inputs: semiconductors,