Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,534 lessons

Skills in this topic

5 skills

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,407 Reads 5,127

Showing 5,127 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Evaluating and Understanding Scheming Propensity in LLM Agents

arXiv:2603.01608v2 Announce Type: replace Abstract: As frontier language models are increasingly deployed as autonomous agents pursuing complex, long-term objec

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Seed1.8 Model Card: Towards Generalized Real-World Agency

arXiv:2603.20633v2 Announce Type: replace Abstract: We present Seed1.8, a foundation model aimed at generalized real-world agency: going beyond single-turn pred

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks

arXiv:2603.21636v2 Announce Type: replace Abstract: Public benchmarks increasingly govern how large language models (LLMs) are ranked, selected, and deployed. W

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Continual Graph Learning: A Survey

arXiv:2301.12230v2 Announce Type: replace-cross Abstract: Continual Graph Learning (CGL) enables models to incrementally learn from streaming graph-structured d

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Deep Neural Networks: A Formulation Via Non-Archimedean Analysis

arXiv:2402.00094v2 Announce Type: replace-cross Abstract: We introduce a new class of deep neural networks (DNNs) with multilayered tree-like architectures. The

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ

arXiv:2402.11877v2 Announce Type: replace-cross Abstract: Reinforcement learning has witnessed significant advancements, particularly with the emergence of mode

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Bidirectional Multimodal Prompt Learning with Scale-Aware Training for Few-Shot Multi-Class Anomaly Detection

arXiv:2408.13516v2 Announce Type: replace-cross Abstract: Few-shot multi-class anomaly detection is crucial in real industrial settings, where only a few normal

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Explainable AI needs formalization

arXiv:2409.14590v5 Announce Type: replace-cross Abstract: The field of "explainable artificial intelligence" (XAI) seemingly addresses the desire that decisions

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Recent Advances of Multimodal Continual Learning: A Comprehensive Survey

arXiv:2410.05352v3 Announce Type: replace-cross Abstract: Continual learning (CL) aims to empower machine learning models to learn continually from new data, wh

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Gradient Compression Beyond Low-Rank: Wavelet Subspaces Compact Optimizer States

arXiv:2501.07237v4 Announce Type: replace-cross Abstract: Large language models (LLMs) have shown impressive performance across a range of natural language proc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

A Survey of Zero-Knowledge Proof Based Verifiable Machine Learning

arXiv:2502.18535v2 Announce Type: replace-cross Abstract: Machine learning is increasingly deployed through outsourced and cloud-based pipelines, which improve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs

arXiv:2503.05371v3 Announce Type: replace-cross Abstract: We present a novel approach to bias mitigation in large language models (LLMs) by applying steering ve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text

arXiv:2504.19467v4 Announce Type: replace-cross Abstract: Large language models (LLMs) hold great promise for medical applications and are evolving rapidly, wit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models

arXiv:2505.03821v2 Announce Type: replace-cross Abstract: We investigate the ability of Vision Language Models (VLMs) to perform visual perspective taking using

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Self-Bootstrapping Automated Program Repair: Using LLMs to Generate and Evaluate Synthetic Training Data for Bug Repair

arXiv:2505.07372v2 Announce Type: replace-cross Abstract: This paper presents a novel methodology for enhancing Automated Program Repair (APR) through synthetic

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Structured Agent Distillation for Large Language Model

arXiv:2505.13820v4 Announce Type: replace-cross Abstract: Large language models (LLMs) exhibit strong capabilities as decision-making agents by interleaving rea

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

VLM-SAFE: Vision-Language Model-Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving

arXiv:2505.16377v2 Announce Type: replace-cross Abstract: Autonomous driving policy learning with reinforcement learning (RL) is fundamentally limited by low sa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification

arXiv:2506.04450v5 Announce Type: replace-cross Abstract: Large Language Models (LLMs) are increasingly adopted across domains such as education, healthcare, an

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights

arXiv:2506.17337v4 Announce Type: replace-cross Abstract: Vision Language Models (VLMs) have shown promise in automating image diagnosis and interpretation in c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Multi-Sample Prompting and Actor-Critic Prompt Optimization for Diverse Synthetic Data Generation

arXiv:2506.21138v2 Announce Type: replace-cross Abstract: High-quality labeled datasets are fundamental for training and evaluating machine learning models, yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models

arXiv:2508.02343v2 Announce Type: replace-cross Abstract: Quantization significantly accelerates inference in large language models (LLMs) by replacing original

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series Forecasting

arXiv:2508.13773v3 Announce Type: replace-cross Abstract: Despite advances in the Transformer architecture, their effectiveness for long-term time series foreca

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

AirQA: A Comprehensive QA Dataset for AI Research with Instance-Level Evaluation

arXiv:2509.16952v2 Announce Type: replace-cross Abstract: The growing volume of academic papers has made it increasingly difficult for researchers to efficientl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Multi-View Attention Multiple-Instance Learning Enhanced by LLM Reasoning for Cognitive Distortion Detection

arXiv:2509.17292v2 Announce Type: replace-cross Abstract: Cognitive distortions have been closely linked to mental health disorders, yet their automatic detecti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Advancing Few-Shot Pediatric Arrhythmia Classification with a Novel Contrastive Loss and Multimodal Learning

arXiv:2509.19315v2 Announce Type: replace-cross Abstract: Arrhythmias are a major cause of sudden cardiac death in children, making automated rhythm classificat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Dual-Space Smoothness for Robust and Balanced LLM Unlearning

arXiv:2509.23362v2 Announce Type: replace-cross Abstract: As large language models evolve, Machine Unlearning has emerged to address growing concerns around use

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models

arXiv:2509.25848v3 Announce Type: replace-cross Abstract: Reasoning has emerged as a pivotal capability in Large Language Models (LLMs). Through Reinforcement L

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

arXiv:2510.04618v3 Announce Type: replace-cross Abstract: Large language model (LLM) applications such as agents and domain-specific reasoning increasingly rely

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling

arXiv:2510.05825v2 Announce Type: replace-cross Abstract: Inference-Time Scaling (ITS) improves language models by allocating more computation at generation tim

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language Navigation

arXiv:2510.08553v2 Announce Type: replace-cross Abstract: Vision-and-Language Navigation (VLN) requires agents to follow natural language instructions through e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

CLMN: Concept based Language Models via Neural Symbolic Reasoning

arXiv:2510.10063v2 Announce Type: replace-cross Abstract: Deep learning has advanced NLP, but interpretability remains limited, especially in healthcare and fin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Schema for In-Context Learning

arXiv:2510.13905v3 Announce Type: replace-cross Abstract: In-Context Learning (ICL) enables transformer-based language models to adapt to new tasks by condition

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

ProofBridge: Auto-Formalization of Natural Language Proofs in Lean via Joint Embeddings

arXiv:2510.15681v3 Announce Type: replace-cross Abstract: Translating human-written mathematical theorems and proofs from natural language (NL) into formal lang

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models

arXiv:2510.20351v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly exposed to data contamination, i.e., performance gains d

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning

arXiv:2510.25311v2 Announce Type: replace-cross Abstract: Reinforcement Learning algorithms are primarily focused on learning a policy that maximizes expected r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks

arXiv:2511.10465v2 Announce Type: replace-cross Abstract: While prompt optimization has emerged as a critical technique for enhancing language model performance

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation

arXiv:2511.11483v4 Announce Type: replace-cross Abstract: Recent text-to-image (T2I) models have made remarkable progress in generating visually realistic and s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Scaling Spatial Intelligence with Multimodal Foundation Models

arXiv:2511.13719v4 Announce Type: replace-cross Abstract: Despite remarkable progress, multimodal foundation models still exhibit surprising deficiencies in spa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Object-Centric World Models for Causality-Aware Reinforcement Learning

arXiv:2511.14262v3 Announce Type: replace-cross Abstract: World models have been developed to support sample-efficient deep reinforcement learning agents. Howev

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

SciEGQA: A Dataset for Scientific Evidence-Grounded Question Answering and Reasoning

arXiv:2511.15090v2 Announce Type: replace-cross Abstract: Scientific documents contain complex multimodal structures, which makes evidence localization and scie

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Towards Hyper-Efficient RAG Systems in VecDBs: Distributed Parallel Multi-Resolution Vector Search

arXiv:2511.16681v2 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) systems have become a dominant approach to augment large language

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

UniGame: Turning a Unified Multimodal Model Into Its Own Adversary

arXiv:2511.19413v3 Announce Type: replace-cross Abstract: Unified Multimodal Models (UMMs) have shown impressive performance in both understanding and generatio

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings

arXiv:2511.21428v2 Announce Type: replace-cross Abstract: We present a novel unsupervised framework to unlock vast unlabeled human demonstration data from conti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Single-Round Scalable Analytic Federated Learning

arXiv:2512.03336v2 Announce Type: replace-cross Abstract: Federated Learning (FL) is plagued by two key challenges: high communication overhead and performance

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

A Semi Centralized Training Decentralized Execution Architecture for Multi Agent Deep Reinforcement Learning in Traffic Signal Control

arXiv:2512.04653v2 Announce Type: replace-cross Abstract: Multi-agent reinforcement learning (MARL) has emerged as a promising paradigm for adaptive traffic sig

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Multilingual Medical Reasoning for Question Answering with Large Language Models

arXiv:2512.05658v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) with reasoning capabilities have recently demonstrated strong potential i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal Reasoning Models

arXiv:2512.08503v2 Announce Type: replace-cross Abstract: Multi-modal large reasoning models (MLRMs) pose significant privacy risks by inferring precise geograp

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models

arXiv:2512.10932v2 Announce Type: replace-cross Abstract: Early children's developmental trajectories set up a natural goal for sample-efficient pretraining of