Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,534
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,127 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Evaluating and Understanding Scheming Propensity in LLM Agents
arXiv:2603.01608v2 Announce Type: replace Abstract: As frontier language models are increasingly deployed as autonomous agents pursuing complex, long-term objec
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Seed1.8 Model Card: Towards Generalized Real-World Agency
arXiv:2603.20633v2 Announce Type: replace Abstract: We present Seed1.8, a foundation model aimed at generalized real-world agency: going beyond single-turn pred
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks
arXiv:2603.21636v2 Announce Type: replace Abstract: Public benchmarks increasingly govern how large language models (LLMs) are ranked, selected, and deployed. W
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Continual Graph Learning: A Survey
arXiv:2301.12230v2 Announce Type: replace-cross Abstract: Continual Graph Learning (CGL) enables models to incrementally learn from streaming graph-structured d
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Deep Neural Networks: A Formulation Via Non-Archimedean Analysis
arXiv:2402.00094v2 Announce Type: replace-cross Abstract: We introduce a new class of deep neural networks (DNNs) with multilayered tree-like architectures. The
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ
arXiv:2402.11877v2 Announce Type: replace-cross Abstract: Reinforcement learning has witnessed significant advancements, particularly with the emergence of mode
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Bidirectional Multimodal Prompt Learning with Scale-Aware Training for Few-Shot Multi-Class Anomaly Detection
arXiv:2408.13516v2 Announce Type: replace-cross Abstract: Few-shot multi-class anomaly detection is crucial in real industrial settings, where only a few normal
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Explainable AI needs formalization
arXiv:2409.14590v5 Announce Type: replace-cross Abstract: The field of "explainable artificial intelligence" (XAI) seemingly addresses the desire that decisions
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Recent Advances of Multimodal Continual Learning: A Comprehensive Survey
arXiv:2410.05352v3 Announce Type: replace-cross Abstract: Continual learning (CL) aims to empower machine learning models to learn continually from new data, wh
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Gradient Compression Beyond Low-Rank: Wavelet Subspaces Compact Optimizer States
arXiv:2501.07237v4 Announce Type: replace-cross Abstract: Large language models (LLMs) have shown impressive performance across a range of natural language proc
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
A Survey of Zero-Knowledge Proof Based Verifiable Machine Learning
arXiv:2502.18535v2 Announce Type: replace-cross Abstract: Machine learning is increasingly deployed through outsourced and cloud-based pipelines, which improve
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs
arXiv:2503.05371v3 Announce Type: replace-cross Abstract: We present a novel approach to bias mitigation in large language models (LLMs) by applying steering ve
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text
arXiv:2504.19467v4 Announce Type: replace-cross Abstract: Large language models (LLMs) hold great promise for medical applications and are evolving rapidly, wit
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models
arXiv:2505.03821v2 Announce Type: replace-cross Abstract: We investigate the ability of Vision Language Models (VLMs) to perform visual perspective taking using
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Self-Bootstrapping Automated Program Repair: Using LLMs to Generate and Evaluate Synthetic Training Data for Bug Repair
arXiv:2505.07372v2 Announce Type: replace-cross Abstract: This paper presents a novel methodology for enhancing Automated Program Repair (APR) through synthetic
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Structured Agent Distillation for Large Language Model
arXiv:2505.13820v4 Announce Type: replace-cross Abstract: Large language models (LLMs) exhibit strong capabilities as decision-making agents by interleaving rea
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
VLM-SAFE: Vision-Language Model-Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving
arXiv:2505.16377v2 Announce Type: replace-cross Abstract: Autonomous driving policy learning with reinforcement learning (RL) is fundamentally limited by low sa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification
arXiv:2506.04450v5 Announce Type: replace-cross Abstract: Large Language Models (LLMs) are increasingly adopted across domains such as education, healthcare, an
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights
arXiv:2506.17337v4 Announce Type: replace-cross Abstract: Vision Language Models (VLMs) have shown promise in automating image diagnosis and interpretation in c
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Multi-Sample Prompting and Actor-Critic Prompt Optimization for Diverse Synthetic Data Generation
arXiv:2506.21138v2 Announce Type: replace-cross Abstract: High-quality labeled datasets are fundamental for training and evaluating machine learning models, yet
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models
arXiv:2508.02343v2 Announce Type: replace-cross Abstract: Quantization significantly accelerates inference in large language models (LLMs) by replacing original
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series Forecasting
arXiv:2508.13773v3 Announce Type: replace-cross Abstract: Despite advances in the Transformer architecture, their effectiveness for long-term time series foreca
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
AirQA: A Comprehensive QA Dataset for AI Research with Instance-Level Evaluation
arXiv:2509.16952v2 Announce Type: replace-cross Abstract: The growing volume of academic papers has made it increasingly difficult for researchers to efficientl
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Multi-View Attention Multiple-Instance Learning Enhanced by LLM Reasoning for Cognitive Distortion Detection
arXiv:2509.17292v2 Announce Type: replace-cross Abstract: Cognitive distortions have been closely linked to mental health disorders, yet their automatic detecti
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Advancing Few-Shot Pediatric Arrhythmia Classification with a Novel Contrastive Loss and Multimodal Learning
arXiv:2509.19315v2 Announce Type: replace-cross Abstract: Arrhythmias are a major cause of sudden cardiac death in children, making automated rhythm classificat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Dual-Space Smoothness for Robust and Balanced LLM Unlearning
arXiv:2509.23362v2 Announce Type: replace-cross Abstract: As large language models evolve, Machine Unlearning has emerged to address growing concerns around use
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models
arXiv:2509.25848v3 Announce Type: replace-cross Abstract: Reasoning has emerged as a pivotal capability in Large Language Models (LLMs). Through Reinforcement L
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
arXiv:2510.04618v3 Announce Type: replace-cross Abstract: Large language model (LLM) applications such as agents and domain-specific reasoning increasingly rely
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling
arXiv:2510.05825v2 Announce Type: replace-cross Abstract: Inference-Time Scaling (ITS) improves language models by allocating more computation at generation tim
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language Navigation
arXiv:2510.08553v2 Announce Type: replace-cross Abstract: Vision-and-Language Navigation (VLN) requires agents to follow natural language instructions through e
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
CLMN: Concept based Language Models via Neural Symbolic Reasoning
arXiv:2510.10063v2 Announce Type: replace-cross Abstract: Deep learning has advanced NLP, but interpretability remains limited, especially in healthcare and fin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Schema for In-Context Learning
arXiv:2510.13905v3 Announce Type: replace-cross Abstract: In-Context Learning (ICL) enables transformer-based language models to adapt to new tasks by condition
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
ProofBridge: Auto-Formalization of Natural Language Proofs in Lean via Joint Embeddings
arXiv:2510.15681v3 Announce Type: replace-cross Abstract: Translating human-written mathematical theorems and proofs from natural language (NL) into formal lang
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models
arXiv:2510.20351v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly exposed to data contamination, i.e., performance gains d
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning
arXiv:2510.25311v2 Announce Type: replace-cross Abstract: Reinforcement Learning algorithms are primarily focused on learning a policy that maximizes expected r
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks
arXiv:2511.10465v2 Announce Type: replace-cross Abstract: While prompt optimization has emerged as a critical technique for enhancing language model performance
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation
arXiv:2511.11483v4 Announce Type: replace-cross Abstract: Recent text-to-image (T2I) models have made remarkable progress in generating visually realistic and s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Scaling Spatial Intelligence with Multimodal Foundation Models
arXiv:2511.13719v4 Announce Type: replace-cross Abstract: Despite remarkable progress, multimodal foundation models still exhibit surprising deficiencies in spa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Object-Centric World Models for Causality-Aware Reinforcement Learning
arXiv:2511.14262v3 Announce Type: replace-cross Abstract: World models have been developed to support sample-efficient deep reinforcement learning agents. Howev
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
SciEGQA: A Dataset for Scientific Evidence-Grounded Question Answering and Reasoning
arXiv:2511.15090v2 Announce Type: replace-cross Abstract: Scientific documents contain complex multimodal structures, which makes evidence localization and scie
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Towards Hyper-Efficient RAG Systems in VecDBs: Distributed Parallel Multi-Resolution Vector Search
arXiv:2511.16681v2 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) systems have become a dominant approach to augment large language
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
UniGame: Turning a Unified Multimodal Model Into Its Own Adversary
arXiv:2511.19413v3 Announce Type: replace-cross Abstract: Unified Multimodal Models (UMMs) have shown impressive performance in both understanding and generatio
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings
arXiv:2511.21428v2 Announce Type: replace-cross Abstract: We present a novel unsupervised framework to unlock vast unlabeled human demonstration data from conti
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Single-Round Scalable Analytic Federated Learning
arXiv:2512.03336v2 Announce Type: replace-cross Abstract: Federated Learning (FL) is plagued by two key challenges: high communication overhead and performance
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
A Semi Centralized Training Decentralized Execution Architecture for Multi Agent Deep Reinforcement Learning in Traffic Signal Control
arXiv:2512.04653v2 Announce Type: replace-cross Abstract: Multi-agent reinforcement learning (MARL) has emerged as a promising paradigm for adaptive traffic sig
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Multilingual Medical Reasoning for Question Answering with Large Language Models
arXiv:2512.05658v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) with reasoning capabilities have recently demonstrated strong potential i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal Reasoning Models
arXiv:2512.08503v2 Announce Type: replace-cross Abstract: Multi-modal large reasoning models (MLRMs) pose significant privacy risks by inferring precise geograp
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models
arXiv:2512.10932v2 Announce Type: replace-cross Abstract: Early children's developmental trajectories set up a natural goal for sample-efficient pretraining of