Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,698
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,256 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Multi-Layered Memory Architectures for LLM Agents: An Experimental Evaluation of Long-Term Context Retention
arXiv:2603.29194v1 Announce Type: cross Abstract: Long-horizon dialogue systems suffer from semanticdrift and unstable memory retention across extended sessions
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Improving Ensemble Forecasts of Abnormally Deflecting Tropical Cyclones with Fused Atmosphere-Ocean-Terrain Data
arXiv:2603.29200v1 Announce Type: cross Abstract: Deep learning-based tropical cyclone (TC) forecasting methods have demonstrated significant potential and appl
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
SyriSign: A Parallel Corpus for Arabic Text to Syrian Arabic Sign Language Translation
arXiv:2603.29219v1 Announce Type: cross Abstract: Sign language is the primary approach of communication for the Deaf and Hard-of-Hearing (DHH) community. While
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Derived Fields Preserve Fine-Scale Detail in Budgeted Neural Simulators
arXiv:2603.29224v1 Announce Type: cross Abstract: Fine-scale-faithful neural simulation under fixed storage budgets remains challenging. Many existing methods r
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
MemRerank: Preference Memory for Personalized Product Reranking
arXiv:2603.29247v1 Announce Type: cross Abstract: LLM-based shopping agents increasingly rely on long purchase histories and multi-turn interactions for persona
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
PRISM: A Multi-View Multi-Capability Retail Video Dataset for Embodied Vision-Language Models
arXiv:2603.29281v1 Announce Type: cross Abstract: A critical gap exists between the general-purpose visual understanding of state-of-the-art physical AI models
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Sima AIunty: Caste Audit in LLM-Driven Matchmaking
arXiv:2603.29288v1 Announce Type: cross Abstract: Social and personal decisions in relational domains such as matchmaking are deeply entwined with cultural norm
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Self-Improving Code Generation via Semantic Entropy and Behavioral Consensus
arXiv:2603.29292v1 Announce Type: cross Abstract: Improving the code generation capabilities of large language models (LLMs) typically relies on supervised fine
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Beyond Corner Patches: Semantics-Aware Backdoor Attack in Federated Learning
arXiv:2603.29328v1 Announce Type: cross Abstract: Backdoor attacks on federated learning (FL) are most often evaluated with synthetic corner patches or out-of-d
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
PromptForge-350k: A Large-Scale Dataset and Contrastive Framework for Prompt-Based AI Image Forgery Localization
arXiv:2603.29386v1 Announce Type: cross Abstract: The rapid democratization of prompt-based AI image editing has recently exacerbated the risks associated with
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Security in LLM-as-a-Judge: A Comprehensive SoK
arXiv:2603.29403v1 Announce Type: cross Abstract: LLM-as-a-Judge (LaaJ) is a novel paradigm in which powerful language models are used to assess the quality, sa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Hallucination-aware intermediate representation edit in large vision-language models
arXiv:2603.29405v1 Announce Type: cross Abstract: Large Vision-Language Models have demonstrated exceptional performance in multimodal reasoning and complex sce
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
AGFT: Alignment-Guided Fine-Tuning for Zero-Shot Adversarial Robustness of Vision-Language Models
arXiv:2603.29410v1 Announce Type: cross Abstract: Pre-trained vision-language models (VLMs) exhibit strong zero-shot generalization but remain vulnerable to adv
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Adversarial Prompt Injection Attack on Multimodal Large Language Models
arXiv:2603.29418v1 Announce Type: cross Abstract: Although multimodal large language models (MLLMs) are increasingly deployed in real-world applications, their
ArXiv cs.AI 🧠 Large Language Models 📄 Paper 3w ago
Few-shot Writer Adaptation via Multimodal In-Context Learning
arXiv:2603.29450v1 Announce Type: cross Abstract: While state-of-the-art Handwritten Text Recognition (HTR) models perform well on standard benchmarks, they fre
ArXiv cs.AI 🧠 Large Language Models 📄 Paper 3w ago
An Isotropic Approach to Efficient Uncertainty Quantification with Gradient Norms
arXiv:2603.29466v1 Announce Type: cross Abstract: Existing methods for quantifying predictive uncertainty in neural networks are either computationally intracta
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
M-MiniGPT4: Multilingual VLLM Alignment via Translated Data
arXiv:2603.29467v1 Announce Type: cross Abstract: This paper presents a Multilingual Vision Large Language Model, named M-MiniGPT4. Our model exhibits strong vi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
MemFactory: Unified Inference & Training Framework for Agent Memory
arXiv:2603.29493v1 Announce Type: cross Abstract: Memory-augmented Large Language Models (LLMs) are essential for developing capable, long-term AI agents. Recen
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Impact of enriched meaning representations for language generation in dialogue tasks: A comprehensive exploration of the relevance of tasks, corpora and metrics
arXiv:2603.29518v1 Announce Type: cross Abstract: Conversational systems should generate diverse language forms to interact fluently and accurately with users.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Baby Scale: Investigating Models Trained on Individual Children's Language Input
arXiv:2603.29522v1 Announce Type: cross Abstract: Modern language models (LMs) must be trained on many orders of magnitude more words of training data than huma
ArXiv cs.AI 🧠 Large Language Models 📄 Paper 3w ago
Bringing Up a Bilingual BabyLM: Investigating Multilingual Language Acquisition Using Small-Scale Models
arXiv:2603.29552v1 Announce Type: cross Abstract: Multilingualism is incredibly common around the world, leading to many important theoretical and practical que
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Convergent Representations of Linguistic Constructions in Human and Artificial Neural Systems
arXiv:2603.29617v1 Announce Type: cross Abstract: Understanding how the brain processes linguistic constructions is a central challenge in cognitive neuroscienc
ArXiv cs.AI 🧠 Large Language Models 📄 Paper 3w ago
Concept frustration: Aligning human concepts and machine representations
arXiv:2603.29654v1 Announce Type: cross Abstract: Aligning human-interpretable concepts with the internal representations learned by modern machine learning sys
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Agenda-based Narrative Extraction: Steering Pathfinding Algorithms with Large Language Models
arXiv:2603.29661v1 Announce Type: cross Abstract: Existing narrative extraction methods face a trade-off between coherence, interactivity, and multi-storyline s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Mind the Gap: A Framework for Assessing Pitfalls in Multimodal Active Learning
arXiv:2603.29677v1 Announce Type: cross Abstract: Multimodal learning enables neural networks to integrate information from heterogeneous sources, but active le
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
KEditVis: A Visual Analytics System for Knowledge Editing of Large Language Models
arXiv:2603.29689v1 Announce Type: cross Abstract: Large Language Models (LLMs) demonstrate exceptional capabilities in factual question answering, yet they some
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
BotVerse: Real-Time Event-Driven Simulation of Social Agents
arXiv:2603.29741v1 Announce Type: cross Abstract: BotVerse is a scalable, event-driven framework for high-fidelity social simulation using LLM-based agents. It
ArXiv cs.AI 🧠 Large Language Models 📄 Paper 3w ago
From Density Matrices to Phase Transitions in Deep Learning: Spectral Early Warnings and Interpretability
arXiv:2603.29805v1 Announce Type: cross Abstract: A key problem in the modern study of AI is predicting and understanding emergent capabilities in models during
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
DIAL: Decoupling Intent and Action via Latent World Modeling for End-to-End VLA
arXiv:2603.29844v1 Announce Type: cross Abstract: The development of Vision-Language-Action (VLA) models has been significantly accelerated by pre-trained Visio
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
GENIE: Gram-Eigenmode INR Editing with Closed-Form Geometry Updates
arXiv:2603.29860v1 Announce Type: cross Abstract: Implicit Neural Representations (INRs) provide compact models of geometry, but it is unclear when their learne
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Performance Evaluation of LLMs in Automated RDF Knowledge Graph Generation
arXiv:2603.29878v1 Announce Type: cross Abstract: Cloud systems generate large, heterogeneous log data containing critical infrastructure, application, and secu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Generative AI in Action: Field Experimental Evidence from Alibaba's Customer Service Operations
arXiv:2603.29888v1 Announce Type: cross Abstract: In collaboration with Alibaba, this study leverages a large-scale field experiment to assess the impact of a g
ArXiv cs.AI 🧠 Large Language Models 📄 Paper 3w ago
Interview-Informed Generative Agents for Product Discovery: A Validation Study
arXiv:2603.29890v1 Announce Type: cross Abstract: Large language models (LLMs) have shown strong performance on standardized social science instruments, but the
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
UniRank: End-to-End Domain-Specific Reranking of Hybrid Text-Image Candidates
arXiv:2603.29897v1 Announce Type: cross Abstract: Reranking is a critical component in many information retrieval pipelines. Despite remarkable progress in text
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Bethe Ansatz with a Large Language Model
arXiv:2603.29932v1 Announce Type: cross Abstract: We explore the capability of a Large Language Model (LLM) to perform specific computations in mathematical phy
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Enhancing Structural Mapping with LLM-derived Abstractions for Analogical Reasoning in Narratives
arXiv:2603.29997v1 Announce Type: cross Abstract: Analogical reasoning is a key driver of human generalization in problem-solving and argumentation. Yet, analog
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Hybrid Framework for Robotic Manipulation: Integrating Reinforcement Learning and Large Language Models
arXiv:2603.30022v1 Announce Type: cross Abstract: This paper introduces a new hybrid framework that combines Reinforcement Learning (RL) and Large Language Mode
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Tucker Attention: A generalization of approximate attention mechanisms
arXiv:2603.30033v1 Announce Type: cross Abstract: The pursuit of reducing the memory footprint of the self-attention mechanism in multi-headed self attention (M
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Aligned, Orthogonal or In-conflict: When can we safely optimize Chain-of-Thought?
arXiv:2603.30036v1 Announce Type: cross Abstract: Chain-of-Thought (CoT) monitoring, in which automated systems monitor the CoT of an LLM, is a promising approa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
MindCube: Spatial Mental Modeling from Limited Views
arXiv:2506.21458v2 Announce Type: replace Abstract: Can Vision-Language Models (VLMs) imagine the full scene from just a few views, like humans do? Humans form
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
CLAUSE: Agentic Neuro-Symbolic Knowledge Graph Reasoning via Dynamic Learnable Context Engineering
arXiv:2509.21035v2 Announce Type: replace Abstract: Knowledge graphs provide structured context for multi-hop question answering, but deployed systems must bala
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Symbol Grounding in Neuro-Symbolic AI: A Gentle Introduction to Reasoning Shortcuts
arXiv:2510.14538v2 Announce Type: replace Abstract: Neuro-symbolic (NeSy) AI aims to develop deep neural networks whose predictions comply with prior knowledge
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
From Efficiency to Adaptivity: A Deeper Look at Adaptive Reasoning in Large Language Models
arXiv:2511.10788v3 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have made reasoning a central benchmark for evaluating intel
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
MedBayes-Lite: Bayesian Uncertainty Quantification for Safe Clinical Decision Support
arXiv:2511.16625v2 Announce Type: replace Abstract: We propose MedBayes-Lite, a lightweight Bayesian enhancement for transformer-based clinical language models
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Med-CMR: A Fine-Grained Benchmark Integrating Visual Evidence and Clinical Logic for Medical Complex Multimodal Reasoning
arXiv:2512.00818v2 Announce Type: replace Abstract: MLLMs MLLMs are beginning to appear in clinical workflows, but their ability to perform complex medical reas
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
The Geometry of Thought: How Scale Restructures Reasoning In Large Language Models
arXiv:2601.13358v2 Announce Type: replace Abstract: Scale does not uniformly improve reasoning - it restructures it. Analyzing 25,000+ chain-of-thought trajecto
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Distilling LLM Reasoning into Graph of Concept Predictors
arXiv:2602.03006v2 Announce Type: replace Abstract: Deploying Large Language Models (LLMs) for discriminative workloads is often limited by inference latency, c
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
GenOL: Generating Diverse Examples for Name-only Online Learning
arXiv:2403.10853v4 Announce Type: replace-cross Abstract: Online learning methods often rely on supervised data. However, under data distribution shifts, such a