📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,169 articles · Updated every 3 hours

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Textual Equilibrium Propagation for Deep Compound AI Systems
arXiv:2601.21064v3 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly deployed as part of compound AI systems that coordinate
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 4d ago
Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions
arXiv:2602.09987v4 Announce Type: replace-cross Abstract: Influence functions are commonly used to attribute model behavior to training documents. We explore th
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Equivariant Evidential Deep Learning for Interatomic Potentials
arXiv:2602.10419v2 Announce Type: replace-cross Abstract: Uncertainty quantification (UQ) is critical for assessing the reliability of machine learning interato
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Low-Dimensional and Transversely Curved Optimization Dynamics in Grokking
arXiv:2602.16746v3 Announce Type: replace-cross Abstract: Grokking -- the delayed transition from memorization to generalization in small algorithmic tasks -- r
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Early-Warning Signals of Grokking via Loss-Landscape Geometry
arXiv:2602.16967v3 Announce Type: replace-cross Abstract: Grokking -- the abrupt transition from memorization to generalization after prolonged training -- has
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
The Geometry of Multi-Task Grokking: Transverse Instability, Superposition, and Weight Decay Phase Structure
arXiv:2602.18523v3 Announce Type: replace-cross Abstract: Grokking -- the abrupt transition from memorization to generalization long after near-zero training lo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
CeRA: Overcoming the Linear Ceiling of Low-Rank Adaptation via Capacity Expansion
arXiv:2602.22911v5 Announce Type: replace-cross Abstract: Low-Rank Adaptation (LoRA) dominates parameter-efficient fine-tuning (PEFT). However, it faces a "lin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond
arXiv:2603.01589v2 Announce Type: replace-cross Abstract: The success of large language models (LLMs) in scientific domains has heightened safety concerns, prom
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Escaping the BLEU Trap: A Signal-Grounded Framework with Decoupled Semantic Guidance for EEG-to-Text Decoding
arXiv:2603.03312v2 Announce Type: replace-cross Abstract: Decoding natural language from non-invasive EEG signals is a promising yet challenging task. However,
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
Stock Market Prediction Using Node Transformer Architecture Integrated with BERT Sentiment Analysis
arXiv:2603.05917v2 Announce Type: replace-cross Abstract: Stock market prediction presents considerable challenges for investors, financial institutions, and po
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
When Openclaw Agents Learn from Each Other: Insights from Emergent AI Agent Communities for Human-AI Partnership in Education
arXiv:2603.16663v3 Announce Type: replace-cross Abstract: The AIED community envisions AI evolving "from tools to teammates," yet our understanding of AI teamma
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Adaptive Guidance for Retrieval-Augmented Masked Diffusion Models
arXiv:2603.17677v2 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) improves factual grounding by incorporating external knowledge in
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 4d ago
Discovery of Bimodal Drift Rate Structure in FRB 20240114A: Evidence for Dual Emission Regions
arXiv:2603.18109v2 Announce Type: replace-cross Abstract: We report the discovery of bimodal structure in the drift rate distribution of upward-drifting burst c
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
CoDA: Exploring Chain-of-Distribution Attacks and Post-Hoc Token-Space Repair for Medical Vision-Language Models
arXiv:2603.18545v2 Announce Type: replace-cross Abstract: Medical vision-language models (MVLMs) are increasingly used as perceptual backbones in radiology pip
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
JointFM-0.1: A Foundation Model for Multi-Target Joint Distributional Prediction
arXiv:2603.20266v2 Announce Type: replace-cross Abstract: Despite the rapid advancements in Artificial Intelligence (AI), Stochastic Differential Equations (SDE
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 4d ago
Better Rigs, Not Bigger Networks: A Body Model Ablation for Gaussian Avatars
arXiv:2604.01447v2 Announce Type: replace-cross Abstract: Recent 3D Gaussian splatting methods built atop SMPL achieve remarkable visual fidelity while continua
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 4d ago
ProdCodeBench: A Production-Derived Benchmark for Evaluating AI Coding Agents
arXiv:2604.01527v2 Announce Type: replace-cross Abstract: Benchmarks that reflect production workloads are better for evaluating AI coding agents in industrial
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Attention at Rest Stays at Rest: Breaking Visual Inertia for Cognitive Hallucination Mitigation
arXiv:2604.01989v2 Announce Type: replace-cross Abstract: Like a body at rest that stays at rest, we find that visual attention in multimodal large language mod
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
How Emotion Shapes the Behavior of LLMs and Agents: A Mechanistic Study
arXiv:2604.00005v1 Announce Type: new Abstract: Emotion plays an important role in human cognition and performance. Motivated by this, we investigate whether an
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
One Panel Does Not Fit All: Case-Adaptive Multi-Agent Deliberation for Clinical Prediction
arXiv:2604.00085v1 Announce Type: new Abstract: Large language models applied to clinical prediction exhibit case-level heterogeneity: simple cases yield consis