📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,169 articles · Updated every 3 hours

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

FedVideoMAE: Efficient Privacy-Preserving Federated Video Moderation

arXiv:2512.18809v2 Announce Type: replace-cross Abstract: Short-form video moderation increasingly needs learning pipelines that protect user privacy without pa

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 4d ago

Unified Thinker: A General Reasoning Modular Core for Image Generation

arXiv:2601.03127v2 Announce Type: replace-cross Abstract: Despite impressive progress in high-fidelity image synthesis, generative models still struggle with lo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

No Universal Hyperbola: A Formal Disproof of the Epistemic Trade-Off Between Certainty and Scope in Symbolic and Generative AI

arXiv:2601.08845v2 Announce Type: replace-cross Abstract: In direct response to requests for a logico-mathematical test of the conjecture, we formally disprove

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago

Autonomous Computational Catalysis Research via Agentic Systems

arXiv:2601.13508v2 Announce Type: replace-cross Abstract: Fully automating the scientific process is a transformative ambition in materials science, yet current

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Textual Equilibrium Propagation for Deep Compound AI Systems

arXiv:2601.21064v3 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly deployed as part of compound AI systems that coordinate

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 4d ago

Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions

arXiv:2602.09987v4 Announce Type: replace-cross Abstract: Influence functions are commonly used to attribute model behavior to training documents. We explore th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Equivariant Evidential Deep Learning for Interatomic Potentials

arXiv:2602.10419v2 Announce Type: replace-cross Abstract: Uncertainty quantification (UQ) is critical for assessing the reliability of machine learning interato

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Low-Dimensional and Transversely Curved Optimization Dynamics in Grokking

arXiv:2602.16746v3 Announce Type: replace-cross Abstract: Grokking -- the delayed transition from memorization to generalization in small algorithmic tasks -- r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Early-Warning Signals of Grokking via Loss-Landscape Geometry

arXiv:2602.16967v3 Announce Type: replace-cross Abstract: Grokking -- the abrupt transition from memorization to generalization after prolonged training -- has

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

The Geometry of Multi-Task Grokking: Transverse Instability, Superposition, and Weight Decay Phase Structure

arXiv:2602.18523v3 Announce Type: replace-cross Abstract: Grokking -- the abrupt transition from memorization to generalization long after near-zero training lo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

CeRA: Overcoming the Linear Ceiling of Low-Rank Adaptation via Capacity Expansion

arXiv:2602.22911v5 Announce Type: replace-cross Abstract: Low-Rank Adaptation (LoRA) dominates parameter-efficient fine-tuning (PEFT). However, it faces a "lin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond

arXiv:2603.01589v2 Announce Type: replace-cross Abstract: The success of large language models (LLMs) in scientific domains has heightened safety concerns, prom

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Escaping the BLEU Trap: A Signal-Grounded Framework with Decoupled Semantic Guidance for EEG-to-Text Decoding

arXiv:2603.03312v2 Announce Type: replace-cross Abstract: Decoding natural language from non-invasive EEG signals is a promising yet challenging task. However,

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago

Stock Market Prediction Using Node Transformer Architecture Integrated with BERT Sentiment Analysis

arXiv:2603.05917v2 Announce Type: replace-cross Abstract: Stock market prediction presents considerable challenges for investors, financial institutions, and po

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago

When Openclaw Agents Learn from Each Other: Insights from Emergent AI Agent Communities for Human-AI Partnership in Education

arXiv:2603.16663v3 Announce Type: replace-cross Abstract: The AIED community envisions AI evolving "from tools to teammates," yet our understanding of AI teamma

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Adaptive Guidance for Retrieval-Augmented Masked Diffusion Models

arXiv:2603.17677v2 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) improves factual grounding by incorporating external knowledge in

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 4d ago

Discovery of Bimodal Drift Rate Structure in FRB 20240114A: Evidence for Dual Emission Regions

arXiv:2603.18109v2 Announce Type: replace-cross Abstract: We report the discovery of bimodal structure in the drift rate distribution of upward-drifting burst c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

CoDA: Exploring Chain-of-Distribution Attacks and Post-Hoc Token-Space Repair for Medical Vision-Language Models

arXiv:2603.18545v2 Announce Type: replace-cross Abstract: Medical vision-language models (MVLMs) are increasingly used as perceptual backbones in radiology pip

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

JointFM-0.1: A Foundation Model for Multi-Target Joint Distributional Prediction

arXiv:2603.20266v2 Announce Type: replace-cross Abstract: Despite the rapid advancements in Artificial Intelligence (AI), Stochastic Differential Equations (SDE

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 4d ago

Better Rigs, Not Bigger Networks: A Body Model Ablation for Gaussian Avatars

arXiv:2604.01447v2 Announce Type: replace-cross Abstract: Recent 3D Gaussian splatting methods built atop SMPL achieve remarkable visual fidelity while continua

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 4d ago

ProdCodeBench: A Production-Derived Benchmark for Evaluating AI Coding Agents

arXiv:2604.01527v2 Announce Type: replace-cross Abstract: Benchmarks that reflect production workloads are better for evaluating AI coding agents in industrial

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Attention at Rest Stays at Rest: Breaking Visual Inertia for Cognitive Hallucination Mitigation

arXiv:2604.01989v2 Announce Type: replace-cross Abstract: Like a body at rest that stays at rest, we find that visual attention in multimodal large language mod

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

How Emotion Shapes the Behavior of LLMs and Agents: A Mechanistic Study

arXiv:2604.00005v1 Announce Type: new Abstract: Emotion plays an important role in human cognition and performance. Motivated by this, we investigate whether an

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

One Panel Does Not Fit All: Case-Adaptive Multi-Agent Deliberation for Clinical Prediction

arXiv:2604.00085v1 Announce Type: new Abstract: Large language models applied to clinical prediction exhibit case-level heterogeneity: simple cases yield consis