📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 3,169 articles · Updated every 3 hours · View all news
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
FedVideoMAE: Efficient Privacy-Preserving Federated Video Moderation
arXiv:2512.18809v2 Announce Type: replace-cross Abstract: Short-form video moderation increasingly needs learning pipelines that protect user privacy without pa
ArXiv cs.AI
💻 AI-Assisted Coding
📄 Paper
⚡ AI Lesson
4d ago
Unified Thinker: A General Reasoning Modular Core for Image Generation
arXiv:2601.03127v2 Announce Type: replace-cross Abstract: Despite impressive progress in high-fidelity image synthesis, generative models still struggle with lo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
No Universal Hyperbola: A Formal Disproof of the Epistemic Trade-Off Between Certainty and Scope in Symbolic and Generative AI
arXiv:2601.08845v2 Announce Type: replace-cross Abstract: In direct response to requests for a logico-mathematical test of the conjecture, we formally disprove
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4d ago
Autonomous Computational Catalysis Research via Agentic Systems
arXiv:2601.13508v2 Announce Type: replace-cross Abstract: Fully automating the scientific process is a transformative ambition in materials science, yet current
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Textual Equilibrium Propagation for Deep Compound AI Systems
arXiv:2601.21064v3 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly deployed as part of compound AI systems that coordinate
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
4d ago
Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions
arXiv:2602.09987v4 Announce Type: replace-cross Abstract: Influence functions are commonly used to attribute model behavior to training documents. We explore th
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Equivariant Evidential Deep Learning for Interatomic Potentials
arXiv:2602.10419v2 Announce Type: replace-cross Abstract: Uncertainty quantification (UQ) is critical for assessing the reliability of machine learning interato
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Low-Dimensional and Transversely Curved Optimization Dynamics in Grokking
arXiv:2602.16746v3 Announce Type: replace-cross Abstract: Grokking -- the delayed transition from memorization to generalization in small algorithmic tasks -- r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Early-Warning Signals of Grokking via Loss-Landscape Geometry
arXiv:2602.16967v3 Announce Type: replace-cross Abstract: Grokking -- the abrupt transition from memorization to generalization after prolonged training -- has
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
The Geometry of Multi-Task Grokking: Transverse Instability, Superposition, and Weight Decay Phase Structure
arXiv:2602.18523v3 Announce Type: replace-cross Abstract: Grokking -- the abrupt transition from memorization to generalization long after near-zero training lo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
CeRA: Overcoming the Linear Ceiling of Low-Rank Adaptation via Capacity Expansion
arXiv:2602.22911v5 Announce Type: replace-cross Abstract: Low-Rank Adaptation (LoRA) dominates parameter-efficient fine-tuning (PEFT). However, it faces a ``lin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond
arXiv:2603.01589v2 Announce Type: replace-cross Abstract: The success of large language models (LLMs) in scientific domains has heightened safety concerns, prom
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Escaping the BLEU Trap: A Signal-Grounded Framework with Decoupled Semantic Guidance for EEG-to-Text Decoding
arXiv:2603.03312v2 Announce Type: replace-cross Abstract: Decoding natural language from non-invasive EEG signals is a promising yet challenging task. However,
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4d ago
Stock Market Prediction Using Node Transformer Architecture Integrated with BERT Sentiment Analysis
arXiv:2603.05917v2 Announce Type: replace-cross Abstract: Stock market prediction presents considerable challenges for investors, financial institutions, and po
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4d ago
When Openclaw Agents Learn from Each Other: Insights from Emergent AI Agent Communities for Human-AI Partnership in Education
arXiv:2603.16663v3 Announce Type: replace-cross Abstract: The AIED community envisions AI evolving "from tools to teammates," yet our understanding of AI teamma
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Adaptive Guidance for Retrieval-Augmented Masked Diffusion Models
arXiv:2603.17677v2 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) improves factual grounding by incorporating external knowledge in
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
4d ago
Discovery of Bimodal Drift Rate Structure in FRB 20240114A: Evidence for Dual Emission Regions
arXiv:2603.18109v2 Announce Type: replace-cross Abstract: We report the discovery of bimodal structure in the drift rate distribution of upward-drifting burst c
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
CoDA: Exploring Chain-of-Distribution Attacks and Post-Hoc Token-Space Repair for Medical Vision-Language Models
arXiv:2603.18545v2 Announce Type: replace-cross Abstract: Medical vision--language models (MVLMs) are increasingly used as perceptual backbones in radiology pip
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
JointFM-0.1: A Foundation Model for Multi-Target Joint Distributional Prediction
arXiv:2603.20266v2 Announce Type: replace-cross Abstract: Despite the rapid advancements in Artificial Intelligence (AI), Stochastic Differential Equations (SDE
ArXiv cs.AI
💻 AI-Assisted Coding
📄 Paper
⚡ AI Lesson
4d ago
Better Rigs, Not Bigger Networks: A Body Model Ablation for Gaussian Avatars
arXiv:2604.01447v2 Announce Type: replace-cross Abstract: Recent 3D Gaussian splatting methods built atop SMPL achieve remarkable visual fidelity while continua
ArXiv cs.AI
💻 AI-Assisted Coding
📄 Paper
⚡ AI Lesson
4d ago
ProdCodeBench: A Production-Derived Benchmark for Evaluating AI Coding Agents
arXiv:2604.01527v2 Announce Type: replace-cross Abstract: Benchmarks that reflect production workloads are better for evaluating AI coding agents in industrial
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Attention at Rest Stays at Rest: Breaking Visual Inertia for Cognitive Hallucination Mitigation
arXiv:2604.01989v2 Announce Type: replace-cross Abstract: Like a body at rest that stays at rest, we find that visual attention in multimodal large language mod
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
How Emotion Shapes the Behavior of LLMs and Agents: A Mechanistic Study
arXiv:2604.00005v1 Announce Type: new Abstract: Emotion plays an important role in human cognition and performance. Motivated by this, we investigate whether an
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
One Panel Does Not Fit All: Case-Adaptive Multi-Agent Deliberation for Clinical Prediction
arXiv:2604.00085v1 Announce Type: new Abstract: Large language models applied to clinical prediction exhibit case-level heterogeneity: simple cases yield consis
DeepCamp AI