📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 8,253 articles · Updated every 3 hours

ArXiv cs.AI 📄 Paper 2w ago
Structured Uncertainty guided Clarification for LLM Agents
arXiv:2511.08798v2 Announce Type: replace-cross Abstract: LLM agents with tool-calling capabilities often fail when user instructions are ambiguous or incomplet
ArXiv cs.AI 📄 Paper 2w ago
Commanding Humanoid by Free-form Language: A Large Language Action Model with Unified Motion Vocabulary
arXiv:2511.22963v2 Announce Type: replace-cross Abstract: Enabling humanoid robots to follow free-form language commands is critical for seamless human-robot in
ArXiv cs.AI 📄 Paper 2w ago
Bharat Scene Text: A Novel Comprehensive Dataset and Benchmark for Indian Language Scene Text Understanding
arXiv:2511.23071v2 Announce Type: replace-cross Abstract: Reading scene text, that is, text appearing in images, has numerous application areas, including assis
ArXiv cs.AI 📄 Paper 2w ago
See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models
arXiv:2512.02231v2 Announce Type: replace-cross Abstract: Multimodal large language models (MLLMs) are expected to jointly interpret vision, audio, and language
ArXiv cs.AI 📄 Paper 2w ago
From Navigation to Refinement: Revealing the Two-Stage Nature of Flow-based Diffusion Models through Oracle Velocity
arXiv:2512.02826v3 Announce Type: replace-cross Abstract: Flow-based diffusion models have emerged as a leading paradigm for training generative models across i
ArXiv cs.AI 📄 Paper 2w ago
Out-of-the-box: Black-box Causal Attacks on Object Detectors
arXiv:2512.03730v2 Announce Type: replace-cross Abstract: Adversarial perturbations are a useful way to expose vulnerabilities in object detectors. Existing per
ArXiv cs.AI 📄 Paper 2w ago
SkillFactory: Self-Distillation For Learning Cognitive Behaviors
arXiv:2512.04072v2 Announce Type: replace-cross Abstract: Reasoning models leveraging long chains of thought employ various cognitive skills, such as verificati
ArXiv cs.AI 📄 Paper 2w ago
Relational Visual Similarity
arXiv:2512.07833v2 Announce Type: replace-cross Abstract: Humans do not just see attribute similarity -- we also see relational similarity. An apple is like a p
ArXiv cs.AI 📄 Paper 2w ago
Multi-agent Adaptive Mechanism Design
arXiv:2512.21794v3 Announce Type: replace-cross Abstract: We study a sequential mechanism design problem in which a principal seeks to elicit truthful reports f
ArXiv cs.AI 📄 Paper 2w ago
The Two-Stage Decision-Sampling Hypothesis: Understanding the Emergence of Self-Reflection in RL-Trained LLMs
arXiv:2601.01580v2 Announce Type: replace-cross Abstract: Self-reflection capabilities emerge in Large Language Models after RL post-training, with multi-turn R
ArXiv cs.AI 📄 Paper 2w ago
Adversarial Evasion Attacks on Computer Vision using SHAP Values
arXiv:2601.10587v3 Announce Type: replace-cross Abstract: The paper introduces a white-box attack on computer vision models using SHAP values. It demonstrates h
ArXiv cs.AI 📄 Paper 2w ago
Screen, Cache, and Match: A Training-Free Causality-Consistent Reference Frame Framework for Human Animation
arXiv:2601.22160v2 Announce Type: replace-cross Abstract: Human animation aims to generate temporally coherent and visually consistent videos over long sequence
ArXiv cs.AI 📄 Paper 2w ago
Self-Supervised Slice-to-Volume Reconstruction with Gaussian Representations for Fetal MRI
arXiv:2601.22990v2 Announce Type: replace-cross Abstract: Reconstructing 3D fetal MR volumes from motion-corrupted stacks of 2D slices is a crucial and challeng
ArXiv cs.AI 📄 Paper 2w ago
On the Limits of Layer Pruning for Generative Reasoning in Large Language Models
arXiv:2602.01997v2 Announce Type: replace-cross Abstract: Recent work has shown that layer pruning can effectively compress large language models (LLMs) while r
ArXiv cs.AI 📄 Paper 2w ago
Tiled Prompts: Overcoming Prompt Misguidance in Image and Video Super-Resolution
arXiv:2602.03342v2 Announce Type: replace-cross Abstract: Text-conditioned diffusion models have advanced image and video super-resolution by using prompts as s
ArXiv cs.AI 📄 Paper 2w ago
SPEAR: An Engineering Case Study of Multi-Agent Coordination for Smart Contract Auditing
arXiv:2602.04418v3 Announce Type: replace-cross Abstract: We present SPEAR, a multi-agent coordination framework for smart contract auditing that applies establ
ArXiv cs.AI 📄 Paper 2w ago
Overstating Attitudes, Ignoring Networks: LLM Biases in Simulating Misinformation Susceptibility
arXiv:2602.04674v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly used as proxies for human judgment in computational soci
ArXiv cs.AI 📄 Paper 2w ago
Exploring Teachers' Perspectives on Using Conversational AI Agents for Group Collaboration
arXiv:2602.07142v2 Announce Type: replace-cross Abstract: Collaboration is a cornerstone of 21st-century learning, yet teachers continue to face challenges in s
ArXiv cs.AI 📄 Paper 2w ago
An Adaptive Model Selection Framework for Demand Forecasting under Horizon-Induced Degradation to Support Business Strategy and Operations
arXiv:2602.13939v3 Announce Type: replace-cross Abstract: Business environments characterized by intermittent demand, high variability, and multi-step planning
ArXiv cs.AI 📄 Paper 2w ago
SubQuad: Near-Quadratic-Free Structure Inference with Distribution-Balanced Objectives in Adaptive Receptor framework
arXiv:2602.17330v3 Announce Type: replace-cross Abstract: Comparative analysis of adaptive immune repertoires at population scale is hampered by two practical b