6,872 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 6,872 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (17839) ArXiv cs.AIDev.to AIDev.to · FORUM WEBForbes InnovationMedium · ProgrammingMedium · AI
ArXiv cs.AI 📄 Paper 1w ago
PAC-BENCH: Evaluating Multi-Agent Collaboration under Privacy Constraints
arXiv:2604.11523v1 Announce Type: new Abstract: We are entering an era in which individuals and organizations increasingly deploy dedicated AI agents that inter
ArXiv cs.AI 📄 Paper 1w ago
Limited Perfect Monotonical Surrogates constructed using low-cost recursive linkage discovery with guaranteed output
arXiv:2604.11524v1 Announce Type: new Abstract: Surrogates provide a cheap solution evaluation and offer significant leverage for optimizing computationally exp
ArXiv cs.AI 📄 Paper 1w ago
Problem Reductions at Scale: Agentic Integration of Computationally Hard Problems
arXiv:2604.11535v1 Announce Type: new Abstract: Solving an NP-hard optimization problem often requires reformulating it for a specific solver -- quantum hardwar
ArXiv cs.AI 📄 Paper 1w ago
A collaborative agent with two lightweight synergistic models for autonomous crystal materials research
arXiv:2604.11540v1 Announce Type: new Abstract: Current large language models require hundreds of billions of parameters yet struggle with domain-specific reaso
ArXiv cs.AI 📄 Paper 1w ago
SemaClaw: A Step Towards General-Purpose Personal AI Agents through Harness Engineering
arXiv:2604.11548v1 Announce Type: new Abstract: The rise of OpenClaw in early 2026 marks the moment when millions of users began deploying personal AI agents in
ArXiv cs.AI 📄 Paper 1w ago
UniToolCall: Unifying Tool-Use Representation, Data, and Evaluation for LLM Agents
arXiv:2604.11557v1 Announce Type: new Abstract: Tool-use capability is a fundamental component of LLM agents, enabling them to interact with external systems th
ArXiv cs.AI 📄 Paper 1w ago
Intersectional Sycophancy: How Perceived User Demographics Shape False Validation in Large Language Models
arXiv:2604.11609v1 Announce Type: new Abstract: Large language models exhibit sycophantic tendencies--validating incorrect user beliefs to appear agreeable. We
ArXiv cs.AI 📄 Paper 1w ago
Context Kubernetes: Declarative Orchestration of Enterprise Knowledge for Agentic AI Systems
arXiv:2604.11623v1 Announce Type: new Abstract: We introduce Context Kubernetes, an architecture for orchestrating enterprise knowledge in agentic AI systems, w
ArXiv cs.AI 📄 Paper 1w ago
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time
arXiv:2604.11626v1 Announce Type: new Abstract: Most reward models for visual generation reduce rich human judgments to a single unexplained score, discarding t
ArXiv cs.AI 📄 Paper 1w ago
Why Do Large Language Models Generate Harmful Content?
arXiv:2604.11663v1 Announce Type: new Abstract: Large Language Models (LLMs) have been shown to generate harmful content. However, the underlying causes of such
ArXiv cs.AI 📄 Paper 1w ago
DreamKG: A KG-Augmented Conversational System for People Experiencing Homelessness
arXiv:2604.11703v1 Announce Type: new Abstract: People experiencing homelessness (PEH) face substantial barriers to accessing timely, accurate information about
ArXiv cs.AI 📄 Paper 1w ago
Agentic Driving Coach: Robustness and Determinism of Agentic AI-Powered Human-in-the-Loop Cyber-Physical Systems
arXiv:2604.11705v1 Announce Type: new Abstract: Foundation models, including large language models (LLMs), are increasingly used for human-in-the-loop (HITL) cy
ArXiv cs.AI 📄 Paper 1w ago
A Mamba-Based Multimodal Network for Multiscale Blast-Induced Rapid Structural Damage Assessment
arXiv:2604.11709v1 Announce Type: new Abstract: Accurate and rapid structural damage assessment (SDA) is crucial for post-disaster management, helping responder
ArXiv cs.AI 📄 Paper 1w ago
SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context
arXiv:2604.11716v1 Announce Type: new Abstract: Prior representative ReAct-style approaches in autonomous Software Engineering (SWE) typically lack the explicit
ArXiv cs.AI 📄 Paper 1w ago
Collaborative Multi-Agent Scripts Generation for Enhancing Imperfect-Information Reasoning in Murder Mystery Games
arXiv:2604.11741v1 Announce Type: new Abstract: Vision-language models (VLMs) have shown impressive capabilities in perceptual tasks, yet they degrade in comple
ArXiv cs.AI 📄 Paper 1w ago
Retrieval Is Not Enough: Why Organizational AI Needs Epistemic Infrastructure
arXiv:2604.11759v1 Announce Type: new Abstract: Organizational knowledge used by AI agents typically lacks epistemic structure: retrieval systems surface semant
ArXiv cs.AI 📄 Paper 1w ago
GenTac: Generative Modeling and Forecasting of Soccer Tactics
arXiv:2604.11786v1 Announce Type: new Abstract: Modeling open-play soccer tactics is a formidable challenge due to the stochastic, multi-agent nature of the gam
ArXiv cs.AI 📄 Paper 1w ago
Detecting Safety Violations Across Many Agent Traces
arXiv:2604.11806v1 Announce Type: new Abstract: To identify safety violations, auditors often search over large sets of agent traces. This search is difficult b
ArXiv cs.AI 📄 Paper 1w ago
The Paradox of Professional Input: How Expert Collaboration with AI Systems Shapes Their Future Value
arXiv:2504.12654v1 Announce Type: cross Abstract: This perspective paper examines a fundamental paradox in the relationship between professional expertise and a
ArXiv cs.AI 📄 Paper 1w ago
Retrieval-Augmented Large Language Models for Evidence-Informed Guidance on Cannabidiol Use in Older Adults
arXiv:2604.09548v1 Announce Type: cross Abstract: Older adults commonly experience chronic conditions such as pain and sleep disturbances and may consider canna