3,323 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,323 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (15696) ArXiv cs.AIDev.to AIDev.to · FORUM WEBForbes InnovationMedium · ProgrammingMedium · AI
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
FileGram: Grounding Agent Personalization in File-System Behavioral Traces
arXiv:2604.04901v1 Announce Type: cross Abstract: Coworking AI agents operating within local file systems are rapidly emerging as a paradigm in human-AI interac
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
How AI Aggregation Affects Knowledge
arXiv:2604.04906v1 Announce Type: cross Abstract: Artificial intelligence (AI) changes social learning when aggregated outputs become training data for future p
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
Analyzing Symbolic Properties for DRL Agents in Systems and Networking
arXiv:2604.04914v1 Announce Type: cross Abstract: Deep reinforcement learning (DRL) has shown remarkable performance on complex control problems in systems and
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Vero: An Open RL Recipe for General Visual Reasoning
arXiv:2604.04917v1 Announce Type: cross Abstract: What does it take to build a visual reasoner that works across charts, science, spatial understanding, and ope
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Your Pre-trained Diffusion Model Secretly Knows Restoration
arXiv:2604.04924v1 Announce Type: cross Abstract: Pre-trained diffusion models have enabled significant advancements in All-in-One Restoration (AiOR), offering
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Early Stopping for Large Reasoning Models via Confidence Dynamics
arXiv:2604.04930v1 Announce Type: cross Abstract: Large reasoning models rely on long chain-of-thought generation to solve complex problems, but extended reason
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning
arXiv:2302.00797v4 Announce Type: replace Abstract: Opponent modeling methods typically involve two crucial steps: building a belief distribution over opponents
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
A Multi-Agent Reinforcement Learning Framework for Public Health Decision Analysis
arXiv:2311.00855v3 Announce Type: replace Abstract: Human immunodeficiency virus (HIV) is a major public health concern in the United States (U.S.), with about
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Barriers to Complexity-Theoretic Proofs that "AGI" Using Machine Learning is Impossible
arXiv:2411.06498v2 Announce Type: replace Abstract: A recent paper (van Rooij et al. 2024) claims to have proved that achieving human-like intelligence using le
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 2w ago
Representation learning to advance multi-institutional studies with electronic health record data from US and France
arXiv:2502.08547v2 Announce Type: replace Abstract: The widespread adoption of electronic health records has created new opportunities for translational clinica
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Reflection of Episodes: Learning to Play Game from Expert and Self Experiences
arXiv:2502.13388v3 Announce Type: replace Abstract: StarCraft II is a complex and dynamic real-time strategy (RTS) game environment, which is very suitable for
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models
arXiv:2506.17585v3 Announce Type: replace Abstract: Trustworthy language models should provide both correct and verifiable answers. However, citations generated
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
Seemingly Simple Planning Problems are Computationally Challenging: The Countdown Game
arXiv:2508.02900v2 Announce Type: replace Abstract: There is a broad consensus that the inability to form long-term plans is one of the key limitations of curre
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Similarity Field Theory: A Mathematical Framework for Intelligence
arXiv:2509.18218v5 Announce Type: replace Abstract: We posit that transforming similarity relations form the structural basis of comprehensible dynamic systems.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics
arXiv:2510.09901v2 Announce Type: replace Abstract: Computing has long served as a cornerstone of scientific discovery. Recently, a paradigm shift has emerged w
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges
arXiv:2510.23883v3 Announce Type: replace Abstract: Agentic AI systems powered by large language models (LLMs) and endowed with planning, tool use, memory, and
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
PRISM: Prompt-Refined In-Context System Modelling for Financial Retrieval
arXiv:2511.14130v2 Announce Type: replace Abstract: With the rapid progress of large language models (LLMs), financial information retrieval has become a critic
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
An Agent-Based Framework for the Automatic Validation of Mathematical Optimization Models
arXiv:2511.16383v2 Announce Type: replace Abstract: Recently, using Large Language Models (LLMs) to generate optimization models from natural language descripti
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
Agile Deliberation: Concept Deliberation for Subjective Visual Classification
arXiv:2512.10821v2 Announce Type: replace Abstract: From content moderation to content curation, applications requiring vision classifiers for visual concepts a
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows
arXiv:2512.13168v4 Announce Type: replace Abstract: We introduce FinWorkBench (a.k.a. Finch), a benchmark for evaluating agents on real-world, enterprise-grade