📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 3,169 articles · Updated every 3 hours · View all news
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
View-oriented Conversation Compiler for Agent Trace Analysis
arXiv:2603.29678v1 Announce Type: new Abstract: Agent traces carry increasing analytical value in the era of context learning and harness-driven agentic cogniti
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Beyond the Steeper Curve: AI-Mediated Metacognitive Decoupling and the Limits of the Dunning-Kruger Metaphor
arXiv:2603.29681v1 Announce Type: new Abstract: The common claim that generative AI simply amplifies the Dunning-Kruger effect is too coarse to capture the avai
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
1w ago
A First Step Towards Even More Sparse Encodings of Probability Distributions
arXiv:2603.29691v1 Announce Type: new Abstract: Real world scenarios can be captured with lifted probability distributions. However, distributions are usually e
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Measuring the metacognition of AI
arXiv:2603.29693v1 Announce Type: new Abstract: A robust decision-making process must take into account uncertainty, especially when the choice involves inheren
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Symphony for Medical Coding: A Next-Generation Agentic System for Scalable and Explainable Medical Coding
arXiv:2603.29709v1 Announce Type: new Abstract: Medical coding translates free-text clinical documentation into standardized codes drawn from classification sys
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Reinforced Reasoning for End-to-End Retrosynthetic Planning
arXiv:2603.29723v1 Announce Type: new Abstract: Retrosynthetic planning is a fundamental task in organic chemistry, yet remains challenging due to its combinato
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Spontaneous Functional Differentiation in Large Language Models: A Brain-Like Intelligence Economy
arXiv:2603.29735v1 Announce Type: new Abstract: The evolution of intelligence in artificial systems provides a unique opportunity to identify universal computat
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
CausalPulse: An Industrial-Grade Neurosymbolic Multi-Agent Copilot for Causal Diagnostics in Smart Manufacturing
arXiv:2603.29755v1 Announce Type: new Abstract: Modern manufacturing environments demand real-time, trustworthy, and interpretable root-cause insights to sustai
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Tracking vs. Deciding: The Dual-Capability Bottleneck in Searchless Chess Transformers
arXiv:2603.29761v1 Announce Type: new Abstract: A human-like chess engine should mimic the style, errors, and consistency of a strong human player rather than m
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Reasoning-Driven Synthetic Data Generation and Evaluation
arXiv:2603.29791v1 Announce Type: new Abstract: Although many AI applications of interest require specialized multi-modal models, relevant data to train such mo
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Owl-AuraID 1.0: An Intelligent System for Autonomous Scientific Instrumentation and Scientific Data Analysis
arXiv:2603.29828v1 Announce Type: new Abstract: Scientific discovery increasingly depends on high-throughput characterization, yet automation is hindered by pro
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
AgentFixer: From Failure Detection to Fix Recommendations in LLM Agentic Systems
arXiv:2603.29848v1 Announce Type: new Abstract: We introduce a comprehensive validation framework for LLM-based agentic systems that provides systematic diagnos
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Spatiotemporal Robustness of Temporal Logic Tasks using Multi-Objective Reasoning
arXiv:2603.29868v1 Announce Type: new Abstract: The reliability of autonomous systems depends on their robustness, i.e., their ability to meet their objectives
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
ShapE-GRPO: Shapley-Enhanced Reward Allocation for Multi-Candidate LLM Training
arXiv:2603.29871v1 Announce Type: new Abstract: In user-agent interaction scenarios such as recommendation, brainstorming, and code suggestion, Large Language M
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
1w ago
A Rational Account of Categorization Based on Information Theory
arXiv:2603.29895v1 Announce Type: new Abstract: We present a new theory of categorization based on an information-theoretic rational analysis. To evaluate this
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation
arXiv:2603.29902v1 Announce Type: new Abstract: Interleaved text-and-image generation represents a significant frontier for Multimodal Large Language Models (ML
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
C-TRAIL: A Commonsense World Framework for Trajectory Planning in Autonomous Driving
arXiv:2603.29908v1 Announce Type: new Abstract: Trajectory planning for autonomous driving increasingly leverages large language models (LLMs) for commonsense r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Uncertainty Gating for Cost-Aware Explainable Artificial Intelligence
arXiv:2603.29915v1 Announce Type: new Abstract: Post-hoc explanation methods are widely used to interpret black-box predictions, but their generation is often c
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
1w ago
ScoringBench: A Benchmark for Evaluating Tabular Foundation Models with Proper Scoring Rules
arXiv:2603.29928v1 Announce Type: new Abstract: Tabular foundation models such as TabPFN and TabICL already produce full predictive distributions yet prevailing
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Physiological and Semantic Patterns in Medical Teams Using an Intelligent Tutoring System
arXiv:2603.29950v1 Announce Type: new Abstract: Effective collaboration requires teams to manage complex cognitive and emotional states through Socially Shared
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Structured Intent as a Protocol-Like Communication Layer: Cross-Model Robustness, Framework Comparison, and the Weak-Model Compensation Effect
arXiv:2603.29953v1 Announce Type: new Abstract: How reliably can structured intent representations preserve user goals across different AI models, languages, an
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Extending MONA in Camera Dropbox: Reproduction, Learned Approval, and Design Implications for Reward-Hacking Mitigation
arXiv:2603.29993v1 Announce Type: new Abstract: Myopic Optimization with Non-myopic Approval (MONA) mitigates multi-step reward hacking by restricting the agent
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
The Triadic Cognitive Architecture: Bounding Autonomous Action via Spatio-Temporal and Epistemic Friction
arXiv:2603.30031v1 Announce Type: new Abstract: Current autonomous AI agents, driven primarily by Large Language Models (LLMs), operate in a state of cognitive
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
The Last Fingerprint: How Markdown Training Shapes LLM Prose
arXiv:2603.27006v1 Announce Type: cross Abstract: Large language models produce em dashes at varying rates, and the observation that some models "overuse" them
DeepCamp AI