📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours · View all news
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
AgentFixer: From Failure Detection to Fix Recommendations in LLM Agentic Systems
arXiv:2603.29848v1 Announce Type: new Abstract: We introduce a comprehensive validation framework for LLM-based agentic systems that provides systematic diagnos
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Spatiotemporal Robustness of Temporal Logic Tasks using Multi-Objective Reasoning
arXiv:2603.29868v1 Announce Type: new Abstract: The reliability of autonomous systems depends on their robustness, i.e., their ability to meet their objectives
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
ShapE-GRPO: Shapley-Enhanced Reward Allocation for Multi-Candidate LLM Training
arXiv:2603.29871v1 Announce Type: new Abstract: In user-agent interaction scenarios such as recommendation, brainstorming, and code suggestion, Large Language M
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
1w ago
A Rational Account of Categorization Based on Information Theory
arXiv:2603.29895v1 Announce Type: new Abstract: We present a new theory of categorization based on an information-theoretic rational analysis. To evaluate this
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation
arXiv:2603.29902v1 Announce Type: new Abstract: Interleaved text-and-image generation represents a significant frontier for Multimodal Large Language Models (ML
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
C-TRAIL: A Commonsense World Framework for Trajectory Planning in Autonomous Driving
arXiv:2603.29908v1 Announce Type: new Abstract: Trajectory planning for autonomous driving increasingly leverages large language models (LLMs) for commonsense r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Uncertainty Gating for Cost-Aware Explainable Artificial Intelligence
arXiv:2603.29915v1 Announce Type: new Abstract: Post-hoc explanation methods are widely used to interpret black-box predictions, but their generation is often c
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
1w ago
ScoringBench: A Benchmark for Evaluating Tabular Foundation Models with Proper Scoring Rules
arXiv:2603.29928v1 Announce Type: new Abstract: Tabular foundation models such as TabPFN and TabICL already produce full predictive distributions yet prevailing
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Physiological and Semantic Patterns in Medical Teams Using an Intelligent Tutoring System
arXiv:2603.29950v1 Announce Type: new Abstract: Effective collaboration requires teams to manage complex cognitive and emotional states through Socially Shared
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Structured Intent as a Protocol-Like Communication Layer: Cross-Model Robustness, Framework Comparison, and the Weak-Model Compensation Effect
arXiv:2603.29953v1 Announce Type: new Abstract: How reliably can structured intent representations preserve user goals across different AI models, languages, an
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Extending MONA in Camera Dropbox: Reproduction, Learned Approval, and Design Implications for Reward-Hacking Mitigation
arXiv:2603.29993v1 Announce Type: new Abstract: Myopic Optimization with Non-myopic Approval (MONA) mitigates multi-step reward hacking by restricting the agent
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
The Triadic Cognitive Architecture: Bounding Autonomous Action via Spatio-Temporal and Epistemic Friction
arXiv:2603.30031v1 Announce Type: new Abstract: Current autonomous AI agents, driven primarily by Large Language Models (LLMs), operate in a state of cognitive
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
The Last Fingerprint: How Markdown Training Shapes LLM Prose
arXiv:2603.27006v1 Announce Type: cross Abstract: Large language models produce em dashes at varying rates, and the observation that some models "overuse" them
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Focus360: Guiding User Attention in Immersive Videos for VR
arXiv:2603.28774v1 Announce Type: cross Abstract: This demo introduces Focus360, a system designed to enhance user engagement in 360{\deg} VR videos by guiding
ArXiv cs.AI
💻 AI-Assisted Coding
📄 Paper
⚡ AI Lesson
1w ago
DF-ACBlurGAN: Structure-Aware Conditional Generation of Internally Repeated Patterns for Biomaterial Microtopography Design
arXiv:2603.28776v1 Announce Type: cross Abstract: Learning to generate images with internally repeated and periodic structures poses a fundamental challenge for
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
1w ago
Byzantine-Robust and Communication-Efficient Distributed Training: Compressive and Cyclic Gradient Coding
arXiv:2603.28780v1 Announce Type: cross Abstract: In this paper, we study the problem of distributed training (DT) under Byzantine attacks with communication co
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
1w ago
A Multi-Modal Dataset for Ground Reaction Force Estimation Using Consumer Wearable Sensors
arXiv:2603.28784v1 Announce Type: cross Abstract: This Data Descriptor presents a fully open, multi-modal dataset for estimating vertical ground reaction force
ArXiv cs.AI
🛠️ AI Tools & Apps
📄 Paper
⚡ AI Lesson
1w ago
AI in Work-Based Learning: Understanding the Purposes and Effects of Intelligent Tools Among Student Interns
arXiv:2603.28786v1 Announce Type: cross Abstract: This study examined how student interns in Philippine higher education use intelligent tools during their OJT.
ArXiv cs.AI
🛡️ AI Safety & Ethics
📄 Paper
⚡ AI Lesson
1w ago
Smartphone-Based Identification of Unknown Liquids via Active Vibration Sensing
arXiv:2603.28787v1 Announce Type: cross Abstract: Traditional liquid identification instruments are often unavailable to the general public. This paper shows th
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
1w ago
StepCache: Step-Level Reuse with Lightweight Verification and Selective Patching for LLM Serving
arXiv:2603.28795v1 Announce Type: cross Abstract: We address LLM serving workloads where repeated requests share a common solution structure but differ in local
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
1w ago
GaloisSAT: Differentiable Boolean Satisfiability Solving via Finite Field Algebra
arXiv:2603.28796v1 Announce Type: cross Abstract: Boolean satisfiability (SAT) problem, the first problem proven to be NP-complete, has become a fundamental cha
ArXiv cs.AI
🛡️ AI Safety & Ethics
📄 Paper
1w ago
Design and Development of an ML/DL Attack Resistance of RC-Based PUF for IoT Security
arXiv:2603.28798v1 Announce Type: cross Abstract: Physically Unclonable Functions (PUFs) provide promising hardware security for IoT authentication, leveraging
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
1w ago
CREST: Constraint-Release Execution for Multi-Robot Warehouse Shelf Rearrangement
arXiv:2603.28803v1 Announce Type: cross Abstract: Double-Deck Multi-Agent Pickup and Delivery (DD-MAPD) models the multi-robot shelf rearrangement problem in au
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
1w ago
WAter: A Workload-Adaptive Knob Tuning System based on Workload Compression
arXiv:2603.28809v1 Announce Type: cross Abstract: Selecting appropriate values for the configurable parameters of Database Management Systems (DBMS) to improve
DeepCamp AI