📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 3,169 articles · Updated every 3 hours · View all news
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3d ago
PlayWorld: Learning Robot World Models from Autonomous Play
arXiv:2603.09030v3 Announce Type: replace-cross Abstract: Action-conditioned video models offer a promising path to building general-purpose robot simulators th
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction
arXiv:2603.10047v2 Announce Type: replace-cross Abstract: Hallucinations in large language models (LLMs) are outputs that are syntactically coherent but factual
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
3d ago
Hindsight-Anchored Policy Optimization: Turning Failure into Feedback in Sparse Reward Settings
arXiv:2603.11321v2 Announce Type: replace-cross Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a promising paradigm for post-tra
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
Truth as a Compression Artifact in Language Model Training
arXiv:2603.11749v3 Announce Type: replace-cross Abstract: Why do language models trained on contradictory data prefer correct answers? In controlled experiments
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3d ago
Security Considerations for Artificial Intelligence Agents
arXiv:2603.12230v2 Announce Type: replace-cross Abstract: This article, a lightly adapted version of Perplexity's response to NIST/CAISI Request for Information
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
Brittlebench: Quantifying LLM robustness via prompt sensitivity
arXiv:2603.13285v2 Announce Type: replace-cross Abstract: Existing evaluation methods largely rely on clean, static benchmarks, which can overestimate true mode
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3d ago
AI-Driven Predictive Maintenance with Environmental Context Integration for Connected Vehicles: Simulation, Benchmarking, and Field Validation
arXiv:2603.13343v2 Announce Type: replace-cross Abstract: Predictive maintenance for connected vehicles offers the potential to reduce unexpected breakdowns and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
Generate Then Correct: Single Shot Global Correction for Aspect Sentiment Quad Prediction
arXiv:2603.13777v2 Announce Type: replace-cross Abstract: Aspect-based sentiment analysis (ABSA) extracts aspect-level sentiment signals from user-generated tex
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3d ago
Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving
arXiv:2603.13842v2 Announce Type: replace-cross Abstract: End-to-end autonomous driving is typically built upon imitation learning (IL), yet its performance is
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
Adaptive Stopping for Multi-Turn LLM Reasoning
arXiv:2604.01413v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) increasingly rely on multi-turn reasoning and interaction, such as adapti
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3d ago
GPA: Learning GUI Process Automation from Demonstrations
arXiv:2604.01676v2 Announce Type: replace-cross Abstract: GUI Process Automation (GPA) is a lightweight but general vision-based Robotic Process Automation (RPA
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Holos: A Web-Scale LLM-Based Multi-Agent System for the Agentic Web
arXiv:2604.02334v1 Announce Type: new Abstract: As large language models (LLM)-driven agents transition from isolated task solvers to persistent digital entitie
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Xpertbench: Expert Level Tasks with Rubrics-Based Evaluation
arXiv:2604.02368v1 Announce Type: new Abstract: As Large Language Models (LLMs) exhibit plateauing performance on conventional benchmarks, a pivotal challenge p
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Compositional Neuro-Symbolic Reasoning
arXiv:2604.02434v1 Announce Type: new Abstract: We study structured abstraction-based reasoning for the Abstraction and Reasoning Corpus (ARC) and compare its g
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Understanding the Nature of Generative AI as Threshold Logic in High-Dimensional Space
arXiv:2604.02476v1 Announce Type: new Abstract: This paper examines the role of threshold logic in understanding generative artificial intelligence. Threshold f
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
AIVV: Neuro-Symbolic LLM Agent-Integrated Verification and Validation for Trustworthy Autonomous Systems
arXiv:2604.02478v1 Announce Type: new Abstract: Deep learning models excel at detecting anomaly patterns in normal data. However, they do not provide a direct s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
I must delete the evidence: AI Agents Explicitly Cover up Fraud and Violent Crime
arXiv:2604.02500v1 Announce Type: new Abstract: As ongoing research explores the ability of AI agents to be insider threats and act against company interests, w
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
4d ago
A Comprehensive Framework for Long-Term Resiliency Investment Planning under Extreme Weather Uncertainty for Electric Utilities
arXiv:2604.02504v1 Announce Type: new Abstract: Electric utilities must make massive capital investments in the coming years to respond to explosive growth in d
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Interpretable Deep Reinforcement Learning for Element-level Bridge Life-cycle Optimization
arXiv:2604.02528v1 Announce Type: new Abstract: The new Specifications for the National Bridge Inventory (SNBI), in effect from 2022, emphasize the use of eleme
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Competency Questions as Executable Plans: a Controlled RAG Architecture for Cultural Heritage Storytelling
arXiv:2604.02545v1 Announce Type: new Abstract: The preservation of intangible cultural heritage is a critical challenge as collective memory fades over time. W
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Mitigating LLM biases toward spurious social contexts using direct preference optimization
arXiv:2604.02585v1 Announce Type: new Abstract: LLMs are increasingly used for high-stakes decision-making, yet their sensitivity to spurious contextual informa
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Do Audio-Visual Large Language Models Really See and Hear?
arXiv:2604.02605v1 Announce Type: new Abstract: Audio-Visual Large Language Models (AVLLMs) are emerging as unified interfaces to multimodal perception. We pres
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
AutoVerifier: An Agentic Automated Verification Framework Using Large Language Models
arXiv:2604.02617v1 Announce Type: new Abstract: Scientific and Technical Intelligence (S&TI) analysis requires verifying complex technical claims across rapidly
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
OntoKG: Ontology-Oriented Knowledge Graph Construction with Intrinsic-Relational Routing
arXiv:2604.02618v1 Announce Type: new Abstract: Organizing a large-scale knowledge graph into a typed property graph requires structural decisions -- which enti
DeepCamp AI