📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 4,506 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (11721) ArXiv cs.AI Dev.to · FORUM WEB Dev.to AI Forbes Innovation OpenAI News Hugging Face Blog

LoopGuard: Breaking Self-Reinforcing Attention Loops via Dynamic KV Cache Intervention

arXiv:2604.10044v1 Announce Type: new Abstract: Through systematic experiments on long-context generation, we observe a damaging failure mode in which decoding

ArXiv cs.AI 📄 Paper 1d ago

Learning Hierarchical and Geometry-Aware Graph Representations for Text-to-CAD

arXiv:2604.10075v1 Announce Type: new Abstract: Text-to-CAD code generation is a long-horizon task that translates textual instructions into long sequences of i

ArXiv cs.AI 📄 Paper 1d ago

Ontological Trajectory Forecasting via Finite Semigroup Iteration and Lie Algebra Approximation in Geopolitical Knowledge Graphs

arXiv:2604.10087v1 Announce Type: new Abstract: We present EL-DRUIN, an ontological reasoning system for geopolitical intelligence analysis that combines formal

ArXiv cs.AI 📄 Paper 1d ago

Trust Your Memory: Verifiable Control of Smart Homes through Reinforcement Learning with Multi-dimensional Rewards

arXiv:2604.10110v1 Announce Type: new Abstract: Large Language Models (LLMs) have become a key foundation for enabling personalized smart home experiences. Whil

ArXiv cs.AI 📄 Paper 1d ago

Learning from Emptiness: De-biasing Listwise Rerankers with Content-Agnostic Probability Calibration

arXiv:2604.10150v1 Announce Type: new Abstract: Generative listwise reranking leverages global context for superior retrieval but is plagued by intrinsic positi

ArXiv cs.AI 📄 Paper 1d ago

SpecMoE: A Fast and Efficient Mixture-of-Experts Inference via Self-Assisted Speculative Decoding

arXiv:2604.10152v1 Announce Type: new Abstract: The Mixture-of-Experts (MoE) architecture has emerged as a promising approach to mitigate the rising computation

ArXiv cs.AI 📄 Paper 1d ago

Inductive Reasoning for Temporal Knowledge Graphs with Emerging Entities

arXiv:2604.10164v1 Announce Type: new Abstract: Reasoning on Temporal Knowledge Graphs (TKGs) is essential for predicting future events and time-aware facts. Wh

ArXiv cs.AI 📄 Paper 1d ago

MAVEN-T: Multi-Agent enVironment-aware Enhanced Neural Trajectory predictor with Reinforcement Learning

arXiv:2604.10169v1 Announce Type: new Abstract: Trajectory prediction remains a critical yet challenging component in autonomous driving systems, requiring soph

ArXiv cs.AI 📄 Paper 1d ago

PoreDiT: A Scalable Generative Model for Large-Scale Digital Rock Reconstruction

arXiv:2604.10171v1 Announce Type: new Abstract: This manuscript presents PoreDiT, a novel generative model designed for high-efficiency digital rock reconstruct

ArXiv cs.AI 📄 Paper 1d ago

Credit-Budgeted ICPC-Style Coding: When Agents Must Pay for Every Decision

arXiv:2604.10182v1 Announce Type: new Abstract: Current evaluations of autonomous coding agents assume an unrealistic, infinite-resource environment. However, r

ArXiv cs.AI 📄 Paper 1d ago

Edu-MMBias: A Three-Tier Multimodal Benchmark for Auditing Social Bias in Vision-Language Models under Educational Contexts

arXiv:2604.10200v1 Announce Type: new Abstract: As Vision-Language Models (VLMs) become integral to educational decision-making, ensuring their fairness is para

ArXiv cs.AI 📄 Paper 1d ago

Cognitive Pivot Points and Visual Anchoring: Unveiling and Rectifying Hallucinations in Multimodal Reasoning Models

arXiv:2604.10219v1 Announce Type: new Abstract: Multimodal Large Reasoning Models (MLRMs) have achieved remarkable strides in visual reasoning through test time

ArXiv cs.AI 📄 Paper 1d ago

SVSR: A Self-Verification and Self-Rectification Paradigm for Multimodal Reasoning

arXiv:2604.10228v1 Announce Type: new Abstract: Current multimodal models often suffer from shallow reasoning, leading to errors caused by incomplete or inconsi

ArXiv cs.AI 📄 Paper 1d ago

A Dual-Positive Monotone Parameterization for Multi-Segment Bids and a Validity Assessment Framework for Reinforcement Learning Agent-based Simulation of Electricity Markets

arXiv:2604.10252v1 Announce Type: new Abstract: Reinforcement learning agent-based simulation (RL-ABS) has become an important tool for electricity market mecha

ArXiv cs.AI 📄 Paper 1d ago

The Amazing Agent Race: Strong Tool Users, Weak Navigators

arXiv:2604.10261v1 Announce Type: new Abstract: Existing tool-use benchmarks for LLM agents are overwhelmingly linear: our analysis of six benchmarks shows 55 t

ArXiv cs.AI 📄 Paper 1d ago

STARS: Skill-Triggered Audit for Request-Conditioned Invocation Safety in Agent Systems

arXiv:2604.10286v1 Announce Type: new Abstract: Autonomous language-model agents increasingly rely on installable skills and tools to complete user tasks. Stati

ArXiv cs.AI 📄 Paper 1d ago

Dead Cognitions: A Census of Misattributed Insights

arXiv:2604.10288v1 Announce Type: new Abstract: This essay identifies a failure mode of AI chat systems that we term attribution laundering: the model performs

ArXiv cs.AI 📄 Paper 1d ago

AI Organizations are More Effective but Less Aligned than Individual Agents

arXiv:2604.10290v1 Announce Type: new Abstract: AI is increasingly deployed in multi-agent systems; however, most research considers only the behavior of indivi

ArXiv cs.AI 📄 Paper 1d ago

TimeSeriesExamAgent: Creating Time Series Reasoning Benchmarks at Scale

arXiv:2604.10291v1 Announce Type: new Abstract: Large Language Models (LLMs) have shown promising performance in time series modeling tasks, but do they truly u

ArXiv cs.AI 📄 Paper 1d ago

Gypscie: A Cross-Platform AI Artifact Management System

arXiv:2604.10311v1 Announce Type: new Abstract: Artificial Intelligence (AI) models, encompassing both traditional machine learning (ML) and more advanced appro

ArXiv cs.AI 📄 Paper 1d ago

From GPT-3 to GPT-5: Mapping their capabilities, scope, limitations, and consequences

arXiv:2604.10332v1 Announce Type: new Abstract: We present the progress of the GPT family from GPT-3 through GPT-3.5, GPT-4, GPT-4 Turbo, GPT-4o, GPT-4.1, and t

ArXiv cs.AI 📄 Paper 1d ago

Zero-shot World Models Are Developmentally Efficient Learners

arXiv:2604.10333v1 Announce Type: new Abstract: Young children demonstrate early abilities to understand their physical world, estimating depth, motion, object

ArXiv cs.AI 📄 Paper 1d ago

VeriTrans: Fine-Tuned LLM-Assisted NL-to-PL Translation via a Deterministic Neuro-Symbolic Pipeline

arXiv:2604.10341v1 Announce Type: new Abstract: \textbf{VeriTrans} is a reliability-first ML system that compiles natural-language requirements into solver-read

ArXiv cs.AI 📄 Paper 1d ago

ClawVM: Harness-Managed Virtual Memory for Stateful Tool-Using LLM Agents

arXiv:2604.10352v1 Announce Type: new Abstract: Stateful tool-using LLM agents treat the context window as working memory, yet today's agent harnesses manage re