📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours

Toward Consistent World Models with Multi-Token Prediction and Latent Semantic Enhancement

arXiv:2604.06155v1 Announce Type: cross Abstract: Whether Large Language Models (LLMs) develop coherent internal world models remains a core debate. While conve

ArXiv cs.AI 📄 Paper 5h ago

MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control

arXiv:2604.06156v1 Announce Type: cross Abstract: MLLMs have been successfully applied to multimodal embedding tasks, yet their generative reasoning capabilitie

ArXiv cs.AI 📄 Paper 5h ago

DiffHDR: Re-Exposing LDR Videos with Video Diffusion Models

arXiv:2604.06161v1 Announce Type: cross Abstract: Most digital videos are stored in 8-bit low dynamic range (LDR) formats, where much of the original high dynam

ArXiv cs.AI 📄 Paper 5h ago

In-Place Test-Time Training

arXiv:2604.06169v1 Announce Type: cross Abstract: The static "train then deploy" paradigm fundamentally limits Large Language Models (LLMs) from dynamically ad

ArXiv cs.AI 📄 Paper 5h ago

Solving a Stackelberg Game on Transportation Networks in a Dynamic Crime Scenario: A Mixed Approach on Multi-Layer Networks

arXiv:2406.14514v4 Announce Type: replace Abstract: Interdicting a criminal with limited police resources is a challenging task as the criminal changes location

ArXiv cs.AI 📄 Paper 5h ago

UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces

arXiv:2505.00472v2 Announce Type: replace Abstract: Agentic Artificial Intelligence (AI) constitutes a transformative paradigm in the evolution of intelligent a

ArXiv cs.AI 📄 Paper 5h ago

Advancing AI Research Assistants with Expert-Involved Learning

arXiv:2505.04638v5 Announce Type: replace Abstract: Large language models (LLMs) and large multimodal models (LMMs) promise to accelerate biomedical discovery,

ArXiv cs.AI 📄 Paper 5h ago

Beyond Syntax: Action Semantics Learning for App Agents

arXiv:2506.17697v3 Announce Type: replace Abstract: The recent development of Large Language Models (LLMs) enables the rise of App agents that interpret user in

ArXiv cs.AI 📄 Paper 5h ago

URSA: The Universal Research and Scientific Agent

arXiv:2506.22653v2 Announce Type: replace Abstract: Large language models (LLMs) have moved far beyond their initial form as simple chatbots, now carrying out c

ArXiv cs.AI 📄 Paper 5h ago

MedGemma Technical Report

arXiv:2507.05201v4 Announce Type: replace Abstract: Artificial intelligence (AI) has significant potential in healthcare applications, but its training and depl

ArXiv cs.AI 📄 Paper 5h ago

Modelling Cascading Physical Climate Risk in Supply Chains with Adaptive Firms: A Spatial Agent-Based Framework

arXiv:2509.18633v4 Announce Type: replace Abstract: We present an open-source Python framework for modelling cascading physical climate risk in a spatial supply

ArXiv cs.AI 📄 Paper 5h ago

Multiplayer Nash Preference Optimization

arXiv:2509.23102v3 Announce Type: replace Abstract: Reinforcement learning from human feedback (RLHF) has emerged as the standard paradigm for aligning large la

ArXiv cs.AI 📄 Paper 5h ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

arXiv:2509.25454v4 Announce Type: replace Abstract: Although RLVR has become an essential component for developing advanced reasoning skills in language models,

ArXiv cs.AI 📄 Paper 5h ago

Hypothesis-Driven Feature Manifold Analysis in LLMs via Supervised Multi-Dimensional Scaling

arXiv:2510.01025v2 Announce Type: replace Abstract: The linear representation hypothesis states that language models (LMs) encode concepts as directions in thei

ArXiv cs.AI 📄 Paper 5h ago

TS-Agent: Understanding and Reasoning Over Raw Time Series via Iterative Insight Gathering

arXiv:2510.07432v2 Announce Type: replace Abstract: Large language models (LLMs) exhibit strong symbolic and compositional reasoning, yet they struggle with tim

ArXiv cs.AI 📄 Paper 5h ago

DRIFT: Decompose, Retrieve, Illustrate, then Formalize Theorems

arXiv:2510.10815v4 Announce Type: replace Abstract: Automating the formalization of mathematical statements for theorem proving remains a major challenge for La

ArXiv cs.AI 📄 Paper 5h ago

Toward Virtuous Reinforcement Learning: A Critique and Roadmap

arXiv:2512.04246v2 Announce Type: replace Abstract: This paper critiques common patterns in machine ethics for Reinforcement Learning (RL) and argues for a virt

ArXiv cs.AI 📄 Paper 5h ago

Robust AI Security and Alignment: A Sisyphean Endeavor?

arXiv:2512.10100v2 Announce Type: replace Abstract: This manuscript establishes information-theoretic limitations for robustness of AI security and alignment by

ArXiv cs.AI 📄 Paper 5h ago

EchoTrail-GUI: Building Actionable Memory for GUI Agents via Critic-Guided Self-Exploration

arXiv:2512.19396v2 Announce Type: replace Abstract: Contemporary GUI agents, while increasingly capable due to advances in Large Vision-Language Models (VLMs),

ArXiv cs.AI 📄 Paper 5h ago

RL-VLA$^3$: A Flexible and Asynchronous Reinforcement Learning Framework for VLA Training

arXiv:2602.05765v2 Announce Type: replace Abstract: Reinforcement learning (RL) has emerged as a critical paradigm for post-training Vision-Language-Action (VLA

ArXiv cs.AI 📄 Paper 5h ago

Emergent Introspection in AI is Content-Agnostic

arXiv:2603.05414v2 Announce Type: replace Abstract: Introspection is a foundational cognitive ability, but its mechanism is not well understood. Recent work has

ArXiv cs.AI 📄 Paper 5h ago

AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling

arXiv:2603.21357v2 Announce Type: replace Abstract: LLM agents fail on the majority of real-world tasks -- GPT-4o succeeds on fewer than 15% of WebArena navigat

ArXiv cs.AI 📄 Paper 5h ago

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

arXiv:2604.01591v2 Announce Type: replace Abstract: We introduce ThinkTwice, a simple two-phase framework that jointly optimizes LLMs to solve reasoning problem

ArXiv cs.AI 📄 Paper 5h ago

Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models

arXiv:2407.14971v3 Announce Type: replace-cross Abstract: Vision-Language Models (VLMs) rely heavily on pretrained vision encoders to support downstream tasks s