📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,169 articles · Updated every 3 hours

arXiv:2604.02600v1 Announce Type: cross Abstract: Developing a novel research idea is hard. It must be distinct enough from prior work to claim a contribution w

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Poison Once, Exploit Forever: Environment-Injected Memory Poisoning Attacks on Web Agents

arXiv:2604.02623v1 Announce Type: cross Abstract: Memory makes LLM-based web agents personalized, powerful, yet exploitable. By storing past interactions to per

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago

Smart Transfer: Leveraging Vision Foundation Model for Rapid Building Damage Mapping with Post-Earthquake VHR Imagery

arXiv:2604.02627v1 Announce Type: cross Abstract: Living in a changing climate, human society now faces more frequent and severe natural disasters than ever bef

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago

Toys that listen, talk, and play: Understanding Children's Sensemaking and Interactions with AI Toys

arXiv:2604.02629v1 Announce Type: cross Abstract: Generative AI (genAI) is increasingly being integrated into children's everyday lives, not only through screen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Analytic Drift Resister for Non-Exemplar Continual Graph Learning

arXiv:2604.02633v1 Announce Type: cross Abstract: Non-Exemplar Continual Graph Learning (NECGL) seeks to eliminate the privacy risks intrinsic to rehearsal-base

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago

Cross-Vehicle 3D Geometric Consistency for Self-Supervised Surround Depth Estimation on Articulated Vehicles

arXiv:2604.02639v1 Announce Type: cross Abstract: Surround depth estimation provides a cost-effective alternative to LiDAR for 3D perception in autonomous drivi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Speaking of Language: Reflections on Metalanguage Research in NLP

arXiv:2604.02645v1 Announce Type: cross Abstract: This work aims to shine a spotlight on the topic of metalanguage. We first define metalanguage, link it to NLP

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers

arXiv:2604.02648v1 Announce Type: cross Abstract: The autonomous discovery of bugs remains a significant challenge in modern software development. Compared to c

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 4d ago

Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training

arXiv:2604.02651v1 Announce Type: cross Abstract: Graph neural networks (GNNs) are widely used for learning on graph datasets derived from various real-world sc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Generalization Limits of Reinforcement Learning Alignment

arXiv:2604.02652v1 Announce Type: cross Abstract: The safety of large language models (LLMs) relies on alignment techniques such as reinforcement learning from

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Low-Rank Compression of Pretrained Models via Randomized Subspace Iteration

arXiv:2604.02659v1 Announce Type: cross Abstract: The massive scale of pretrained models has made efficient compression essential for practical deployment. Low-

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Too Polite to Disagree: Understanding Sycophancy Propagation in Multi-Agent Systems

arXiv:2604.02668v1 Announce Type: cross Abstract: Large language models (LLMs) often exhibit sycophancy: agreement with user stance even when it conflicts with

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Do Agent Societies Develop Intellectual Elites? The Hidden Power Laws of Collective Cognition in LLM Multi-Agent Systems

arXiv:2604.02674v1 Announce Type: cross Abstract: Large Language Model (LLM) multi-agent systems are increasingly deployed as interacting agent societies, yet s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Eligibility-Aware Evidence Synthesis: An Agentic Framework for Clinical Trial Meta-Analysis

arXiv:2604.02678v1 Announce Type: cross Abstract: Clinical evidence synthesis requires identifying relevant trials from large registries and aggregating results

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Finding Belief Geometries with Sparse Autoencoders

arXiv:2604.02685v1 Announce Type: cross Abstract: Understanding the geometric structure of internal representations is a central goal of mechanistic interpretab

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Beyond Semantic Manipulation: Token-Space Attacks on Reward Models

arXiv:2604.02686v1 Announce Type: cross Abstract: Reward models (RMs) are widely used as optimization targets in reinforcement learning from human feedback (RLH

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Efficient3D: A Unified Framework for Adaptive and Debiased Token Reduction in 3D MLLMs

arXiv:2604.02689v1 Announce Type: cross Abstract: Recent advances in Multimodal Large Language Models (MLLMs) have expanded reasoning capabilities into 3D domai

ArXiv cs.AI 🛡️ AI Safety & Ethics 📄 Paper ⚡ AI Lesson 4d ago

DocShield: Towards AI Document Safety via Evidence-Grounded Agentic Reasoning

arXiv:2604.02694v1 Announce Type: cross Abstract: The rapid progress of generative AI has enabled increasingly realistic text-centric image forgeries, posing ma

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Trivial Vocabulary Bans Improve LLM Reasoning More Than Deep Linguistic Constraints

arXiv:2604.02699v1 Announce Type: cross Abstract: A previous study reported that E-Prime (English without the verb "to be") selectively altered reasoning in lan

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Evaluating the Formal Reasoning Capabilities of Large Language Models through Chomsky Hierarchy

arXiv:2604.02709v1 Announce Type: cross Abstract: The formal reasoning capabilities of LLMs are crucial for advancing automated software engineering. However, e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

V2X-QA: A Comprehensive Reasoning Dataset and Benchmark for Multimodal Large Language Models in Autonomous Driving Across Ego, Infrastructure, and Cooperative Views

arXiv:2604.02710v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) have shown strong potential for autonomous driving, yet existing benc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

MOMO: Mars Orbital Model Foundation Model for Mars Orbital Applications

arXiv:2604.02719v1 Announce Type: cross Abstract: We introduce MOMO, the first multi-sensor foundation model for Mars remote sensing. MOMO uses model merge to i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

IndustryCode: A Benchmark for Industry Code Generation

arXiv:2604.02729v1 Announce Type: cross Abstract: Code generation and comprehension by Large Language Models (LLMs) have emerged as core drivers of industrial i

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago

Cross Event Detection and Topic Evolution Mining in cross events for Man Made Disasters in Social Media Streams

arXiv:2604.02740v1 Announce Type: cross Abstract: Social media is widely used to share information globally and it also aids to gain attention from the world. W