3,169 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,169 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (8687) ArXiv cs.AIForbes InnovationOpenAI NewsDev.to AIHugging Face BlogHackernoon
ArXiv cs.AI 🛠️ AI Tools & Apps 📄 Paper ⚡ AI Lesson 4d ago
LitPivot: Developing Well-Situated Research Ideas Through Dynamic Contextualization and Critique within the Literature Landscape
arXiv:2604.02600v1 Announce Type: cross Abstract: Developing a novel research idea is hard. It must be distinct enough from prior work to claim a contribution w
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Poison Once, Exploit Forever: Environment-Injected Memory Poisoning Attacks on Web Agents
arXiv:2604.02623v1 Announce Type: cross Abstract: Memory makes LLM-based web agents personalized, powerful, yet exploitable. By storing past interactions to per
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
Smart Transfer: Leveraging Vision Foundation Model for Rapid Building Damage Mapping with Post-Earthquake VHR Imagery
arXiv:2604.02627v1 Announce Type: cross Abstract: Living in a changing climate, human society now faces more frequent and severe natural disasters than ever bef
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
Toys that listen, talk, and play: Understanding Children's Sensemaking and Interactions with AI Toys
arXiv:2604.02629v1 Announce Type: cross Abstract: Generative AI (genAI) is increasingly being integrated into children's everyday lives, not only through screen
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Analytic Drift Resister for Non-Exemplar Continual Graph Learning
arXiv:2604.02633v1 Announce Type: cross Abstract: Non-Exemplar Continual Graph Learning (NECGL) seeks to eliminate the privacy risks intrinsic to rehearsal-base
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
Cross-Vehicle 3D Geometric Consistency for Self-Supervised Surround Depth Estimation on Articulated Vehicles
arXiv:2604.02639v1 Announce Type: cross Abstract: Surround depth estimation provides a cost-effective alternative to LiDAR for 3D perception in autonomous drivi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Speaking of Language: Reflections on Metalanguage Research in NLP
arXiv:2604.02645v1 Announce Type: cross Abstract: This work aims to shine a spotlight on the topic of metalanguage. We first define metalanguage, link it to NLP
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers
arXiv:2604.02648v1 Announce Type: cross Abstract: The autonomous discovery of bugs remains a significant challenge in modern software development. Compared to c
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 4d ago
Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training
arXiv:2604.02651v1 Announce Type: cross Abstract: Graph neural networks (GNNs) are widely used for learning on graph datasets derived from various real-world sc
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Generalization Limits of Reinforcement Learning Alignment
arXiv:2604.02652v1 Announce Type: cross Abstract: The safety of large language models (LLMs) relies on alignment techniques such as reinforcement learning from
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Low-Rank Compression of Pretrained Models via Randomized Subspace Iteration
arXiv:2604.02659v1 Announce Type: cross Abstract: The massive scale of pretrained models has made efficient compression essential for practical deployment. Low-
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Too Polite to Disagree: Understanding Sycophancy Propagation in Multi-Agent Systems
arXiv:2604.02668v1 Announce Type: cross Abstract: Large language models (LLMs) often exhibit sycophancy: agreement with user stance even when it conflicts with
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Do Agent Societies Develop Intellectual Elites? The Hidden Power Laws of Collective Cognition in LLM Multi-Agent Systems
arXiv:2604.02674v1 Announce Type: cross Abstract: Large Language Model (LLM) multi-agent systems are increasingly deployed as interacting agent societies, yet s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Eligibility-Aware Evidence Synthesis: An Agentic Framework for Clinical Trial Meta-Analysis
arXiv:2604.02678v1 Announce Type: cross Abstract: Clinical evidence synthesis requires identifying relevant trials from large registries and aggregating results
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Finding Belief Geometries with Sparse Autoencoders
arXiv:2604.02685v1 Announce Type: cross Abstract: Understanding the geometric structure of internal representations is a central goal of mechanistic interpretab
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Beyond Semantic Manipulation: Token-Space Attacks on Reward Models
arXiv:2604.02686v1 Announce Type: cross Abstract: Reward models (RMs) are widely used as optimization targets in reinforcement learning from human feedback (RLH
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Efficient3D: A Unified Framework for Adaptive and Debiased Token Reduction in 3D MLLMs
arXiv:2604.02689v1 Announce Type: cross Abstract: Recent advances in Multimodal Large Language Models (MLLMs) have expanded reasoning capabilities into 3D domai
ArXiv cs.AI 🛡️ AI Safety & Ethics 📄 Paper ⚡ AI Lesson 4d ago
DocShield: Towards AI Document Safety via Evidence-Grounded Agentic Reasoning
arXiv:2604.02694v1 Announce Type: cross Abstract: The rapid progress of generative AI has enabled increasingly realistic text-centric image forgeries, posing ma
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Trivial Vocabulary Bans Improve LLM Reasoning More Than Deep Linguistic Constraints
arXiv:2604.02699v1 Announce Type: cross Abstract: A previous study reported that E-Prime (English without the verb "to be") selectively altered reasoning in lan
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Evaluating the Formal Reasoning Capabilities of Large Language Models through Chomsky Hierarchy
arXiv:2604.02709v1 Announce Type: cross Abstract: The formal reasoning capabilities of LLMs are crucial for advancing automated software engineering. However, e
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
V2X-QA: A Comprehensive Reasoning Dataset and Benchmark for Multimodal Large Language Models in Autonomous Driving Across Ego, Infrastructure, and Cooperative Views
arXiv:2604.02710v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) have shown strong potential for autonomous driving, yet existing benc
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
MOMO: Mars Orbital Model Foundation Model for Mars Orbital Applications
arXiv:2604.02719v1 Announce Type: cross Abstract: We introduce MOMO, the first multi-sensor foundation model for Mars remote sensing. MOMO uses model merge to i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
IndustryCode: A Benchmark for Industry Code Generation
arXiv:2604.02729v1 Announce Type: cross Abstract: Code generation and comprehension by Large Language Models (LLMs) have emerged as core drivers of industrial i
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
Cross Event Detection and Topic Evolution Mining in cross events for Man Made Disasters in Social Media Streams
arXiv:2604.02740v1 Announce Type: cross Abstract: Social media is widely used to share information globally and it also aids to gain attention from the world. W