📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (12754) ArXiv cs.AI Dev.to · FORUM WEB Dev.to AI Forbes Innovation OpenAI News Hugging Face Blog

DeepTest Tool Competition 2026: Benchmarking an LLM-Based Automotive Assistant

arXiv:2604.12615v1 Announce Type: new Abstract: This report summarizes the results of the first edition of the Large Language Model (LLM) Testing competition, h

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Every Picture Tells a Dangerous Story: Memory-Augmented Multi-Agent Jailbreak Attacks on VLMs

arXiv:2604.12616v1 Announce Type: new Abstract: The rapid evolution of Vision-Language Models (VLMs) has catalyzed unprecedented capabilities in artificial inte

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

arXiv:2604.12627v1 Announce Type: new Abstract: RLVR improves reasoning in large language models, but its effectiveness is often limited by severe reward sparsi

ArXiv cs.AI 📄 Paper 2d ago

RPRA: Predicting an LLM-Judge for Efficient but Performant Inference

arXiv:2604.12634v1 Announce Type: new Abstract: Large language models (LLMs) face a fundamental trade-off between computational efficiency (e.g., number of para

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Broadening the Applicability of Conditional Syntax Splitting for Reasoning from Conditional Belief Bases

arXiv:2604.12660v1 Announce Type: new Abstract: In nonmonotonic reasoning from conditional belief bases, an inference operator satisfying syntax splitting postu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Human-Centric Topic Modeling with Goal-Prompted Contrastive Learning and Optimal Transport

arXiv:2604.12663v1 Announce Type: new Abstract: Existing topic modeling methods, from LDA to recent neural and LLM-based approaches, which focus mainly on stati

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Safe reinforcement learning with online filtering for fatigue-predictive human-robot task planning and allocation in production

arXiv:2604.12667v1 Announce Type: new Abstract: Human-robot collaborative manufacturing, a core aspect of Industry 5.0, emphasizes ergonomics to enhance worker

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

A hierarchical spatial-aware algorithm with efficient reinforcement learning for human-robot task planning and allocation in production

arXiv:2604.12669v1 Announce Type: new Abstract: In advanced manufacturing systems, humans and robots collaborate to conduct the production process. Effective ta

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

MISID: A Multimodal Multi-turn Dataset for Complex Intent Recognition in Strategic Deception Games

arXiv:2604.12700v1 Announce Type: new Abstract: Understanding human intent in complex multi-turn interactions remains a fundamental challenge in human-computer

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Transferable Expertise for Autonomous Agents via Real-World Case-Based Learning

arXiv:2604.12717v1 Announce Type: new Abstract: LLM-based autonomous agents perform well on general reasoning tasks but still struggle to reliably use task stru

ArXiv cs.AI 🛠️ AI Tools & Apps 📄 Paper ⚡ AI Lesson 2d ago

Can AI Tools Transform Low-Demand Math Tasks? An Evaluation of Task Modification Capabilities

arXiv:2604.12743v1 Announce Type: new Abstract: While recent research has explored AI tools' ability to classify the quality of mathematical tasks (arXiv:2603.0

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding

arXiv:2604.12812v1 Announce Type: new Abstract: Existing Multimodal Large Language Models (MLLMs) suffer from significant performance degradation on the long do

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

RePAIR: Interactive Machine Unlearning through Prompt-Aware Model Repair

arXiv:2604.12820v1 Announce Type: new Abstract: Large language models (LLMs) inherently absorb harmful knowledge, misinformation, and personal data during pretr

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Artificial Intelligence for Modeling and Simulation of Mixed Automated and Human Traffic

arXiv:2604.12857v1 Announce Type: new Abstract: Autonomous vehicles (AVs) are now operating on public roads, which makes their testing and validation more criti

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 2d ago

From edges to meaning: Semantic line sketches as a cognitive scaffold for ancient pictograph invention

arXiv:2604.12865v1 Announce Type: new Abstract: Humans readily recognize objects from sparse line drawings, a capacity that appears early in development and per

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

QuarkMedSearch: A Long-Horizon Deep Search Agent for Exploring Medical Intelligence

arXiv:2604.12867v1 Announce Type: new Abstract: As agentic foundation models continue to evolve, how to further improve their performance in vertical domains ha

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

LIFE -- an energy efficient advanced continual learning agentic AI framework for frontier systems

arXiv:2604.12874v1 Announce Type: new Abstract: The rapid advancement of AI has changed the character of HPC usage such as dimensioning, provisioning, and execu

ArXiv cs.AI 🛡️ AI Safety & Ethics 📄 Paper ⚡ AI Lesson 2d ago

AISafetyBenchExplorer: A Metric-Aware Catalogue of AI Safety Benchmarks Reveals Fragmented Measurement and Weak Benchmark Governance

arXiv:2604.12875v1 Announce Type: new Abstract: The rapid expansion of large language model (LLM) safety evaluation has produced a substantial benchmark ecosyst

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

BEAM: Bi-level Memory-adaptive Algorithmic Evolution for LLM-Powered Heuristic Design

arXiv:2604.12898v1 Announce Type: new Abstract: Large Language Model-based Hyper Heuristic (LHH) has recently emerged as an efficient way for automatic heuristi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Drawing on Memory: Dual-Trace Encoding Improves Cross-Session Recall in LLM Agents

arXiv:2604.12948v1 Announce Type: new Abstract: LLM agents with persistent memory store information as flat factual records, providing little context for tempor

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Modeling Co-Pilots for Text-to-Model Translation

arXiv:2604.12955v1 Announce Type: new Abstract: There is growing interest in leveraging large language models (LLMs) for text-to-model translation and optimizat

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Cycle-Consistent Search: Question Reconstructability as a Proxy Reward for Search Agent Training

arXiv:2604.12967v1 Announce Type: new Abstract: Reinforcement Learning (RL) has shown strong potential for optimizing search agents in complex information retri

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 2d ago

Bilevel Late Acceptance Hill Climbing for the Electric Capacitated Vehicle Routing Problem

arXiv:2604.13013v1 Announce Type: new Abstract: This paper tackles the Electric Capacitated Vehicle Routing Problem (E-CVRP) through a bilevel optimization fram

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

PAL: Personal Adaptive Learner

arXiv:2604.13017v1 Announce Type: new Abstract: AI-driven education platforms have made some progress in personalisation, yet most remain constrained to static