📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours
ArXiv cs.AI
📄 Paper
2d ago
DeepTest Tool Competition 2026: Benchmarking an LLM-Based Automotive Assistant
arXiv:2604.12615v1 Announce Type: new Abstract: This report summarizes the results of the first edition of the Large Language Model (LLM) Testing competition, h
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
2d ago
Every Picture Tells a Dangerous Story: Memory-Augmented Multi-Agent Jailbreak Attacks on VLMs
arXiv:2604.12616v1 Announce Type: new Abstract: The rapid evolution of Vision-Language Models (VLMs) has catalyzed unprecedented capabilities in artificial inte
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2d ago
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance
arXiv:2604.12627v1 Announce Type: new Abstract: RLVR improves reasoning in large language models, but its effectiveness is often limited by severe reward sparsi
ArXiv cs.AI
📄 Paper
2d ago
RPRA: Predicting an LLM-Judge for Efficient but Performant Inference
arXiv:2604.12634v1 Announce Type: new Abstract: Large language models (LLMs) face a fundamental trade-off between computational efficiency (e.g., number of para
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
2d ago
Broadening the Applicability of Conditional Syntax Splitting for Reasoning from Conditional Belief Bases
arXiv:2604.12660v1 Announce Type: new Abstract: In nonmonotonic reasoning from conditional belief bases, an inference operator satisfying syntax splitting postu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2d ago
Human-Centric Topic Modeling with Goal-Prompted Contrastive Learning and Optimal Transport
arXiv:2604.12663v1 Announce Type: new Abstract: Existing topic modeling methods, from LDA to recent neural and LLM-based approaches, which focus mainly on stati
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
2d ago
Safe reinforcement learning with online filtering for fatigue-predictive human-robot task planning and allocation in production
arXiv:2604.12667v1 Announce Type: new Abstract: Human-robot collaborative manufacturing, a core aspect of Industry 5.0, emphasizes ergonomics to enhance worker
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
2d ago
A hierarchical spatial-aware algorithm with efficient reinforcement learning for human-robot task planning and allocation in production
arXiv:2604.12669v1 Announce Type: new Abstract: In advanced manufacturing systems, humans and robots collaborate to conduct the production process. Effective ta
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2d ago
MISID: A Multimodal Multi-turn Dataset for Complex Intent Recognition in Strategic Deception Games
arXiv:2604.12700v1 Announce Type: new Abstract: Understanding human intent in complex multi-turn interactions remains a fundamental challenge in human-computer
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
2d ago
Transferable Expertise for Autonomous Agents via Real-World Case-Based Learning
arXiv:2604.12717v1 Announce Type: new Abstract: LLM-based autonomous agents perform well on general reasoning tasks but still struggle to reliably use task stru
ArXiv cs.AI
🛠️ AI Tools & Apps
📄 Paper
⚡ AI Lesson
2d ago
Can AI Tools Transform Low-Demand Math Tasks? An Evaluation of Task Modification Capabilities
arXiv:2604.12743v1 Announce Type: new Abstract: While recent research has explored AI tools' ability to classify the quality of mathematical tasks (arXiv:2603.0
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2d ago
DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding
arXiv:2604.12812v1 Announce Type: new Abstract: Existing Multimodal Large Language Models (MLLMs) suffer from significant performance degradation on the long do
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2d ago
RePAIR: Interactive Machine Unlearning through Prompt-Aware Model Repair
arXiv:2604.12820v1 Announce Type: new Abstract: Large language models (LLMs) inherently absorb harmful knowledge, misinformation, and personal data during pretr
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
2d ago
Artificial Intelligence for Modeling and Simulation of Mixed Automated and Human Traffic
arXiv:2604.12857v1 Announce Type: new Abstract: Autonomous vehicles (AVs) are now operating on public roads, which makes their testing and validation more criti
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
2d ago
From edges to meaning: Semantic line sketches as a cognitive scaffold for ancient pictograph invention
arXiv:2604.12865v1 Announce Type: new Abstract: Humans readily recognize objects from sparse line drawings, a capacity that appears early in development and per
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
2d ago
QuarkMedSearch: A Long-Horizon Deep Search Agent for Exploring Medical Intelligence
arXiv:2604.12867v1 Announce Type: new Abstract: As agentic foundation models continue to evolve, how to further improve their performance in vertical domains ha
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
2d ago
LIFE -- an energy efficient advanced continual learning agentic AI framework for frontier systems
arXiv:2604.12874v1 Announce Type: new Abstract: The rapid advancement of AI has changed the character of HPC usage such as dimensioning, provisioning, and execu
ArXiv cs.AI
🛡️ AI Safety & Ethics
📄 Paper
⚡ AI Lesson
2d ago
AISafetyBenchExplorer: A Metric-Aware Catalogue of AI Safety Benchmarks Reveals Fragmented Measurement and Weak Benchmark Governance
arXiv:2604.12875v1 Announce Type: new Abstract: The rapid expansion of large language model (LLM) safety evaluation has produced a substantial benchmark ecosyst
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2d ago
BEAM: Bi-level Memory-adaptive Algorithmic Evolution for LLM-Powered Heuristic Design
arXiv:2604.12898v1 Announce Type: new Abstract: Large Language Model-based Hyper Heuristic (LHH) has recently emerged as an efficient way for automatic heuristi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2d ago
Drawing on Memory: Dual-Trace Encoding Improves Cross-Session Recall in LLM Agents
arXiv:2604.12948v1 Announce Type: new Abstract: LLM agents with persistent memory store information as flat factual records, providing little context for tempor
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2d ago
Modeling Co-Pilots for Text-to-Model Translation
arXiv:2604.12955v1 Announce Type: new Abstract: There is growing interest in leveraging large language models (LLMs) for text-to-model translation and optimizat
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
2d ago
Cycle-Consistent Search: Question Reconstructability as a Proxy Reward for Search Agent Training
arXiv:2604.12967v1 Announce Type: new Abstract: Reinforcement Learning (RL) has shown strong potential for optimizing search agents in complex information retri
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
2d ago
Bilevel Late Acceptance Hill Climbing for the Electric Capacitated Vehicle Routing Problem
arXiv:2604.13013v1 Announce Type: new Abstract: This paper tackles the Electric Capacitated Vehicle Routing Problem (E-CVRP) through a bilevel optimization fram
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
2d ago
PAL: Personal Adaptive Learner
arXiv:2604.13017v1 Announce Type: new Abstract: AI-driven education platforms have made some progress in personalisation, yet most remain constrained to static