📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,169 articles · Updated every 3 hours

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

arXiv:2603.00495v2 Announce Type: replace Abstract: We introduce AI Runtime Infrastructure, a distinct execution-time layer that operates above the model and be

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

DeepFact: Co-Evolving Benchmarks and Agents for Deep Research Factuality

arXiv:2603.05912v2 Announce Type: replace Abstract: Search-augmented LLM agents can produce deep research reports (DRRs), but verifying claim-level factuality r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation

arXiv:2603.08388v4 Announce Type: replace Abstract: We propose a Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation (H

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Collective AI can amplify tiny perturbations into divergent decisions

arXiv:2603.09127v2 Announce Type: replace Abstract: Large language models are increasingly deployed not as single assistants but as committees whose members del

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

A Self-Evolving Defect Detection Framework for Industrial Photovoltaic Systems

arXiv:2603.14869v2 Announce Type: replace Abstract: Reliable photovoltaic (PV) power generation requires timely detection of module defects that may reduce ener

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometric and Neuromorphic AI

arXiv:2603.18104v2 Announce Type: replace Abstract: Prevailing AI training infrastructure assumes reverse-mode automatic differentiation over IEEE-754 arithmeti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

An Onto-Relational-Sophic Framework for Governing Synthetic Minds

arXiv:2603.18633v2 Announce Type: replace Abstract: The rapid evolution of artificial intelligence, from task-specific systems to foundation models exhibiting b

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

AgentSocialBench: Evaluating Privacy Risks in Human-Centered Agentic Social Networks

arXiv:2604.01487v2 Announce Type: replace Abstract: With the rise of personalized, persistent LLM agent frameworks such as OpenClaw, human-centered agentic soci

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Domain-constrained knowledge representation: A modal framework

arXiv:2604.01770v2 Announce Type: replace Abstract: Knowledge graphs store large numbers of relations efficiently, but they remain weak at representing a quiete

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Mitigating Value Hallucination in Dyna Planning via Multistep Predecessor Models

arXiv:2006.04363v2 Announce Type: replace-cross Abstract: Dyna-style reinforcement learning (RL) agents improve sample efficiency over model-free RL agents by u

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model

arXiv:2406.14194v3 Announce Type: replace-cross Abstract: The emergence of Large Vision-Language Models (LVLMs) marks significant strides towards achieving gene

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Detecting and Characterising Mobile App Metamorphosis in Google Play Store

arXiv:2407.14565v2 Announce Type: replace-cross Abstract: App markets have evolved into highly competitive and dynamic environments for developers. While the tr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

MegaFake: A Theory-Driven Dataset of Fake News Generated by Large Language Models

arXiv:2408.11871v3 Announce Type: replace-cross Abstract: Fake news significantly influences decision-making processes by misleading individuals, organizations,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

SPRIG: Improving Large Language Model Performance by System Prompt Optimization

arXiv:2410.14826v3 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have shown impressive capabilities in many scenarios, but their performan

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

arXiv:2410.21169v5 Announce Type: replace-cross Abstract: Document parsing (DP) transforms unstructured or semi-structured documents into structured, machine-re

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Certified Training with Branch-and-Bound for Lyapunov-stable Neural Control

arXiv:2411.18235v3 Announce Type: replace-cross Abstract: We study the problem of learning verifiably Lyapunov-stable neural controllers that provably satisfy t

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Talk to Right Specialists: Iterative Routing in Multi-agent Systems for Question Answering

arXiv:2501.07813v2 Announce Type: replace-cross Abstract: Retrieval-augmented generation (RAG) agents are increasingly deployed to answer questions over local k

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Human-AI Collaborative Game Testing with Vision Language Models

arXiv:2501.11782v2 Announce Type: replace-cross Abstract: As modern video games become increasingly complex, traditional manual testing methods are proving cost

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 2d ago

Post-detection inference for sequential changepoint localization

arXiv:2502.06096v5 Announce Type: replace-cross Abstract: This paper addresses a fundamental but largely unexplored challenge in sequential changepoint analysis

ArXiv cs.AI 🛡️ AI Safety & Ethics 📄 Paper ⚡ AI Lesson 2d ago

Cyber-Physical Systems Security: A Comprehensive Review of Anomaly Detection Techniques

arXiv:2502.13256v2 Announce Type: replace-cross Abstract: In an increasingly interconnected world, Cyber-Physical Systems (CPS) are essential to critical indust

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Implicit Bias-Like Patterns in Reasoning Models

arXiv:2503.11572v4 Announce Type: replace-cross Abstract: Implicit biases refer to automatic mental processes that shape perceptions, judgments, and behaviors.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

BalancedDPO: Adaptive Multi-Metric Alignment

arXiv:2503.12575v2 Announce Type: replace-cross Abstract: Diffusion models have achieved remarkable progress in text-to-image generation, yet aligning them with

ArXiv cs.AI 🛠️ AI Tools & Apps 📄 Paper ⚡ AI Lesson 2d ago

Open3DBench: Open-Source Benchmark for 3D-IC Backend Implementation and PPA Evaluation

arXiv:2503.12946v2 Announce Type: replace-cross Abstract: This work introduces Open3DBench, an open-source 3D-IC backend implementation benchmark built upon the

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 2d ago

Causality-Based Scores Alignment in Explainable Data Management

arXiv:2503.14469v5 Announce Type: replace-cross Abstract: Different attribution scores have been proposed to quantify the relevance of database tuples for query