📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 6,601 articles · Updated every 3 hours

On the Geometry of Receiver Operating Characteristic and Precision-Recall Curves

arXiv:2504.02169v3 Announce Type: replace-cross Abstract: We study the geometry of Receiver Operating Characteristic (ROC) and Precision-Recall (PR) curves in b

ArXiv cs.AI 📄 Paper 1w ago

Joint Flashback Adaptation for Forgetting-Resistant Instruction Tuning

arXiv:2505.15467v2 Announce Type: replace-cross Abstract: Large language models have achieved remarkable success in various tasks. However, it is challenging fo

ArXiv cs.AI 📄 Paper 1w ago

SEW: Self-Evolving Agentic Workflows for Automated Code Generation

arXiv:2505.18646v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have demonstrated effectiveness in code generation tasks. To enable LLMs

ArXiv cs.AI 📄 Paper 1w ago

Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning

arXiv:2505.19261v2 Announce Type: replace-cross Abstract: Current text-to-image diffusion generation typically employs complete-text conditioning. Due to the in

ArXiv cs.AI 📄 Paper 1w ago

SpecBranch: Speculative Decoding via Hybrid Drafting and Rollback-Aware Branch Parallelism

arXiv:2506.01979v4 Announce Type: replace-cross Abstract: Recently, speculative decoding (SD) has emerged as a promising technique to accelerate LLM inference b

ArXiv cs.AI 📄 Paper 1w ago

HSG-12M: A Large-Scale Benchmark of Spatial Multigraphs from the Energy Spectra of Non-Hermitian Crystals

arXiv:2506.08618v4 Announce Type: replace-cross Abstract: AI is transforming scientific research by revealing new ways to understand complex physical systems, b

ArXiv cs.AI 📄 Paper 1w ago

Fast AI Model Partition for Split Learning over Edge Networks

arXiv:2507.01041v4 Announce Type: replace-cross Abstract: Split learning (SL) is a distributed learning paradigm that can enable computation-intensive artificia

ArXiv cs.AI 📄 Paper 1w ago

Global optimization tailored for graphics processing units: Complete and rigorous search for large-scale nonlinear minimization

arXiv:2507.01770v4 Announce Type: replace-cross Abstract: This paper introduces a numerical method to enclose the global minimum of a nonlinear function subject

ArXiv cs.AI 📄 Paper 1w ago

Mobile GUI Agents under Real-world Threats: Are We There Yet?

arXiv:2507.04227v2 Announce Type: replace-cross Abstract: Recent years have witnessed a rapid development of mobile GUI agents powered by large language models

ArXiv cs.AI 📄 Paper 1w ago

A document is worth a structured record: Principled inductive bias design for document recognition

arXiv:2507.08458v2 Announce Type: replace-cross Abstract: Many document types use intrinsic, convention-driven structures that serve to encode precise and struc

ArXiv cs.AI 📄 Paper 1w ago

Simulation as Supervision: Mechanistic Pretraining for Scientific Discovery

arXiv:2507.08977v4 Announce Type: replace-cross Abstract: Scientific modeling faces a tradeoff between the interpretability of mechanistic theory and the predic

ArXiv cs.AI 📄 Paper 1w ago

Automatic Road Subsurface Distress Recognition from Ground Penetrating Radar Images using Deep Learning-based Cross-verification

arXiv:2507.11081v3 Announce Type: replace-cross Abstract: Ground penetrating radar (GPR) has become a rapid and non-destructive solution for road subsurface dis

ArXiv cs.AI 📄 Paper 1w ago

Improved particle swarm optimization algorithm: multi-target trajectory optimization for swarm drones

arXiv:2507.13647v2 Announce Type: replace-cross Abstract: Real-time trajectory planning for unmanned aerial vehicles (UAVs) in dynamic environments remains a ke

ArXiv cs.AI 📄 Paper 1w ago

ChemDFM-R: A Chemical Reasoning LLM Enhanced with Atomized Chemical Knowledge

arXiv:2507.21990v4 Announce Type: replace-cross Abstract: Atomized chemical knowledge, such as functional group information of molecules and reactions, plays a

ArXiv cs.AI 📄 Paper 1w ago

BRAIN: Bias-Mitigation Continual Learning Approach to Vision-Brain Understanding

arXiv:2508.18187v2 Announce Type: replace-cross Abstract: Memory decay makes it harder for the human brain to recognize visual objects and retain details. Conse

ArXiv cs.AI 📄 Paper 1w ago

Variation in Verification: Understanding Verification Dynamics in Large Language Models

arXiv:2509.17995v2 Announce Type: replace-cross Abstract: Recent advances have shown that scaling test-time computation enables large language models (LLMs) to

ArXiv cs.AI 📄 Paper 1w ago

Safe-SAIL: Towards a Fine-grained Safety Landscape of Large Language Models via Sparse Autoencoder Interpretation Framework

arXiv:2509.18127v3 Announce Type: replace-cross Abstract: Sparse autoencoders (SAEs) enable interpretability research by decomposing entangled model activations

ArXiv cs.AI 📄 Paper 1w ago

DyBBT: Dynamic Balance via Bandit-inspired Targeting for Dialog Policy with Cognitive Dual-Systems

arXiv:2509.19695v3 Announce Type: replace-cross Abstract: Task oriented dialog systems often rely on static exploration strategies that do not adapt to dynamic

ArXiv cs.AI 📄 Paper 1w ago

HiCoLoRA: Addressing Context-Prompt Misalignment via Hierarchical Collaborative LoRA for Zero-Shot DST

arXiv:2509.19742v4 Announce Type: replace-cross Abstract: Zero-shot Dialog State Tracking (zs-DST) is essential for enabling Task-Oriented Dialog Systems (TODs)

ArXiv cs.AI 📄 Paper 1w ago

SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From

arXiv:2509.26404v2 Announce Type: replace-cross Abstract: Fingerprinting Large Language Models (LLMs) is essential for provenance verification and model attribut

ArXiv cs.AI 📄 Paper 1w ago

Benchmarking Foundation Models with Retrieval-Augmented Generation in Olympic-Level Physics Problem Solving

arXiv:2510.00919v3 Announce Type: replace-cross Abstract: Retrieval-augmented generation (RAG) with foundation models has achieved strong performance across div

ArXiv cs.AI 📄 Paper 1w ago

LLM as Attention-Informed NTM and Topic Modeling as long-input Generation: Interpretability and long-Context Capability

arXiv:2510.03174v2 Announce Type: replace-cross Abstract: Topic modeling aims to produce interpretable topic representations and topic-document correspondences

ArXiv cs.AI 📄 Paper 1w ago

Malice in Agentland: Down the Rabbit Hole of Backdoors in the AI Supply Chain

arXiv:2510.05159v4 Announce Type: replace-cross Abstract: While finetuning AI agents on interaction data, such as web browsing or tool use, improves their c

ArXiv cs.AI 📄 Paper 1w ago

GTCN-G: A Residual Graph-Temporal Fusion Network for Imbalanced Intrusion Detection

arXiv:2510.07285v3 Announce Type: replace-cross Abstract: The escalating complexity of network threats and the inherent class imbalance in traffic data present