6,601 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 6,601 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (17438) ArXiv cs.AIDev.to AIDev.to · FORUM WEBForbes InnovationMedium · ProgrammingMedium · AI
ArXiv cs.AI 📄 Paper 1w ago
SpecBranch: Speculative Decoding via Hybrid Drafting and Rollback-Aware Branch Parallelism
arXiv:2506.01979v4 Announce Type: replace-cross Abstract: Recently, speculative decoding (SD) has emerged as a promising technique to accelerate LLM inference b
ArXiv cs.AI 📄 Paper 1w ago
HSG-12M: A Large-Scale Benchmark of Spatial Multigraphs from the Energy Spectra of Non-Hermitian Crystals
arXiv:2506.08618v4 Announce Type: replace-cross Abstract: AI is transforming scientific research by revealing new ways to understand complex physical systems, b
ArXiv cs.AI 📄 Paper 1w ago
Fast AI Model Partition for Split Learning over Edge Networks
arXiv:2507.01041v4 Announce Type: replace-cross Abstract: Split learning (SL) is a distributed learning paradigm that can enable computation-intensive artificia
ArXiv cs.AI 📄 Paper 1w ago
Global optimization tailored for graphics processing units: Complete and rigorous search for large-scale nonlinear minimization
arXiv:2507.01770v4 Announce Type: replace-cross Abstract: This paper introduces a numerical method to enclose the global minimum of a nonlinear function subject
ArXiv cs.AI 📄 Paper 1w ago
Mobile GUI Agents under Real-world Threats: Are We There Yet?
arXiv:2507.04227v2 Announce Type: replace-cross Abstract: Recent years have witnessed a rapid development of mobile GUI agents powered by large language models
ArXiv cs.AI 📄 Paper 1w ago
A document is worth a structured record: Principled inductive bias design for document recognition
arXiv:2507.08458v2 Announce Type: replace-cross Abstract: Many document types use intrinsic, convention-driven structures that serve to encode precise and struc
ArXiv cs.AI 📄 Paper 1w ago
Simulation as Supervision: Mechanistic Pretraining for Scientific Discovery
arXiv:2507.08977v4 Announce Type: replace-cross Abstract: Scientific modeling faces a tradeoff between the interpretability of mechanistic theory and the predic
ArXiv cs.AI 📄 Paper 1w ago
Automatic Road Subsurface Distress Recognition from Ground Penetrating Radar Images using Deep Learning-based Cross-verification
arXiv:2507.11081v3 Announce Type: replace-cross Abstract: Ground penetrating radar (GPR) has become a rapid and non-destructive solution for road subsurface dis
ArXiv cs.AI 📄 Paper 1w ago
Improved particle swarm optimization algorithm: multi-target trajectory optimization for swarm drones
arXiv:2507.13647v2 Announce Type: replace-cross Abstract: Real-time trajectory planning for unmanned aerial vehicles (UAVs) in dynamic environments remains a ke
ArXiv cs.AI 📄 Paper 1w ago
ChemDFM-R: A Chemical Reasoning LLM Enhanced with Atomized Chemical Knowledge
arXiv:2507.21990v4 Announce Type: replace-cross Abstract: Atomized chemical knowledge, such as functional group information of molecules and reactions, plays a
ArXiv cs.AI 📄 Paper 1w ago
BRAIN: Bias-Mitigation Continual Learning Approach to Vision-Brain Understanding
arXiv:2508.18187v2 Announce Type: replace-cross Abstract: Memory decay makes it harder for the human brain to recognize visual objects and retain details. Conse
ArXiv cs.AI 📄 Paper 1w ago
Variation in Verification: Understanding Verification Dynamics in Large Language Models
arXiv:2509.17995v2 Announce Type: replace-cross Abstract: Recent advances have shown that scaling test-time computation enables large language models (LLMs) to
ArXiv cs.AI 📄 Paper 1w ago
Safe-SAIL: Towards a Fine-grained Safety Landscape of Large Language Models via Sparse Autoencoder Interpretation Framework
arXiv:2509.18127v3 Announce Type: replace-cross Abstract: Sparse autoencoders (SAEs) enable interpretability research by decomposing entangled model activations
ArXiv cs.AI 📄 Paper 1w ago
DyBBT: Dynamic Balance via Bandit-inspired Targeting for Dialog Policy with Cognitive Dual-Systems
arXiv:2509.19695v3 Announce Type: replace-cross Abstract: Task oriented dialog systems often rely on static exploration strategies that do not adapt to dynamic
ArXiv cs.AI 📄 Paper 1w ago
HiCoLoRA: Addressing Context-Prompt Misalignment via Hierarchical Collaborative LoRA for Zero-Shot DST
arXiv:2509.19742v4 Announce Type: replace-cross Abstract: Zero-shot Dialog State Tracking (zs-DST) is essential for enabling Task-Oriented Dialog Systems (TODs)
ArXiv cs.AI 📄 Paper 1w ago
SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From
arXiv:2509.26404v2 Announce Type: replace-cross Abstract: Fingerprinting Large Language Models (LLMs)is essential for provenance verification and model attribut
ArXiv cs.AI 📄 Paper 1w ago
Benchmarking Foundation Models with Retrieval-Augmented Generation in Olympic-Level Physics Problem Solving
arXiv:2510.00919v3 Announce Type: replace-cross Abstract: Retrieval-augmented generation (RAG) with foundation models has achieved strong performance across div
ArXiv cs.AI 📄 Paper 1w ago
LLM as Attention-Informed NTM and Topic Modeling as long-input Generation: Interpretability and long-Context Capability
arXiv:2510.03174v2 Announce Type: replace-cross Abstract: Topic modeling aims to produce interpretable topic representations and topic--document correspondences
ArXiv cs.AI 📄 Paper 1w ago
Malice in Agentland: Down the Rabbit Hole of Backdoors in the AI Supply Chain
arXiv:2510.05159v4 Announce Type: replace-cross Abstract: While finetuning AI agents on interaction data -- such as web browsing or tool use -- improves their c
ArXiv cs.AI 📄 Paper 1w ago
GTCN-G: A Residual Graph-Temporal Fusion Network for Imbalanced Intrusion Detection
arXiv:2510.07285v3 Announce Type: replace-cross Abstract: The escalating complexity of network threats and the inherent class imbalance in traffic data present