📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 4,742 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (12131) ArXiv cs.AI Dev.to · FORUM WEB Dev.to AI Forbes Innovation OpenAI News Hugging Face Blog

CascadeDebate: Multi-Agent Deliberation for Cost-Aware LLM Cascades

arXiv:2604.12262v1 Announce Type: cross Abstract: Cascaded LLM systems coordinate models of varying sizes with human experts to balance accuracy, cost, and abst

ArXiv cs.AI 📄 Paper 1d ago

MAST: Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer

arXiv:2604.12281v1 Announce Type: cross Abstract: Style transfer aims to render a content image with the visual characteristics of a reference style while prese

ArXiv cs.AI 📄 Paper 1d ago

Local-Splitter: A Measurement Study of Seven Tactics for Reducing Cloud LLM Token Usage on Coding-Agent Workloads

arXiv:2604.12301v1 Announce Type: cross Abstract: We present a systematic measurement study of seven tactics for reducing cloud LLM token usage when a small loc

ArXiv cs.AI 📄 Paper 1d ago

GCA Framework: A Gulf-Grounded Dataset and Agentic Pipeline for Climate Decision Support

arXiv:2604.12306v1 Announce Type: cross Abstract: Climate decision-making in the Gulf increasingly demands systems that can translate heterogeneous scientific a

ArXiv cs.AI 📄 Paper 1d ago

Is Vibe Coding the Future? An Empirical Assessment of LLM Generated Codes for Construction Safety

arXiv:2604.12311v1 Announce Type: cross Abstract: The emergence of vibe coding, a paradigm where non-technical users instruct Large Language Models (LLMs) to ge

ArXiv cs.AI 📄 Paper 1d ago

EgoEsportsQA: An Egocentric Video Benchmark for Perception and Reasoning in Esports

arXiv:2604.12320v1 Announce Type: cross Abstract: While video large language models (Video-LLMs) excel in understanding slow-paced, real-world egocentric videos

ArXiv cs.AI 📄 Paper 1d ago

Black-Box Optimization From Small Offline Datasets via Meta Learning with Synthetic Tasks

arXiv:2604.12325v1 Announce Type: cross Abstract: We consider the problem of offline black-box optimization, where the goal is to discover optimal designs (e.g.

ArXiv cs.AI 📄 Paper 1d ago

GeM-EA: A Generative and Meta-learning Enhanced Evolutionary Algorithm for Streaming Data-Driven Optimization

arXiv:2604.12336v1 Announce Type: cross Abstract: Streaming Data-Driven Optimization (SDDO) problems arise in many applications where data arrive continuously a

ArXiv cs.AI 📄 Paper 1d ago

FRTSearch: Unified Detection and Parameter Inference of Fast Radio Transients using Instance Segmentation

arXiv:2604.12344v1 Announce Type: cross Abstract: The exponential growth of data from modern radio telescopes presents a significant challenge to traditional si

ArXiv cs.AI 📄 Paper 1d ago

Scaffold-Conditioned Preference Triplets for Controllable Molecular Optimization with Large Language Models

arXiv:2604.12350v1 Announce Type: cross Abstract: Molecular property optimization is central to drug discovery, yet many deep learning methods rely on black-box

ArXiv cs.AI 📄 Paper 1d ago

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

arXiv:2604.12374v1 Announce Type: cross Abstract: We describe the pre-training, post-training, and quantization of Nemotron 3 Super, a 120 billion (active 12 bi

ArXiv cs.AI 📄 Paper 1d ago

Cooperative Memory Paging with Keyword Bookmarks for Long-Horizon LLM Conversations

arXiv:2604.12376v1 Announce Type: cross Abstract: When LLM conversations grow beyond the context window, old content must be evicted -- but how does the model r

ArXiv cs.AI 📄 Paper 1d ago

SCRIPT: A Subcharacter Compositional Representation Injection Module for Korean Pre-Trained Language Models

arXiv:2604.12377v1 Announce Type: cross Abstract: Korean is a morphologically rich language with a featural writing system in which each character is systematic

ArXiv cs.AI 📄 Paper 1d ago

Beyond Output Correctness: Benchmarking and Evaluating Large Language Model Reasoning in Coding Tasks

arXiv:2604.12379v1 Announce Type: cross Abstract: Large language models (LLMs) increasingly rely on explicit reasoning to solve coding tasks, yet evaluating the

ArXiv cs.AI 📄 Paper 1d ago

Chain-of-Models Pre-Training: Rethinking Training Acceleration of Vision Foundation Models

arXiv:2604.12391v1 Announce Type: cross Abstract: In this paper, we present Chain-of-Models Pre-Training (CoM-PT), a novel performance-lossless training acceler

ArXiv cs.AI 📄 Paper 1d ago

Security and Resilience in Autonomous Vehicles: A Proactive Design Approach

arXiv:2604.12408v1 Announce Type: cross Abstract: Autonomous vehicles (AVs) promise efficient, clean and cost-effective transportation systems, but their relian

ArXiv cs.AI 📄 Paper 1d ago

RACF: A Resilient Autonomous Car Framework with Object Distance Correction

arXiv:2604.12418v1 Announce Type: cross Abstract: Autonomous vehicles are increasingly deployed in safety-critical applications, where sensing failures or cyber

ArXiv cs.AI 📄 Paper 1d ago

Decoding by Perturbation: Mitigating MLLM Hallucinations via Dynamic Textual Perturbation

arXiv:2604.12424v1 Announce Type: cross Abstract: Multimodal Large Language Models frequently suffer from inference hallucinations, partially stemming from lang

ArXiv cs.AI 📄 Paper 1d ago

IAD-Unify: A Region-Grounded Unified Model for Industrial Anomaly Segmentation, Understanding, and Generation

arXiv:2604.12440v1 Announce Type: cross Abstract: Real-world industrial inspection requires not only localizing defects, but also explaining them in natural lan

ArXiv cs.AI 📄 Paper 1d ago

X-VC: Zero-shot Streaming Voice Conversion in Codec Space

arXiv:2604.12456v1 Announce Type: cross Abstract: Zero-shot voice conversion (VC) aims to convert a source utterance into the voice of an unseen target speaker

ArXiv cs.AI 📄 Paper 1d ago

Euler-inspired Decoupling Neural Operator for Efficient Pansharpening

arXiv:2604.12463v1 Announce Type: cross Abstract: Pansharpening aims to synthesize high-resolution multispectral (HR-MS) images by fusing the spatial textures o

ArXiv cs.AI 📄 Paper 1d ago

From Kinematics to Dynamics: Learning to Refine Hybrid Plans for Physically Feasible Execution

arXiv:2604.12474v1 Announce Type: cross Abstract: In many robotic tasks, agents must traverse a sequence of spatial regions to complete a mission. Such problems

ArXiv cs.AI 📄 Paper 1d ago

Mining Large Language Models for Low-Resource Language Data: Comparing Elicitation Strategies for Hausa and Fongbe

arXiv:2604.12477v1 Announce Type: cross Abstract: Large language models (LLMs) are trained on data contributed by low-resource language communities, yet the lin

ArXiv cs.AI 📄 Paper 1d ago

Audio Source Separation in Reverberant Environments using $\beta$-divergence based Nonnegative Factorization

arXiv:2604.12480v1 Announce Type: cross Abstract: In Gaussian model-based multichannel audio source separation, the likelihood of observed mixtures of source si