📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 4,742 articles · Updated every 3 hours · View all reads
All
⚡ AI Lessons (12131)
ArXiv cs.AIDev.to · FORUM WEBDev.to AIForbes InnovationOpenAI NewsHugging Face Blog
ArXiv cs.AI
📄 Paper
1d ago
CascadeDebate: Multi-Agent Deliberation for Cost-Aware LLM Cascades
arXiv:2604.12262v1 Announce Type: cross Abstract: Cascaded LLM systems coordinate models of varying sizes with human experts to balance accuracy, cost, and abst
ArXiv cs.AI
📄 Paper
1d ago
MAST: Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer
arXiv:2604.12281v1 Announce Type: cross Abstract: Style transfer aims to render a content image with the visual characteristics of a reference style while prese
ArXiv cs.AI
📄 Paper
1d ago
Local-Splitter: A Measurement Study of Seven Tactics for Reducing Cloud LLM Token Usage on Coding-Agent Workloads
arXiv:2604.12301v1 Announce Type: cross Abstract: We present a systematic measurement study of seven tactics for reducing cloud LLM token usage when a small loc
ArXiv cs.AI
📄 Paper
1d ago
GCA Framework: A Gulf-Grounded Dataset and Agentic Pipeline for Climate Decision Support
arXiv:2604.12306v1 Announce Type: cross Abstract: Climate decision-making in the Gulf increasingly demands systems that can translate heterogeneous scientific a
ArXiv cs.AI
📄 Paper
1d ago
Is Vibe Coding the Future? An Empirical Assessment of LLM Generated Codes for Construction Safety
arXiv:2604.12311v1 Announce Type: cross Abstract: The emergence of vibe coding, a paradigm where non-technical users instruct Large Language Models (LLMs) to ge
ArXiv cs.AI
📄 Paper
1d ago
EgoEsportsQA: An Egocentric Video Benchmark for Perception and Reasoning in Esports
arXiv:2604.12320v1 Announce Type: cross Abstract: While video large language models (Video-LLMs) excel in understanding slow-paced, real-world egocentric videos
ArXiv cs.AI
📄 Paper
1d ago
Black-Box Optimization From Small Offline Datasets via Meta Learning with Synthetic Tasks
arXiv:2604.12325v1 Announce Type: cross Abstract: We consider the problem of offline black-box optimization, where the goal is to discover optimal designs (e.g.
ArXiv cs.AI
📄 Paper
1d ago
GeM-EA: A Generative and Meta-learning Enhanced Evolutionary Algorithm for Streaming Data-Driven Optimization
arXiv:2604.12336v1 Announce Type: cross Abstract: Streaming Data-Driven Optimization (SDDO) problems arise in many applications where data arrive continuously a
ArXiv cs.AI
📄 Paper
1d ago
FRTSearch: Unified Detection and Parameter Inference of Fast Radio Transients using Instance Segmentation
arXiv:2604.12344v1 Announce Type: cross Abstract: The exponential growth of data from modern radio telescopes presents a significant challenge to traditional si
ArXiv cs.AI
📄 Paper
1d ago
Scaffold-Conditioned Preference Triplets for Controllable Molecular Optimization with Large Language Models
arXiv:2604.12350v1 Announce Type: cross Abstract: Molecular property optimization is central to drug discovery, yet many deep learning methods rely on black-box
ArXiv cs.AI
📄 Paper
1d ago
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
arXiv:2604.12374v1 Announce Type: cross Abstract: We describe the pre-training, post-training, and quantization of Nemotron 3 Super, a 120 billion (active 12 bi
ArXiv cs.AI
📄 Paper
1d ago
Cooperative Memory Paging with Keyword Bookmarks for Long-Horizon LLM Conversations
arXiv:2604.12376v1 Announce Type: cross Abstract: When LLM conversations grow beyond the context window, old content must be evicted -- but how does the model r
ArXiv cs.AI
📄 Paper
1d ago
SCRIPT: A Subcharacter Compositional Representation Injection Module for Korean Pre-Trained Language Models
arXiv:2604.12377v1 Announce Type: cross Abstract: Korean is a morphologically rich language with a featural writing system in which each character is systematic
ArXiv cs.AI
📄 Paper
1d ago
Beyond Output Correctness: Benchmarking and Evaluating Large Language Model Reasoning in Coding Tasks
arXiv:2604.12379v1 Announce Type: cross Abstract: Large language models (LLMs) increasingly rely on explicit reasoning to solve coding tasks, yet evaluating the
ArXiv cs.AI
📄 Paper
1d ago
Chain-of-Models Pre-Training: Rethinking Training Acceleration of Vision Foundation Models
arXiv:2604.12391v1 Announce Type: cross Abstract: In this paper, we present Chain-of-Models Pre-Training (CoM-PT), a novel performance-lossless training acceler
ArXiv cs.AI
📄 Paper
1d ago
Security and Resilience in Autonomous Vehicles: A Proactive Design Approach
arXiv:2604.12408v1 Announce Type: cross Abstract: Autonomous vehicles (AVs) promise efficient, clean and cost-effective transportation systems, but their relian
ArXiv cs.AI
📄 Paper
1d ago
RACF: A Resilient Autonomous Car Framework with Object Distance Correction
arXiv:2604.12418v1 Announce Type: cross Abstract: Autonomous vehicles are increasingly deployed in safety-critical applications, where sensing failures or cyber
ArXiv cs.AI
📄 Paper
1d ago
Decoding by Perturbation: Mitigating MLLM Hallucinations via Dynamic Textual Perturbation
arXiv:2604.12424v1 Announce Type: cross Abstract: Multimodal Large Language Models frequently suffer from inference hallucinations, partially stemming from lang
ArXiv cs.AI
📄 Paper
1d ago
IAD-Unify: A Region-Grounded Unified Model for Industrial Anomaly Segmentation, Understanding, and Generation
arXiv:2604.12440v1 Announce Type: cross Abstract: Real-world industrial inspection requires not only localizing defects, but also explaining them in natural lan
ArXiv cs.AI
📄 Paper
1d ago
X-VC: Zero-shot Streaming Voice Conversion in Codec Space
arXiv:2604.12456v1 Announce Type: cross Abstract: Zero-shot voice conversion (VC) aims to convert a source utterance into the voice of an unseen target speaker
ArXiv cs.AI
📄 Paper
1d ago
Euler-inspired Decoupling Neural Operator for Efficient Pansharpening
arXiv:2604.12463v1 Announce Type: cross Abstract: Pansharpening aims to synthesize high-resolution multispectral (HR-MS) images by fusing the spatial textures o
ArXiv cs.AI
📄 Paper
1d ago
From Kinematics to Dynamics: Learning to Refine Hybrid Plans for Physically Feasible Execution
arXiv:2604.12474v1 Announce Type: cross Abstract: In many robotic tasks, agents must traverse a sequence of spatial regions to complete a mission. Such problems
ArXiv cs.AI
📄 Paper
1d ago
Mining Large Language Models for Low-Resource Language Data: Comparing Elicitation Strategies for Hausa and Fongbe
arXiv:2604.12477v1 Announce Type: cross Abstract: Large language models (LLMs) are trained on data contributed by low-resource language communities, yet the lin
ArXiv cs.AI
📄 Paper
1d ago
Audio Source Separation in Reverberant Environments using $\beta$-divergence based Nonnegative Factorization
arXiv:2604.12480v1 Announce Type: cross Abstract: In Gaussian model-based multichannel audio source separation, the likelihood of observed mixtures of source si
DeepCamp AI