📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 2,972 articles · Updated every 3 hours

Zero-Shot Quantization via Weight-Space Arithmetic

arXiv:2604.03420v1 Announce Type: cross Abstract: We show that robustness to post-training quantization (PTQ) is a transferable direction in weight space. We ca

ArXiv cs.AI 📄 Paper 1d ago

AEGIS: Scaling Long-Sequence Homomorphic Encrypted Transformer Inference via Hybrid Parallelism on Multi-GPU Systems

arXiv:2604.03425v1 Announce Type: cross Abstract: Fully Homomorphic Encryption (FHE) enables privacy-preserving Transformer inference, but long-sequence encrypt

ArXiv cs.AI 📄 Paper 1d ago

Inference-Path Optimization via Circuit Duplication in Frozen Visual Transformers for Marine Species Classification

arXiv:2604.03428v1 Announce Type: cross Abstract: Automated underwater species classification is constrained by annotation cost and environmental variation that

ArXiv cs.AI 📄 Paper 1d ago

MetaSAEs: Joint Training with a Decomposability Penalty Produces More Atomic Sparse Autoencoder Latents

arXiv:2604.03436v1 Announce Type: cross Abstract: Sparse autoencoders (SAEs) are increasingly used for safety-relevant applications including alignment detectio

ArXiv cs.AI 📄 Paper 1d ago

Agile Story-Point Estimation: Is RAG a Better Way to Go?

arXiv:2604.03443v1 Announce Type: cross Abstract: The sprint-based iterative approach in the Agile software development method allows continuous feedback and ad

ArXiv cs.AI 📄 Paper 1d ago

Measuring LLM Trust Allocation Across Conflicting Software Artifacts

arXiv:2604.03447v1 Announce Type: cross Abstract: LLM-based software engineering assistants fail not only by producing incorrect outputs, but also by allocating

ArXiv cs.AI 📄 Paper 1d ago

ExpressEdit: Fast Editing of Stylized Facial Expressions with Diffusion Models in Photoshop

arXiv:2604.03448v1 Announce Type: cross Abstract: Facial expressions of characters are a vital component of visual storytelling. While current AI image editing

ArXiv cs.AI 📄 Paper 1d ago

RDFace: A Benchmark Dataset for Rare Disease Facial Image Analysis under Extreme Data Scarcity and Phenotype-Aware Synthetic Generation

arXiv:2604.03454v1 Announce Type: cross Abstract: Rare diseases often manifest with distinctive facial phenotypes in children, offering valuable diagnostic cues

ArXiv cs.AI 📄 Paper 1d ago

Vocabulary Dropout for Curriculum Diversity in LLM Co-Evolution

arXiv:2604.03472v1 Announce Type: cross Abstract: Co-evolutionary self-play, where one language model generates problems and another solves them, promises auton

ArXiv cs.AI 📄 Paper 1d ago

Evolutionary Search for Automated Design of Uncertainty Quantification Methods

arXiv:2604.03473v1 Announce Type: cross Abstract: Uncertainty quantification (UQ) methods for large language models are predominantly designed by hand based on

ArXiv cs.AI 📄 Paper 1d ago

Fine-tuning DeepSeek-OCR-2 for Molecular Structure Recognition

arXiv:2604.03476v1 Announce Type: cross Abstract: Optical Chemical Structure Recognition (OCSR) is critical for converting 2D molecular diagrams from printed li

ArXiv cs.AI 📄 Paper 1d ago

Large Language Models Align with the Human Brain during Creative Thinking

arXiv:2604.03480v1 Announce Type: cross Abstract: Creative thinking is a fundamental aspect of human cognition, and divergent thinking, the capacity to generate

ArXiv cs.AI 📄 Paper 1d ago

VisionClaw: Always-On AI Agents through Smart Glasses

arXiv:2604.03486v1 Announce Type: cross Abstract: We present VisionClaw, an always-on wearable AI agent that integrates live egocentric perception with agentic

ArXiv cs.AI 📄 Paper 1d ago

Sim2Real-AD: A Modular Sim-to-Real Framework for Deploying VLM-Guided Reinforcement Learning in Real-World Autonomous Driving

arXiv:2604.03497v1 Announce Type: cross Abstract: Deploying reinforcement learning policies trained in simulation to real autonomous vehicles remains a fundamen

ArXiv cs.AI 📄 Paper 1d ago

The Augmentation Trap: AI Productivity and the Cost of Cognitive Offloading

arXiv:2604.03501v1 Announce Type: cross Abstract: Experimental evidence confirms that AI tools raise worker productivity, but also that sustained use can erode

ArXiv cs.AI 📄 Paper 1d ago

Inside the Scaffold: A Source-Code Taxonomy of Coding Agent Architectures

arXiv:2604.03515v1 Announce Type: cross Abstract: LLM-based coding agents can localize bugs, generate patches, and run tests with diminishing human oversight, y

ArXiv cs.AI 📄 Paper 1d ago

Optimizing Neurorobot Policy under Limited Demonstration Data through Preference Regret

arXiv:2604.03523v1 Announce Type: cross Abstract: Robot reinforcement learning from demonstrations (RLfD) assumes that expert data is abundant; this is usually

ArXiv cs.AI 📄 Paper 1d ago

Determined by User Needs: A Salient Object Detection Rationale Beyond Conventional Visual Stimuli

arXiv:2604.03526v1 Announce Type: cross Abstract: Existing salient object detection (SOD) methods adopt a passive visual sti

ArXiv cs.AI 📄 Paper 1d ago

Incentives shape how humans co-create with generative AI

arXiv:2604.03529v1 Announce Type: cross Abstract: Generative AI is quickly becoming an integral part of people's everyday workflows. Early evidence has shown th

ArXiv cs.AI 📄 Paper 1d ago

LangFIR: Discovering Sparse Language-Specific Features from Monolingual Data for Language Steering

arXiv:2604.03532v1 Announce Type: cross Abstract: Large language models (LLMs) show strong multilingual capabilities, yet reliably controlling the language of t

ArXiv cs.AI 📄 Paper 1d ago

AgenticFlict: A Large-Scale Dataset of Merge Conflicts in AI Coding Agent Pull Requests on GitHub

arXiv:2604.03551v1 Announce Type: cross Abstract: Software Engineering 3.0 marks a paradigm shift in software development, in which AI coding agents are no long

ArXiv cs.AI 📄 Paper 1d ago

CRAFT: Video Diffusion for Bimanual Robot Data Generation

arXiv:2604.03552v1 Announce Type: cross Abstract: Bimanual robot learning from demonstrations is fundamentally limited by the cost and narrow visual diversity o

ArXiv cs.AI 📄 Paper 1d ago

Focus Matters: Phase-Aware Suppression for Hallucination in Vision-Language Models

arXiv:2604.03556v1 Announce Type: cross Abstract: Large Vision-Language Models (LVLMs) have achieved impressive progress in multimodal reasoning, yet they remai

ArXiv cs.AI 📄 Paper 1d ago

SecPI: Secure Code Generation with Reasoning Models via Security Reasoning Internalization

arXiv:2604.03587v1 Announce Type: cross Abstract: Reasoning language models (RLMs) are increasingly used in programming. Yet, even state-of-the-art RLMs frequen