📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 2,972 articles · Updated every 3 hours · View all news
ArXiv cs.AI
📄 Paper
1d ago
Zero-Shot Quantization via Weight-Space Arithmetic
arXiv:2604.03420v1 Announce Type: cross Abstract: We show that robustness to post-training quantization (PTQ) is a transferable direction in weight space. We ca
ArXiv cs.AI
📄 Paper
1d ago
AEGIS: Scaling Long-Sequence Homomorphic Encrypted Transformer Inference via Hybrid Parallelism on Multi-GPU Systems
arXiv:2604.03425v1 Announce Type: cross Abstract: Fully Homomorphic Encryption (FHE) enables privacy-preserving Transformer inference, but long-sequence encrypt
ArXiv cs.AI
📄 Paper
1d ago
Inference-Path Optimization via Circuit Duplication in Frozen Visual Transformers for Marine Species Classification
arXiv:2604.03428v1 Announce Type: cross Abstract: Automated underwater species classification is constrained by annotation cost and environmental variation that
ArXiv cs.AI
📄 Paper
1d ago
MetaSAEs: Joint Training with a Decomposability Penalty Produces More Atomic Sparse Autoencoder Latents
arXiv:2604.03436v1 Announce Type: cross Abstract: Sparse autoencoders (SAEs) are increasingly used for safety-relevant applications including alignment detectio
ArXiv cs.AI
📄 Paper
1d ago
Agile Story-Point Estimation: Is RAG a Better Way to Go?
arXiv:2604.03443v1 Announce Type: cross Abstract: The sprint-based iterative approach in the Agile software development method allows continuous feedback and ad
ArXiv cs.AI
📄 Paper
1d ago
Measuring LLM Trust Allocation Across Conflicting Software Artifacts
arXiv:2604.03447v1 Announce Type: cross Abstract: LLM-based software engineering assistants fail not only by producing incorrect outputs, but also by allocating
ArXiv cs.AI
📄 Paper
1d ago
ExpressEdit: Fast Editing of Stylized Facial Expressions with Diffusion Models in Photoshop
arXiv:2604.03448v1 Announce Type: cross Abstract: Facial expressions of characters are a vital component of visual storytelling. While current AI image editing
ArXiv cs.AI
📄 Paper
1d ago
RDFace: A Benchmark Dataset for Rare Disease Facial Image Analysis under Extreme Data Scarcity and Phenotype-Aware Synthetic Generation
arXiv:2604.03454v1 Announce Type: cross Abstract: Rare diseases often manifest with distinctive facial phenotypes in children, offering valuable diagnostic cues
ArXiv cs.AI
📄 Paper
1d ago
Vocabulary Dropout for Curriculum Diversity in LLM Co-Evolution
arXiv:2604.03472v1 Announce Type: cross Abstract: Co-evolutionary self-play, where one language model generates problems and another solves them, promises auton
ArXiv cs.AI
📄 Paper
1d ago
Evolutionary Search for Automated Design of Uncertainty Quantification Methods
arXiv:2604.03473v1 Announce Type: cross Abstract: Uncertainty quantification (UQ) methods for large language models are predominantly designed by hand based on
ArXiv cs.AI
📄 Paper
1d ago
Fine-tuning DeepSeek-OCR-2 for Molecular Structure Recognition
arXiv:2604.03476v1 Announce Type: cross Abstract: Optical Chemical Structure Recognition (OCSR) is critical for converting 2D molecular diagrams from printed li
ArXiv cs.AI
📄 Paper
1d ago
Large Language Models Align with the Human Brain during Creative Thinking
arXiv:2604.03480v1 Announce Type: cross Abstract: Creative thinking is a fundamental aspect of human cognition, and divergent thinking-the capacity to generate
ArXiv cs.AI
📄 Paper
1d ago
VisionClaw: Always-On AI Agents through Smart Glasses
arXiv:2604.03486v1 Announce Type: cross Abstract: We present VisionClaw, an always-on wearable AI agent that integrates live egocentric perception with agentic
ArXiv cs.AI
📄 Paper
1d ago
Sim2Real-AD: A Modular Sim-to-Real Framework for Deploying VLM-Guided Reinforcement Learning in Real-World Autonomous Driving
arXiv:2604.03497v1 Announce Type: cross Abstract: Deploying reinforcement learning policies trained in simulation to real autonomous vehicles remains a fundamen
ArXiv cs.AI
📄 Paper
1d ago
The Augmentation Trap: AI Productivity and the Cost of Cognitive Offloading
arXiv:2604.03501v1 Announce Type: cross Abstract: Experimental evidence confirms that AI tools raise worker productivity, but also that sustained use can erode
ArXiv cs.AI
📄 Paper
1d ago
Inside the Scaffold: A Source-Code Taxonomy of Coding Agent Architectures
arXiv:2604.03515v1 Announce Type: cross Abstract: LLM-based coding agents can localize bugs, generate patches, and run tests with diminishing human oversight, y
ArXiv cs.AI
📄 Paper
1d ago
Optimizing Neurorobot Policy under Limited Demonstration Data through Preference Regret
arXiv:2604.03523v1 Announce Type: cross Abstract: Robot reinforcement learning from demonstrations (RLfD) assumes that expert data is abundant; this is usually
ArXiv cs.AI
📄 Paper
1d ago
Determined by User Needs: A Salient Object Detection Rationale Beyond Conventional Visual Stimuli
arXiv:2604.03526v1 Announce Type: cross Abstract: Existing \textbf{s}alient \textbf{o}bject \textbf{d}etection (SOD) methods adopt a \textbf{passive} visual sti
ArXiv cs.AI
📄 Paper
1d ago
Incentives shape how humans co-create with generative AI
arXiv:2604.03529v1 Announce Type: cross Abstract: Generative AI is quickly becoming an integral part of people's everyday workflows. Early evidence has shown th
ArXiv cs.AI
📄 Paper
1d ago
LangFIR: Discovering Sparse Language-Specific Features from Monolingual Data for Language Steering
arXiv:2604.03532v1 Announce Type: cross Abstract: Large language models (LLMs) show strong multilingual capabilities, yet reliably controlling the language of t
ArXiv cs.AI
📄 Paper
1d ago
AgenticFlict: A Large-Scale Dataset of Merge Conflicts in AI Coding Agent Pull Requests on GitHub
arXiv:2604.03551v1 Announce Type: cross Abstract: Software Engineering 3.0 marks a paradigm shift in software development, in which AI coding agents are no long
ArXiv cs.AI
📄 Paper
1d ago
CRAFT: Video Diffusion for Bimanual Robot Data Generation
arXiv:2604.03552v1 Announce Type: cross Abstract: Bimanual robot learning from demonstrations is fundamentally limited by the cost and narrow visual diversity o
ArXiv cs.AI
📄 Paper
1d ago
Focus Matters: Phase-Aware Suppression for Hallucination in Vision-Language Models
arXiv:2604.03556v1 Announce Type: cross Abstract: Large Vision-Language Models (LVLMs) have achieved impressive progress in multimodal reasoning, yet they remai
ArXiv cs.AI
📄 Paper
1d ago
SecPI: Secure Code Generation with Reasoning Models via Security Reasoning Internalization
arXiv:2604.03587v1 Announce Type: cross Abstract: Reasoning language models (RLMs) are increasingly used in programming. Yet, even state-of-the-art RLMs frequen
DeepCamp AI