6,347 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 6,347 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (16751) ArXiv cs.AIDev.to AIDev.to · FORUM WEBForbes InnovationMedium · ProgrammingMedium · AI
ArXiv cs.AI 📄 Paper 1w ago
OSC: Hardware Efficient W4A4 Quantization via Outlier Separation in Channel Dimension
arXiv:2604.12782v1 Announce Type: cross Abstract: While 4-bit quantization is essential for high-throughput deployment of Large Language Models, activation outl
ArXiv cs.AI 📄 Paper 1w ago
VFA: Relieving Vector Operations in Flash Attention with Global Maximum Pre-computation
arXiv:2604.12798v1 Announce Type: cross Abstract: FlashAttention-style online softmax enables exact attention computation with linear memory by streaming score
ArXiv cs.AI 📄 Paper 1w ago
Efficiency of Proportional Mechanisms in Online Auto-Bidding Advertising
arXiv:2604.12799v1 Announce Type: cross Abstract: The rise of automated bidding strategies in online advertising presents new challenges in designing and analyz
ArXiv cs.AI 📄 Paper 1w ago
Rethinking Satellite Image Restoration for Onboard AI: A Lightweight Learning-Based Approach
arXiv:2604.12807v1 Announce Type: cross Abstract: Satellite image restoration aims to improve image quality by compensating for degradations (e.g., noise and bl
ArXiv cs.AI 📄 Paper 1w ago
Algorithmic Analysis of Dense Associative Memory: Finite-Size Guarantees and Adversarial Robustness
arXiv:2604.12811v1 Announce Type: cross Abstract: Dense Associative Memory (DAM) generalizes Hopfield networks through higher-order interactions and achieves st
ArXiv cs.AI 📄 Paper 1w ago
Loop Corrections to the Training and Generalization Errors of Random Feature Models
arXiv:2604.12827v1 Announce Type: cross Abstract: We investigate random feature models in which neural networks sampled from a prescribed initialization ensembl
ArXiv cs.AI 📄 Paper 1w ago
Detecting and refurbishing ground truth errors during training of deep learning-based echocardiography segmentation models
arXiv:2604.12832v1 Announce Type: cross Abstract: Deep learning-based medical image segmentation typically relies on ground truth (GT) labels obtained through m
ArXiv cs.AI 📄 Paper 1w ago
FastGrasp: Learning-based Whole-body Control method for Fast Dexterous Grasping with Mobile Manipulators
arXiv:2604.12879v1 Announce Type: cross Abstract: Fast grasping is critical for mobile robots in logistics, manufacturing, and service applications. Existing me
ArXiv cs.AI 📄 Paper 1w ago
Towards Long-horizon Agentic Multimodal Search
arXiv:2604.12890v1 Announce Type: cross Abstract: Multimodal deep search agents have shown great potential in solving complex tasks by iteratively collecting te
ArXiv cs.AI 📄 Paper 1w ago
Round-Trip Translation Reveals What Frontier Multilingual Benchmarks Miss
arXiv:2604.12911v1 Announce Type: cross Abstract: Multilingual benchmarks guide the development of frontier models. Yet multilingual evaluations reported by fro
ArXiv cs.AI 📄 Paper 1w ago
CoDe-R: Refining Decompiler Output with LLMs via Rationale Guidance and Adaptive Inference
arXiv:2604.12913v1 Announce Type: cross Abstract: Binary decompilation is a critical reverse engineering task aimed at reconstructing high-level source code fro
ArXiv cs.AI 📄 Paper 1w ago
Distorted or Fabricated? A Survey on Hallucination in Video LLMs
arXiv:2604.12944v1 Announce Type: cross Abstract: Despite significant progress in video-language modeling, hallucinations remain a persistent challenge in Video
ArXiv cs.AI 📄 Paper 1w ago
Parallax: Why AI Agents That Think Must Never Act
arXiv:2604.12986v1 Announce Type: cross Abstract: Autonomous AI agents are rapidly transitioning from experimental tools to operational infrastructure, with pro
ArXiv cs.AI 📄 Paper 1w ago
ROSE: An Intent-Centered Evaluation Metric for NL2SQL
arXiv:2604.12988v1 Announce Type: cross Abstract: Execution Accuracy (EX), the widely used metric for evaluating the effectiveness of Natural Language to SQL (N
ArXiv cs.AI 📄 Paper 1w ago
LogicEval: A Systematic Framework for Evaluating Automated Repair Techniques for Logical Vulnerabilities in Real-World Software
arXiv:2604.12994v1 Announce Type: cross Abstract: Logical vulnerabilities in software stem from flaws in program logic rather than memory safety, which can lead
ArXiv cs.AI 📄 Paper 1w ago
One Token Away from Collapse: The Fragility of Instruction-Tuned Helpfulness
arXiv:2604.13006v1 Announce Type: cross Abstract: Instruction-tuned large language models produce helpful, structured responses, but how robust is this helpfuln
ArXiv cs.AI 📄 Paper 1w ago
Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation
arXiv:2604.13010v1 Announce Type: cross Abstract: On-policy distillation (OPD) has emerged as an efficient post-training paradigm for large language models. How
ArXiv cs.AI 📄 Paper 1w ago
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
arXiv:2604.13016v1 Announce Type: cross Abstract: On-policy distillation (OPD) has become a core technique in the post-training of large language models, yet it
ArXiv cs.AI 📄 Paper 1w ago
Representation geometry shapes task performance in vision-language modeling for CT enterography
arXiv:2604.13021v1 Announce Type: cross Abstract: Computed tomography (CT) enterography is a primary imaging modality for assessing inflammatory bowel disease (
ArXiv cs.AI 📄 Paper 1w ago
Visual Preference Optimization with Rubric Rewards
arXiv:2604.13029v1 Announce Type: cross Abstract: The effectiveness of Direct Preference Optimization (DPO) depends on preference data that reflect the quality