📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 6,347 articles · Updated every 3 hours

CLASP: Class-Adaptive Layer Fusion and Dual-Stage Pruning for Multimodal Large Language Models

arXiv:2604.12767v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) suffer from substantial computational overhead due to the high redund

ArXiv cs.AI 📄 Paper 1w ago

Cognition-Inspired Dual-Stream Semantic Enhancement for Vision-Based Dynamic Emotion Modeling

arXiv:2604.12777v1 Announce Type: cross Abstract: The human brain constructs emotional percepts not by processing facial expressions in isolation, but through a

ArXiv cs.AI 📄 Paper 1w ago

DoseRAD2026 Challenge dataset: AI accelerated photon and proton dose calculation for radiotherapy

arXiv:2604.12778v1 Announce Type: cross Abstract: Purpose: Accurate dose calculation is essential in radiotherapy for precise tumor irradiation while sparing he

ArXiv cs.AI 📄 Paper 1w ago

Efficient Adversarial Training via Criticality-Aware Fine-Tuning

arXiv:2604.12780v1 Announce Type: cross Abstract: Vision Transformer (ViT) models have achieved remarkable performance across various vision tasks, with scalabi

ArXiv cs.AI 📄 Paper 1w ago

OSC: Hardware Efficient W4A4 Quantization via Outlier Separation in Channel Dimension

arXiv:2604.12782v1 Announce Type: cross Abstract: While 4-bit quantization is essential for high-throughput deployment of Large Language Models, activation outl

ArXiv cs.AI 📄 Paper 1w ago

VFA: Relieving Vector Operations in Flash Attention with Global Maximum Pre-computation

arXiv:2604.12798v1 Announce Type: cross Abstract: FlashAttention-style online softmax enables exact attention computation with linear memory by streaming score

ArXiv cs.AI 📄 Paper 1w ago

Efficiency of Proportional Mechanisms in Online Auto-Bidding Advertising

arXiv:2604.12799v1 Announce Type: cross Abstract: The rise of automated bidding strategies in online advertising presents new challenges in designing and analyz

ArXiv cs.AI 📄 Paper 1w ago

Rethinking Satellite Image Restoration for Onboard AI: A Lightweight Learning-Based Approach

arXiv:2604.12807v1 Announce Type: cross Abstract: Satellite image restoration aims to improve image quality by compensating for degradations (e.g., noise and bl

ArXiv cs.AI 📄 Paper 1w ago

Algorithmic Analysis of Dense Associative Memory: Finite-Size Guarantees and Adversarial Robustness

arXiv:2604.12811v1 Announce Type: cross Abstract: Dense Associative Memory (DAM) generalizes Hopfield networks through higher-order interactions and achieves st

ArXiv cs.AI 📄 Paper 1w ago

Loop Corrections to the Training and Generalization Errors of Random Feature Models

arXiv:2604.12827v1 Announce Type: cross Abstract: We investigate random feature models in which neural networks sampled from a prescribed initialization ensembl

ArXiv cs.AI 📄 Paper 1w ago

Detecting and refurbishing ground truth errors during training of deep learning-based echocardiography segmentation models

arXiv:2604.12832v1 Announce Type: cross Abstract: Deep learning-based medical image segmentation typically relies on ground truth (GT) labels obtained through m

ArXiv cs.AI 📄 Paper 1w ago

FastGrasp: Learning-based Whole-body Control method for Fast Dexterous Grasping with Mobile Manipulators

arXiv:2604.12879v1 Announce Type: cross Abstract: Fast grasping is critical for mobile robots in logistics, manufacturing, and service applications. Existing me

ArXiv cs.AI 📄 Paper 1w ago

Towards Long-horizon Agentic Multimodal Search

arXiv:2604.12890v1 Announce Type: cross Abstract: Multimodal deep search agents have shown great potential in solving complex tasks by iteratively collecting te

ArXiv cs.AI 📄 Paper 1w ago

Round-Trip Translation Reveals What Frontier Multilingual Benchmarks Miss

arXiv:2604.12911v1 Announce Type: cross Abstract: Multilingual benchmarks guide the development of frontier models. Yet multilingual evaluations reported by fro

ArXiv cs.AI 📄 Paper 1w ago

CoDe-R: Refining Decompiler Output with LLMs via Rationale Guidance and Adaptive Inference

arXiv:2604.12913v1 Announce Type: cross Abstract: Binary decompilation is a critical reverse engineering task aimed at reconstructing high-level source code fro

ArXiv cs.AI 📄 Paper 1w ago

Distorted or Fabricated? A Survey on Hallucination in Video LLMs

arXiv:2604.12944v1 Announce Type: cross Abstract: Despite significant progress in video-language modeling, hallucinations remain a persistent challenge in Video

ArXiv cs.AI 📄 Paper 1w ago

Parallax: Why AI Agents That Think Must Never Act

arXiv:2604.12986v1 Announce Type: cross Abstract: Autonomous AI agents are rapidly transitioning from experimental tools to operational infrastructure, with pro

ArXiv cs.AI 📄 Paper 1w ago

ROSE: An Intent-Centered Evaluation Metric for NL2SQL

arXiv:2604.12988v1 Announce Type: cross Abstract: Execution Accuracy (EX), the widely used metric for evaluating the effectiveness of Natural Language to SQL (N

ArXiv cs.AI 📄 Paper 1w ago

LogicEval: A Systematic Framework for Evaluating Automated Repair Techniques for Logical Vulnerabilities in Real-World Software

arXiv:2604.12994v1 Announce Type: cross Abstract: Logical vulnerabilities in software stem from flaws in program logic rather than memory safety, which can lead

ArXiv cs.AI 📄 Paper 1w ago

One Token Away from Collapse: The Fragility of Instruction-Tuned Helpfulness

arXiv:2604.13006v1 Announce Type: cross Abstract: Instruction-tuned large language models produce helpful, structured responses, but how robust is this helpfuln

ArXiv cs.AI 📄 Paper 1w ago

Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation

arXiv:2604.13010v1 Announce Type: cross Abstract: On-policy distillation (OPD) has emerged as an efficient post-training paradigm for large language models. How

ArXiv cs.AI 📄 Paper 1w ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

arXiv:2604.13016v1 Announce Type: cross Abstract: On-policy distillation (OPD) has become a core technique in the post-training of large language models, yet it

ArXiv cs.AI 📄 Paper 1w ago

Representation geometry shapes task performance in vision-language modeling for CT enterography

arXiv:2604.13021v1 Announce Type: cross Abstract: Computed tomography (CT) enterography is a primary imaging modality for assessing inflammatory bowel disease (

ArXiv cs.AI 📄 Paper 1w ago

Visual Preference Optimization with Rubric Rewards

arXiv:2604.13029v1 Announce Type: cross Abstract: The effectiveness of Direct Preference Optimization (DPO) depends on preference data that reflect the quality