📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,169 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (8687) ArXiv cs.AI Forbes Innovation OpenAI News Dev.to AI Hugging Face Blog Hackernoon

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 2d ago

Integer-Only Operations on Extreme Learning Machine Test Time Classification

arXiv:2604.04363v1 Announce Type: cross Abstract: We present a theoretical analysis and empirical evaluations of a novel set of techniques for computational cos

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 2d ago

Context is All You Need

arXiv:2604.04364v1 Announce Type: cross Abstract: Artificial Neural Networks (ANNs) are increasingly deployed across diverse real-world settings, where they mus

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Towards Considerate Human-Robot Coexistence: A Dual-Space Framework of Robot Design and Human Perception in Healthcare

arXiv:2604.04374v1 Announce Type: cross Abstract: The rapid advancement of robotics, spanning expanded capabilities, more intuitive interaction, and more integr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Compressible Softmax-Attended Language under Incompressible Attention

arXiv:2604.04384v1 Announce Type: cross Abstract: Across every attention head in five transformer language models (124M--7B parameters, four architecture famili

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

How Alignment Routes: Localizing, Scaling, and Controlling Policy Circuits in Language Models

arXiv:2604.04385v1 Announce Type: cross Abstract: We identify a recurring sparse routing mechanism in alignment-trained language models: a gate attention head r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Relative Density Ratio Optimization for Stable and Statistically Consistent Model Alignment

arXiv:2604.04410v1 Announce Type: cross Abstract: Aligning language models with human preferences is essential for ensuring their safety and reliability. Althou

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Responses Fall Short of Understanding: Revealing the Gap between Internal Representations and Responses in Visual Document Understanding

arXiv:2604.04411v1 Announce Type: cross Abstract: Visual document understanding (VDU) is a challenging task for large vision language models (LVLMs), requiring

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

arXiv:2604.04418v1 Announce Type: cross Abstract: As LLMs are deployed in high-stakes settings, users must judge the correctness of individual responses, often

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Is Prompt Selection Necessary for Task-Free Online Continual Learning?

arXiv:2604.04420v1 Announce Type: cross Abstract: Task-free online continual learning has recently emerged as a realistic paradigm for addressing continual lear

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Training Transformers in Cosine Coefficient Space

arXiv:2604.04440v1 Announce Type: cross Abstract: We parameterize the weight matrices of a transformer in the two-dimensional discrete cosine transform (DCT) do

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Conversational Control with Ontologies for Large Language Models: A Lightweight Framework for Constrained Generation

arXiv:2604.04450v1 Announce Type: cross Abstract: Conversational agents based on Large Language Models (LLMs) have recently emerged as powerful tools for human-

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

DP-OPD: Differentially Private On-Policy Distillation for Language Models

arXiv:2604.04461v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly adapted to proprietary and domain-specific corpora that contain

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

MC-GenRef: Annotation-free mammography microcalcification segmentation with generative posterior refinement

arXiv:2604.04470v1 Announce Type: cross Abstract: Microcalcification (MC) analysis is clinically important in screening mammography because clustered puncta can

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2d ago

MAVEN: A Mesh-Aware Volumetric Encoding Network for Simulating 3D Flexible Deformation

arXiv:2604.04474v1 Announce Type: cross Abstract: Deep learning-based approaches, particularly graph neural networks (GNNs), have gained prominence in simulatin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Discrete Prototypical Memories for Federated Time Series Foundation Models

arXiv:2604.04475v1 Announce Type: cross Abstract: Leveraging Large Language Models (LLMs) as federated learning (FL)-based time series foundation models offers

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

ECG Biometrics with ArcFace-Inception: External Validation on MIMIC and HEEDB

arXiv:2604.04485v1 Announce Type: cross Abstract: ECG biometrics has been studied mainly on small cohorts and short inter-session intervals, leaving open how id

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

RAVEN: Radar Adaptive Vision Encoders for Efficient Chirp-wise Object Detection and Segmentation

arXiv:2604.04490v1 Announce Type: cross Abstract: This paper presents RAVEN, a computationally efficient deep learning architecture for FMCW radar perception. T

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

SLaB: Sparse-Lowrank-Binary Decomposition for Efficient Large Language Models

arXiv:2604.04493v1 Announce Type: cross Abstract: The rapid growth of large language models (LLMs) presents significant deployment challenges due to their massi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

One Model for All: Multi-Objective Controllable Language Models

arXiv:2604.04497v1 Announce Type: cross Abstract: Aligning large language models (LLMs) with human preferences is critical for enhancing LLMs' safety, helpfulne

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

GAIN: Multiplicative Modulation for Domain Adaptation

arXiv:2604.04516v1 Announce Type: cross Abstract: Adapting LLMs to new domains causes forgetting because standard methods (full fine-tuning, LoRA) inject new di

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 2d ago

Reproducibility study on how to find Spurious Correlations, Shortcut Learning, Clever Hans or Group-Distributional non-robustness and how to fix them

arXiv:2604.04518v1 Announce Type: cross Abstract: Deep Neural Networks (DNNs) are increasingly utilized in high-stakes domains like medical diagnostics and auto

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2d ago

ENCRUST: Encapsulated Substitution and Agentic Refinement on a Live Scaffold for Safe C-to-Rust Translation

arXiv:2604.04527v1 Announce Type: cross Abstract: We present Encapsulated Substitution and Agentic Refinement on a Live Scaffold for Safe C-to-Rust Translation,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Multilingual Prompt Localization for Agent-as-a-Judge: Language and Backbone Sensitivity in Requirement-Level Evaluation

arXiv:2604.04532v1 Announce Type: cross Abstract: Evaluation language is typically treated as a fixed English default in agentic code benchmarks, yet we show th

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

StableTTA: Training-Free Test-Time Adaptation that Improves Model Accuracy on ImageNet1K to 96%

arXiv:2604.04552v1 Announce Type: cross Abstract: Ensemble methods are widely used to improve predictive performance, but their effectiveness often comes at the