📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 5,298 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (15180) ArXiv cs.AI Dev.to AI Dev.to · FORUM WEB Forbes Innovation Medium · Programming Medium · AI

Lessons Without Borders? Evaluating Cultural Alignment of LLMs Using Multilingual Story Moral Generation

arXiv:2604.08797v1 Announce Type: cross Abstract: Stories are key to transmitting values across cultures, but their interpretation varies across linguistic and

ArXiv cs.AI 📄 Paper 1w ago

Scrapyard AI

arXiv:2604.08803v1 Announce Type: cross Abstract: This paper considers AI model churn as an opportunity for frugal investigation of large AI models. It describe

ArXiv cs.AI 📄 Paper 1w ago

Building Better Environments for Autonomous Cyber Defence

arXiv:2604.08805v1 Announce Type: cross Abstract: In November 2025, the authors ran a workshop on the topic of what makes a good reinforcement learning (RL) env

ArXiv cs.AI 📄 Paper 1w ago

SenBen: Sensitive Scene Graphs for Explainable Content Moderation

arXiv:2604.08819v1 Announce Type: cross Abstract: Content moderation systems classify images as safe or unsafe but lack spatial grounding and interpretability:

ArXiv cs.AI 📄 Paper 1w ago

HiFloat4 Format for Language Model Pre-training on Ascend NPUs

arXiv:2604.08826v1 Announce Type: cross Abstract: Large foundation models have become central to modern machine learning, with performance scaling predictably w

ArXiv cs.AI 📄 Paper 1w ago

Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs

arXiv:2604.08846v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) have been shown to be vulnerable to malicious queries that can elicit

ArXiv cs.AI 📄 Paper 1w ago

Scalable High-Recall Constraint-Satisfaction-Based Information Retrieval for Clinical Trials Matching

arXiv:2604.08849v1 Announce Type: cross Abstract: Clinical trials are central to evidence-based medicine, yet many struggle to meet enrollment targets, despite

ArXiv cs.AI 📄 Paper 1w ago

AI-Induced Human Responsibility (AIHR) in AI-Human teams

arXiv:2604.08866v1 Announce Type: cross Abstract: As organizations increasingly deploy AI as a teammate rather than a standalone tool, morally consequential mis

ArXiv cs.AI 📄 Paper 1w ago

AudioGuard: Toward Comprehensive Audio Safety Protection Across Diverse Threat Models

arXiv:2604.08867v1 Announce Type: cross Abstract: Audio has rapidly become a primary interface for foundation models, powering real-time voice assistants. Ensur

ArXiv cs.AI 📄 Paper 1w ago

MedFormer-UR: Uncertainty-Routed Transformer for Medical Image Classification

arXiv:2604.08868v1 Announce Type: cross Abstract: To ensure safe clinical integration, deep learning models must provide more than just high accuracy; they requ

ArXiv cs.AI 📄 Paper 1w ago

Temporal Dropout Risk in Learning Analytics: A Harmonized Survival Benchmark Across Dynamic and Early-Window Representations

arXiv:2604.08870v1 Announce Type: cross Abstract: Student dropout is a persistent concern in Learning Analytics, yet comparative studies frequently evaluate pre

ArXiv cs.AI 📄 Paper 1w ago

A Mathematical Framework for Temporal Modeling and Counterfactual Policy Simulation of Student Dropout

arXiv:2604.08874v1 Announce Type: cross Abstract: This study proposes a temporal modeling framework with a counterfactual policy-simulation layer for student dr

ArXiv cs.AI 📄 Paper 1w ago

Revisiting the Capacity Gap in Chain-of-Thought Distillation from a Practical Perspective

arXiv:2604.08880v1 Announce Type: cross Abstract: Chain-of-thought (CoT) distillation transfers reasoning behaviors from a strong teacher to a smaller student,

ArXiv cs.AI 📄 Paper 1w ago

HTNav: A Hybrid Navigation Framework with Tiered Structure for Urban Aerial Vision-and-Language Navigation

arXiv:2604.08883v1 Announce Type: cross Abstract: Inspired by the general Vision-and-Language Navigation (VLN) task, aerial VLN has attracted widespread attenti

ArXiv cs.AI 📄 Paper 1w ago

HM-Bench: A Comprehensive Benchmark for Multimodal Large Language Models in Hyperspectral Remote Sensing

arXiv:2604.08884v1 Announce Type: cross Abstract: While multimodal large language models (MLLMs) have made significant strides in natural image understanding, t

ArXiv cs.AI 📄 Paper 1w ago

A Closer Look at the Application of Causal Inference in Graph Representation Learning

arXiv:2604.08890v1 Announce Type: cross Abstract: Modeling causal relationships in graph representation learning remains a fundamental challenge. Existing appro

ArXiv cs.AI 📄 Paper 1w ago

Adaptive Dual Residual U-Net with Attention Gate and Multiscale Spatial Attention Mechanisms (ADRUwAMS)

arXiv:2604.08893v1 Announce Type: cross Abstract: Glioma is a harmful brain tumor that requires early detection to ensure better health results. Early detection

ArXiv cs.AI 📄 Paper 1w ago

Ge$^\text{2}$mS-T: Multi-Dimensional Grouping for Ultra-High Energy Efficiency in Spiking Transformer

arXiv:2604.08894v1 Announce Type: cross Abstract: Spiking Neural Networks (SNNs) offer superior energy efficiency over Artificial Neural Networks (ANNs). Howeve

ArXiv cs.AI 📄 Paper 1w ago

Large-Scale Universal Defect Generation: Foundation Models and Datasets

arXiv:2604.08915v1 Announce Type: cross Abstract: Existing defect/anomaly generation methods often rely on few-shot learning, which overfits to specific defect

ArXiv cs.AI 📄 Paper 1w ago

Beyond Relevance: Utility-Centric Retrieval in the LLM Era

arXiv:2604.08920v1 Announce Type: cross Abstract: Information retrieval systems have traditionally optimized for topical relevance-the degree to which retrieved

ArXiv cs.AI 📄 Paper 1w ago

MuTSE: A Human-in-the-Loop Multi-use Text Simplification Evaluator

arXiv:2604.08947v1 Announce Type: cross Abstract: As Large Language Models (LLMs) become increasingly prevalent in text simplification, systematically evaluatin

ArXiv cs.AI 📄 Paper 1w ago

WOMBET: World Model-based Experience Transfer for Robust and Sample-efficient Reinforcement Learning

arXiv:2604.08958v1 Announce Type: cross Abstract: Reinforcement learning (RL) in robotics is often limited by the cost and risk of data collection, motivating e

ArXiv cs.AI 📄 Paper 1w ago

Aligned Agents, Biased Swarm: Measuring Bias Amplification in Multi-Agent Systems

arXiv:2604.08963v1 Announce Type: cross Abstract: While Multi-Agent Systems (MAS) are increasingly deployed for complex workflows, their emergent properties-par

ArXiv cs.AI 📄 Paper 1w ago

Litmus (Re)Agent: A Benchmark and Agentic System for Predictive Evaluation of Multilingual Models

arXiv:2604.08970v1 Announce Type: cross Abstract: We study predictive multilingual evaluation: estimating how well a model will perform on a task in a target la