5,298 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 5,298 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (15180) ArXiv cs.AIDev.to AIDev.to · FORUM WEBForbes InnovationMedium · ProgrammingMedium · AI
ArXiv cs.AI 📄 Paper 1w ago
HiFloat4 Format for Language Model Pre-training on Ascend NPUs
arXiv:2604.08826v1 Announce Type: cross Abstract: Large foundation models have become central to modern machine learning, with performance scaling predictably w
ArXiv cs.AI 📄 Paper 1w ago
Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs
arXiv:2604.08846v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) have been shown to be vulnerable to malicious queries that can elicit
ArXiv cs.AI 📄 Paper 1w ago
Scalable High-Recall Constraint-Satisfaction-Based Information Retrieval for Clinical Trials Matching
arXiv:2604.08849v1 Announce Type: cross Abstract: Clinical trials are central to evidence-based medicine, yet many struggle to meet enrollment targets, despite
ArXiv cs.AI 📄 Paper 1w ago
AI-Induced Human Responsibility (AIHR) in AI-Human teams
arXiv:2604.08866v1 Announce Type: cross Abstract: As organizations increasingly deploy AI as a teammate rather than a standalone tool, morally consequential mis
ArXiv cs.AI 📄 Paper 1w ago
AudioGuard: Toward Comprehensive Audio Safety Protection Across Diverse Threat Models
arXiv:2604.08867v1 Announce Type: cross Abstract: Audio has rapidly become a primary interface for foundation models, powering real-time voice assistants. Ensur
ArXiv cs.AI 📄 Paper 1w ago
MedFormer-UR: Uncertainty-Routed Transformer for Medical Image Classification
arXiv:2604.08868v1 Announce Type: cross Abstract: To ensure safe clinical integration, deep learning models must provide more than just high accuracy; they requ
ArXiv cs.AI 📄 Paper 1w ago
Temporal Dropout Risk in Learning Analytics: A Harmonized Survival Benchmark Across Dynamic and Early-Window Representations
arXiv:2604.08870v1 Announce Type: cross Abstract: Student dropout is a persistent concern in Learning Analytics, yet comparative studies frequently evaluate pre
ArXiv cs.AI 📄 Paper 1w ago
A Mathematical Framework for Temporal Modeling and Counterfactual Policy Simulation of Student Dropout
arXiv:2604.08874v1 Announce Type: cross Abstract: This study proposes a temporal modeling framework with a counterfactual policy-simulation layer for student dr
ArXiv cs.AI 📄 Paper 1w ago
Revisiting the Capacity Gap in Chain-of-Thought Distillation from a Practical Perspective
arXiv:2604.08880v1 Announce Type: cross Abstract: Chain-of-thought (CoT) distillation transfers reasoning behaviors from a strong teacher to a smaller student,
ArXiv cs.AI 📄 Paper 1w ago
HTNav: A Hybrid Navigation Framework with Tiered Structure for Urban Aerial Vision-and-Language Navigation
arXiv:2604.08883v1 Announce Type: cross Abstract: Inspired by the general Vision-and-Language Navigation (VLN) task, aerial VLN has attracted widespread attenti
ArXiv cs.AI 📄 Paper 1w ago
HM-Bench: A Comprehensive Benchmark for Multimodal Large Language Models in Hyperspectral Remote Sensing
arXiv:2604.08884v1 Announce Type: cross Abstract: While multimodal large language models (MLLMs) have made significant strides in natural image understanding, t
ArXiv cs.AI 📄 Paper 1w ago
A Closer Look at the Application of Causal Inference in Graph Representation Learning
arXiv:2604.08890v1 Announce Type: cross Abstract: Modeling causal relationships in graph representation learning remains a fundamental challenge. Existing appro
ArXiv cs.AI 📄 Paper 1w ago
Adaptive Dual Residual U-Net with Attention Gate and Multiscale Spatial Attention Mechanisms (ADRUwAMS)
arXiv:2604.08893v1 Announce Type: cross Abstract: Glioma is a harmful brain tumor that requires early detection to ensure better health results. Early detection
ArXiv cs.AI 📄 Paper 1w ago
Ge$^\text{2}$mS-T: Multi-Dimensional Grouping for Ultra-High Energy Efficiency in Spiking Transformer
arXiv:2604.08894v1 Announce Type: cross Abstract: Spiking Neural Networks (SNNs) offer superior energy efficiency over Artificial Neural Networks (ANNs). Howeve
ArXiv cs.AI 📄 Paper 1w ago
Large-Scale Universal Defect Generation: Foundation Models and Datasets
arXiv:2604.08915v1 Announce Type: cross Abstract: Existing defect/anomaly generation methods often rely on few-shot learning, which overfits to specific defect
ArXiv cs.AI 📄 Paper 1w ago
Beyond Relevance: Utility-Centric Retrieval in the LLM Era
arXiv:2604.08920v1 Announce Type: cross Abstract: Information retrieval systems have traditionally optimized for topical relevance-the degree to which retrieved
ArXiv cs.AI 📄 Paper 1w ago
MuTSE: A Human-in-the-Loop Multi-use Text Simplification Evaluator
arXiv:2604.08947v1 Announce Type: cross Abstract: As Large Language Models (LLMs) become increasingly prevalent in text simplification, systematically evaluatin
ArXiv cs.AI 📄 Paper 1w ago
WOMBET: World Model-based Experience Transfer for Robust and Sample-efficient Reinforcement Learning
arXiv:2604.08958v1 Announce Type: cross Abstract: Reinforcement learning (RL) in robotics is often limited by the cost and risk of data collection, motivating e
ArXiv cs.AI 📄 Paper 1w ago
Aligned Agents, Biased Swarm: Measuring Bias Amplification in Multi-Agent Systems
arXiv:2604.08963v1 Announce Type: cross Abstract: While Multi-Agent Systems (MAS) are increasingly deployed for complex workflows, their emergent properties-par
ArXiv cs.AI 📄 Paper 1w ago
Litmus (Re)Agent: A Benchmark and Agentic System for Predictive Evaluation of Multilingual Models
arXiv:2604.08970v1 Announce Type: cross Abstract: We study predictive multilingual evaluation: estimating how well a model will perform on a task in a target la