📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours · View all news

arXiv:2603.27942v1 Announce Type: cross Abstract: Japanese scene text poses challenges that multilingual benchmarks often fail to capture, including mixed scrip

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

CDH-Bench: A Commonsense-Driven Hallucination Benchmark for Evaluating Visual Fidelity in Vision-Language Models

arXiv:2603.27982v1 Announce Type: cross Abstract: Vision-language models (VLMs) achieve strong performance on many benchmarks, yet a basic reliability question

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

FedFG: Privacy-Preserving and Robust Federated Learning via Flow-Matching Generation

arXiv:2603.27986v1 Announce Type: cross Abstract: Federated learning (FL) enables distributed clients to collaboratively train a global model using local privat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Beyond Dataset Distillation: Lossless Dataset Concentration via Diffusion-Assisted Distribution Alignment

arXiv:2603.27987v1 Announce Type: cross Abstract: The high cost and accessibility problem associated with large datasets hinder the development of large-scale v

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

ViviDoc: Generating Interactive Documents through Human-Agent Collaboration

arXiv:2603.27991v1 Announce Type: cross Abstract: Interactive documents help readers engage with complex ideas through dynamic visualization, interactive animat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Kill-Chain Canaries: Stage-Level Tracking of Prompt Injection Across Attack Surfaces and Model Safety Tiers

arXiv:2603.28013v1 Announce Type: cross Abstract: We present a stage-decomposed analysis of prompt injection attacks against five frontier LLM agents. Prior wor

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

arXiv:2603.28032v1 Announce Type: cross Abstract: The convergence of low-altitude economies, embodied intelligence, and air-ground cooperative systems creates g

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago

Bit-Identical Medical Deep Learning via Structured Orthogonal Initialization

arXiv:2603.28040v1 Announce Type: cross Abstract: Deep learning training is non-deterministic: identical code with different random seeds produces models that a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Synonymix: Unified Group Personas for Generative Simulations

arXiv:2603.28066v1 Announce Type: cross Abstract: Generative agent simulations operate at two scales: individual personas for character interaction, and populat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

MolmoPoint: Better Pointing for VLMs with Grounding Tokens

arXiv:2603.28069v1 Announce Type: cross Abstract: Grounding has become a fundamental capability of vision-language models (VLMs). Most existing VLMs point by ge

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions

arXiv:2603.28086v1 Announce Type: cross Abstract: Voice design from natural language aims to generate speaker timbres directly from free-form textual descriptio

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models

arXiv:2603.28103v1 Announce Type: cross Abstract: Parliamentary proceedings represent a rich yet challenging resource for computational analysis, particularly w

ArXiv cs.AI 🛠️ AI Tools & Apps 📄 Paper ⚡ AI Lesson 1w ago

Quid est VERITAS? A Modular Framework for Archival Document Analysis

arXiv:2603.28108v1 Announce Type: cross Abstract: The digitisation of historical documents has traditionally been conceived as a process limited to character-le

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Q-DIVER: Integrated Quantum Transfer Learning and Differentiable Quantum Architecture Search with EEG Data

arXiv:2603.28122v1 Announce Type: cross Abstract: Integrating quantum circuits into deep learning pipelines remains challenging due to heuristic design limitati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Does Claude's Constitution Have a Culture?

arXiv:2603.28123v1 Announce Type: cross Abstract: Constitutional AI (CAI) aligns language models with explicitly stated normative principles, offering a transpa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

arXiv:2603.28130v1 Announce Type: cross Abstract: We introduce Multilingual Document Parsing Benchmark, the first benchmark for multilingual digital and photogr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

RecycleLoRA: Rank-Revealing QR-Based Dual-LoRA Subspace Adaptation for Domain Generalized Semantic Segmentation

arXiv:2603.28142v1 Announce Type: cross Abstract: Domain Generalized Semantic Segmentation (DGSS) aims to maintain robust performance across unseen target domai

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Evaluating Privilege Usage of Agents on Real-World Tools

arXiv:2603.28166v1 Announce Type: cross Abstract: Equipping LLM agents with real-world tools can substantially improve productivity. However, granting agents au

ArXiv cs.AI 🛠️ AI Tools & Apps 📄 Paper ⚡ AI Lesson 1w ago

Skillful Kilometer-Scale Regional Weather Forecasting via Global and Regional Coupling

arXiv:2603.28173v1 Announce Type: cross Abstract: Data-driven weather models have advanced global medium-range forecasting, yet high-resolution regional predict

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

Designing AI for Real Users -- Accessibility Gaps in Retail AI Front-End

arXiv:2603.28196v1 Announce Type: cross Abstract: As AI becomes embedded in customer-facing systems, ethical scrutiny has largely focused on models, data, and g

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

ERPO: Token-Level Entropy-Regulated Policy Optimization for Large Reasoning Models

arXiv:2603.28204v1 Announce Type: cross Abstract: Reinforcement learning from verifiable rewards (RLVR) has significantly advanced the reasoning capabilities of

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

An Optimal Battery-Free Approach for Emission Reduction by Storing Solar Surplus in Building Thermal Mass

arXiv:2603.28217v1 Announce Type: cross Abstract: Decarbonization in buildings calls for advanced control strategies that coordinate on-site renewables, grid el

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago

TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation

arXiv:2603.28233v1 Announce Type: cross Abstract: Accurate and efficient perception is essential for autonomous driving, where segmentation tasks such as drivab

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

DiffAttn: Diffusion-Based Drivers' Visual Attention Prediction with LLM-Enhanced Semantic Reasoning

arXiv:2603.28251v1 Announce Type: cross Abstract: Drivers' visual attention provides critical cues for anticipating latent hazards and directly shapes decision-