3,273 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (8687) ArXiv cs.AIForbes InnovationOpenAI NewsDev.to AIHugging Face BlogHackernoon
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
ViviDoc: Generating Interactive Documents through Human-Agent Collaboration
arXiv:2603.27991v1 Announce Type: cross Abstract: Interactive documents help readers engage with complex ideas through dynamic visualization, interactive animat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Kill-Chain Canaries: Stage-Level Tracking of Prompt Injection Across Attack Surfaces and Model Safety Tiers
arXiv:2603.28013v1 Announce Type: cross Abstract: We present a stage-decomposed analysis of prompt injection attacks against five frontier LLM agents. Prior wor
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence
arXiv:2603.28032v1 Announce Type: cross Abstract: The convergence of low-altitude economies, embodied intelligence, and air-ground cooperative systems creates g
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago
Bit-Identical Medical Deep Learning via Structured Orthogonal Initialization
arXiv:2603.28040v1 Announce Type: cross Abstract: Deep learning training is non-deterministic: identical code with different random seeds produces models that a
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Synonymix: Unified Group Personas for Generative Simulations
arXiv:2603.28066v1 Announce Type: cross Abstract: Generative agent simulations operate at two scales: individual personas for character interaction, and populat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
MolmoPoint: Better Pointing for VLMs with Grounding Tokens
arXiv:2603.28069v1 Announce Type: cross Abstract: Grounding has become a fundamental capability of vision-language models (VLMs). Most existing VLMs point by ge
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions
arXiv:2603.28086v1 Announce Type: cross Abstract: Voice design from natural language aims to generate speaker timbres directly from free-form textual descriptio
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models
arXiv:2603.28103v1 Announce Type: cross Abstract: Parliamentary proceedings represent a rich yet challenging resource for computational analysis, particularly w
ArXiv cs.AI 🛠️ AI Tools & Apps 📄 Paper ⚡ AI Lesson 1w ago
Quid est VERITAS? A Modular Framework for Archival Document Analysis
arXiv:2603.28108v1 Announce Type: cross Abstract: The digitisation of historical documents has traditionally been conceived as a process limited to character-le
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Q-DIVER: Integrated Quantum Transfer Learning and Differentiable Quantum Architecture Search with EEG Data
arXiv:2603.28122v1 Announce Type: cross Abstract: Integrating quantum circuits into deep learning pipelines remains challenging due to heuristic design limitati
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Does Claude's Constitution Have a Culture?
arXiv:2603.28123v1 Announce Type: cross Abstract: Constitutional AI (CAI) aligns language models with explicitly stated normative principles, offering a transpa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios
arXiv:2603.28130v1 Announce Type: cross Abstract: We introduce Multilingual Document Parsing Benchmark, the first benchmark for multilingual digital and photogr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
RecycleLoRA: Rank-Revealing QR-Based Dual-LoRA Subspace Adaptation for Domain Generalized Semantic Segmentation
arXiv:2603.28142v1 Announce Type: cross Abstract: Domain Generalized Semantic Segmentation (DGSS) aims to maintain robust performance across unseen target domai
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Evaluating Privilege Usage of Agents on Real-World Tools
arXiv:2603.28166v1 Announce Type: cross Abstract: Equipping LLM agents with real-world tools can substantially improve productivity. However, granting agents au
ArXiv cs.AI 🛠️ AI Tools & Apps 📄 Paper ⚡ AI Lesson 1w ago
Skillful Kilometer-Scale Regional Weather Forecasting via Global and Regional Coupling
arXiv:2603.28173v1 Announce Type: cross Abstract: Data-driven weather models have advanced global medium-range forecasting, yet high-resolution regional predict
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
Designing AI for Real Users -- Accessibility Gaps in Retail AI Front-End
arXiv:2603.28196v1 Announce Type: cross Abstract: As AI becomes embedded in customer-facing systems, ethical scrutiny has largely focused on models, data, and g
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
ERPO: Token-Level Entropy-Regulated Policy Optimization for Large Reasoning Models
arXiv:2603.28204v1 Announce Type: cross Abstract: Reinforcement learning from verifiable rewards (RLVR) has significantly advanced the reasoning capabilities of
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
An Optimal Battery-Free Approach for Emission Reduction by Storing Solar Surplus in Building Thermal Mass
arXiv:2603.28217v1 Announce Type: cross Abstract: Decarbonization in buildings calls for advanced control strategies that coordinate on-site renewables, grid el
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago
TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation
arXiv:2603.28233v1 Announce Type: cross Abstract: Accurate and efficient perception is essential for autonomous driving, where segmentation tasks such as drivab
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
DiffAttn: Diffusion-Based Drivers' Visual Attention Prediction with LLM-Enhanced Semantic Reasoning
arXiv:2603.28251v1 Announce Type: cross Abstract: Drivers' visual attention provides critical cues for anticipating latent hazards and directly shapes decision-