📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 1,234 articles · Updated every 3 hours · View all news

arXiv:2512.13607v2 Announce Type: replace-cross Abstract: Building general-purpose reasoning models with reinforcement learning (RL) entails substantial cross-d

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations

arXiv:2512.14080v2 Announce Type: replace-cross Abstract: Mixture of Experts (MoE) models have emerged as the de facto architecture for scaling up language mode

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

PathFinder: Advancing Path Loss Prediction for Single-to-Multi-Transmitter Scenario

arXiv:2512.14150v3 Announce Type: replace-cross Abstract: Radio path loss prediction (RPP) is critical for optimizing 5G networks and enabling IoT, smart city,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Dual-objective Language Models: Training Efficiency Without Overfitting

arXiv:2512.14549v3 Announce Type: replace-cross Abstract: This paper combines autoregressive and masked-diffusion training objectives without any architectural

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

MRG-R1: Reinforcement Learning for Clinically Aligned Medical Report Generation

arXiv:2512.16145v2 Announce Type: replace-cross Abstract: Medical report generation aims to automatically produce radiology-style reports from medical images, s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Hearing to Translate: The Effectiveness of Speech Modality Integration into LLMs

arXiv:2512.16378v3 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) expand beyond text, integrating speech as a native modality has given

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

The Dual-State Architecture for Reliable LLM Agents

arXiv:2512.20660v2 Announce Type: replace-cross Abstract: Large Language Models deployed as code generation agents exhibit stochastic behavior incompatible with

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1w ago

RoAD Benchmark: How LiDAR Models Fail under Coupled Domain Shifts and Label Evolution

arXiv:2601.07855v2 Announce Type: replace-cross Abstract: For 3D perception systems to operate reliably in real-world environments, they must remain robust to e

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

Incorporating Q&A Nuggets into Retrieval-Augmented Generation

arXiv:2601.13222v2 Announce Type: replace-cross Abstract: RAGE systems integrate ideas from automatic evaluation (E) into Retrieval-augmented Generation (RAG).

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?

arXiv:2601.13227v2 Announce Type: replace-cross Abstract: RAG systems are increasingly evaluated and optimized using LLM judges, an approach that is rapidly bec

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1w ago

CARPE: Context-Aware Image Representation Prioritization via Ensemble for Large Vision-Language Models

arXiv:2601.13622v3 Announce Type: replace-cross Abstract: Large vision-language models (LVLMs) are typically trained using autoregressive language modeling obje

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

NRR-Phi: Text-to-State Mapping for Ambiguity Preservation in LLM Inference

arXiv:2601.19933v5 Announce Type: replace-cross Abstract: Large language models exhibit a systematic tendency toward early semantic commitment: given ambiguous

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Values from Casual Conversations

arXiv:2601.22440v2 Announce Type: replace-cross Abstract: Does AI understand human values? While this remains an open philosophical question, we take a pragmati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

EDU-CIRCUIT-HW: Evaluating Multimodal Large Language Models on Real-World University-Level STEM Student Handwritten Solutions

arXiv:2602.00095v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) hold significant promise for revolutionizing traditional educ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

PISCO: Precise Video Instance Insertion with Sparse Control

arXiv:2602.08277v2 Announce Type: replace-cross Abstract: The landscape of AI video generation is undergoing a pivotal shift: moving beyond general generation -

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

SWE Context Bench: A Benchmark for Context Learning in Coding

arXiv:2602.08316v2 Announce Type: replace-cross Abstract: Large language models are increasingly used as programming agents for repository level software engine

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap

arXiv:2602.09678v2 Announce Type: replace-cross Abstract: Since 1887, administrative law has navigated a "capability-accountability trap": technological change

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

The Effective Depth Paradox: Evaluating the Relationship between Architectural Topology and Trainability in Deep CNNs

arXiv:2602.13298v2 Announce Type: replace-cross Abstract: This paper investigates the relationship between convolutional neural network (CNN) and image recognit

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference

arXiv:2602.18846v2 Announce Type: replace-cross Abstract: Vision-language models (VLMs) have achieved remarkable multimodal understanding and reasoning capabili

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

PedaCo-Gen: Scaffolding Pedagogical Agency in Human-AI Collaborative Video Authoring

arXiv:2602.19623v2 Announce Type: replace-cross Abstract: While advancements in Text-to-Video (T2V) generative AI offer a promising path toward democratizing co

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Golden Layers and Where to Find Them: Improved Knowledge Editing for Large Language Models Via Layer Gradient Analysis

arXiv:2602.20207v2 Announce Type: replace-cross Abstract: Knowledge editing in Large Language Models (LLMs) aims to update the model's prediction for a specific

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1w ago

Towards single-shot coherent imaging via overlap-free ptychography

arXiv:2602.21361v2 Announce Type: replace-cross Abstract: Ptychographic imaging at synchrotron and XFEL sources requires dense overlapping scans, limiting throu

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

SpotIt+: Verification-based Text-to-SQL Evaluation with Database Constraints

arXiv:2603.04334v2 Announce Type: replace-cross Abstract: We present SpotIt+, an open-source tool for evaluating Text-to-SQL systems via bounded equivalence ver

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization

arXiv:2603.14267v3 Announce Type: replace-cross Abstract: Video dubbing has broad applications in filmmaking, multimedia creation, and assistive speech technolo