3,273 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (8687) ArXiv cs.AIForbes InnovationOpenAI NewsDev.to AIHugging Face BlogHackernoon
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Incoherence in Goal-Conditioned Autoregressive Models
arXiv:2510.06545v2 Announce Type: replace-cross Abstract: We investigate mathematically the notion of incoherence: a structural issue with reinforcement learnin
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
Fair Indivisible Payoffs through Shapley Value
arXiv:2510.24906v2 Announce Type: replace-cross Abstract: We consider the problem of payoff division in indivisible coalitional games, where the value of the gr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
E-Scores for (In)Correctness Assessment of Generative Model Outputs
arXiv:2510.25770v2 Announce Type: replace-cross Abstract: While generative models, especially large language models (LLMs), are ubiquitous in today's world, pri
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Benchmarking Educational LLMs with Analytics: A Case Study on Gender Bias in Feedback
arXiv:2511.08225v2 Announce Type: replace-cross Abstract: As teachers increasingly turn to GenAI in their educational practice, we need robust methods to benchm
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago
Seeing Beyond the Image: ECG and Anatomical Knowledge-Guided Myocardial Scar Segmentation from Late Gadolinium-Enhanced Images
arXiv:2511.14702v4 Announce Type: replace-cross Abstract: Accurate segmentation of myocardial scar from late gadolinium enhanced (LGE) cardiac MRI is essential
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
DuoTok: Source-Aware Dual-Track Tokenization for Multi-Track Music Language Modeling
arXiv:2511.20224v2 Announce Type: replace-cross Abstract: Audio tokenization bridges continuous waveforms and multi-track music language models. In dual-track m
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Structured Prompts Improve Evaluation of Language Models
arXiv:2511.20836v3 Announce Type: replace-cross Abstract: As language models (LMs) are increasingly adopted across domains, high-quality benchmarking frameworks
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
OmniFusion: Simultaneous Multilingual Multimodal Translations via Modular Fusion
arXiv:2512.00234v2 Announce Type: replace-cross Abstract: There has been significant progress in open-source text-only translation large language models (LLMs)
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1w ago
Enhancing Floor Plan Recognition: A Hybrid Mix-Transformer and U-Net Approach for Precise Wall Segmentation
arXiv:2512.02413v3 Announce Type: replace-cross Abstract: Automatic 3D reconstruction of indoor spaces from 2D floor plans necessitates high-precision semantic
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Lumos: Let there be Language Model System Certification
arXiv:2512.02966v2 Announce Type: replace-cross Abstract: We introduce the first principled framework, Lumos, for specifying and formally certifying Language Mo
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1w ago
Geometric-Photometric Event-based 3D Gaussian Ray Tracing
arXiv:2512.18640v2 Announce Type: replace-cross Abstract: Event cameras offer a high temporal resolution over traditional frame-based cameras, which makes them
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Bypassing Prompt Injection Detectors through Evasive Injections
arXiv:2602.00750v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly used in interactive and retrieval-augmented systems, but
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
On the Non-Identifiability of Steering Vectors in Large Language Models
arXiv:2602.06801v4 Announce Type: replace-cross Abstract: Activation steering methods are widely used to control large language model (LLM) behavior and are oft
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
FIRE: Frobenius-Isometry Reinitialization for Balancing the Stability-Plasticity Tradeoff
arXiv:2602.08040v3 Announce Type: replace-cross Abstract: Deep neural networks trained on nonstationary data must balance stability (i.e., retaining prior knowl
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Evaluating LLM-Generated ACSL Annotations for Formal Verification
arXiv:2602.13851v2 Announce Type: replace-cross Abstract: Formal specifications are crucial for building verifiable and dependable software systems, yet generat
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
CoCoDiff: Correspondence-Consistent Diffusion Model for Fine-grained Style Transfer
arXiv:2602.14464v2 Announce Type: replace-cross Abstract: Transferring visual style between images while preserving semantic correspondence between similar obje
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Chat-Based Support Alone May Not Be Enough: Comparing Conversational and Embedded LLM Feedback for Mathematical Proof Learning
arXiv:2602.18807v2 Announce Type: replace-cross Abstract: We evaluate GPTutor, an LLM-powered tutoring system for an undergraduate discrete mathematics course.
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving
arXiv:2602.23499v2 Announce Type: replace-cross Abstract: Collecting a high-quality dataset is a critical task that demands meticulous attention to detail, as o
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration
arXiv:2603.03823v4 Announce Type: replace-cross Abstract: Large language model (LLM)-powered agents have demonstrated strong capabilities in automating software
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago
Mousse: Rectifying the Geometry of Muon with Curvature-Aware Preconditioning
arXiv:2603.09697v2 Announce Type: replace-cross Abstract: Recent advances in spectral optimization, notably Muon, have demonstrated that constraining update ste