📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 8,253 articles · Updated every 3 hours
ArXiv cs.AI
🤖 AI Agents & Automation
3w ago
Rewriting Video: Text-Driven Reauthoring of Video Footage
arXiv:2601.08565v2 Announce Type: replace-cross Abstract: Video is a powerful medium for communication and storytelling, yet reauthoring existing footage remain
ArXiv cs.AI
🧠 Large Language Models
3w ago
Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning
arXiv:2601.11109v3 Announce Type: replace-cross Abstract: Vision-as-inverse-graphics, the concept of reconstructing images into editable programs, remains chall
ArXiv cs.AI
💻 AI-Assisted Coding
3w ago
How AI Coding Agents Modify Code: A Large-Scale Study of GitHub Pull Requests
arXiv:2601.17581v3 Announce Type: replace-cross Abstract: AI coding agents are increasingly acting as autonomous contributors by generating and submitting pull
ArXiv cs.AI
📐 ML Fundamentals
3w ago
Teaching Machine Learning Fundamentals with LEGO Robotics
arXiv:2601.19376v2 Announce Type: replace-cross Abstract: This paper presents the web-based platform Machine Learning with Bricks and an accompanying two-day co
ArXiv cs.AI
🧠 Large Language Models
3w ago
Self-Improving Pretraining: using post-trained models to pretrain better models
arXiv:2601.21343v3 Announce Type: replace-cross Abstract: Large language models are classically trained in stages: pretraining on raw text followed by post-trai
ArXiv cs.AI
🧠 Large Language Models
3w ago
Predicting Intermittent Job Failure Categories for Diagnosis Using Few-Shot Fine-Tuned Language Models
arXiv:2601.22264v2 Announce Type: replace-cross Abstract: In principle, Continuous Integration (CI) pipeline failures provide valuable feedback to developers on
ArXiv cs.AI
🧠 Large Language Models
3w ago
InfoTok: Information-Theoretic Regularization for Capacity-Constrained Shared Visual Tokenization in Unified MLLMs
arXiv:2602.01554v2 Announce Type: replace-cross Abstract: Unified multimodal large language models (MLLMs) aim to unify image understanding and image generation
ArXiv cs.AI
🧠 Large Language Models
3w ago
RASA: Routing-Aware Safety Alignment for Mixture-of-Experts Models
arXiv:2602.04448v2 Announce Type: replace-cross Abstract: Mixture-of-Experts (MoE) language models introduce unique challenges for safety alignment due to their
ArXiv cs.AI
📐 ML Fundamentals
3w ago
ST-BiBench: Benchmarking Multi-Stream Multimodal Coordination in Bimanual Embodied Tasks for MLLMs
arXiv:2602.08392v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) have significantly advanced the landscape of embodied AI, yet
ArXiv cs.AI
🧠 Large Language Models
3w ago
LLMs Encode Their Failures: Predicting Success from Pre-Generation Activations
arXiv:2602.09924v3 Announce Type: replace-cross Abstract: Running LLMs with extended reasoning on every problem is expensive, but determining which inputs actua
ArXiv cs.AI
🤖 AI Agents & Automation
3w ago
MoltNet: Understanding Social Behavior of AI Agents in the Agent-Native MoltBook
arXiv:2602.13458v2 Announce Type: replace-cross Abstract: Large-scale communities of AI agents are becoming increasingly prevalent, creating new environments fo
ArXiv cs.AI
🧠 Large Language Models
3w ago
WIMLE: Uncertainty-Aware World Models with IMLE for Sample-Efficient Continuous Control
arXiv:2602.14351v2 Announce Type: replace-cross Abstract: Model-based reinforcement learning promises strong sample efficiency but often underperforms in practi
ArXiv cs.AI
🧠 Large Language Models
3w ago
Explainable Token-level Noise Filtering for LLM Fine-tuning Datasets
arXiv:2602.14536v3 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have seen remarkable advancements, achieving state-of-the-art results in
ArXiv cs.AI
🧠 Large Language Models
3w ago
Flow Map Language Models: One-step Language Modeling via Continuous Denoising
arXiv:2602.16813v2 Announce Type: replace-cross Abstract: Language models based on discrete diffusion have attracted widespread interest for their potential to
ArXiv cs.AI
🧠 Large Language Models
3w ago
Autorubric: Unifying Rubric-based LLM Evaluation
arXiv:2603.00077v2 Announce Type: replace-cross Abstract: Techniques for reliable rubric-based LLM evaluation -- ensemble judging, bias mitigation, few-shot cal
ArXiv cs.AI
🤖 AI Agents & Automation
3w ago
"When to Hand Off, When to Work Together": Expanding Human-Agent Co-Creative Collaboration through Concurrent Interaction
arXiv:2603.02050v3 Announce Type: replace-cross Abstract: Human collaborators coordinate dynamically through process visibility and workspace awareness, yet AI
ArXiv cs.AI
📐 ML Fundamentals
3w ago
The Malignant Tail: Spectral Segregation of Label Noise in Over-Parameterized Networks
arXiv:2603.02293v2 Announce Type: replace-cross Abstract: While implicit regularization facilitates benign overfitting in low-noise regimes, recent theoretical
ArXiv cs.AI
🤖 AI Agents & Automation
3w ago
Mathematicians in the age of AI
arXiv:2603.03684v3 Announce Type: replace-cross Abstract: Recent developments show that AI can prove research-level theorems in mathematics, both formally and i
ArXiv cs.AI
🤖 AI Agents & Automation
3w ago
NePPO: Near-Potential Policy Optimization for General-Sum Multi-Agent Reinforcement Learning
arXiv:2603.06977v2 Announce Type: replace-cross Abstract: Multi-agent reinforcement learning (MARL) is increasingly used to design learning-enabled agents that
ArXiv cs.AI
🤖 AI Agents & Automation
3w ago
PlayWorld: Learning Robot World Models from Autonomous Play
arXiv:2603.09030v3 Announce Type: replace-cross Abstract: Action-conditioned video models offer a promising path to building general-purpose robot simulators th
ArXiv cs.AI
🧠 Large Language Models
3w ago
Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction
arXiv:2603.10047v2 Announce Type: replace-cross Abstract: Hallucinations in large language models (LLMs) are outputs that are syntactically coherent but factual
ArXiv cs.AI
📐 ML Fundamentals
3w ago
Hindsight-Anchored Policy Optimization: Turning Failure into Feedback in Sparse Reward Settings
arXiv:2603.11321v2 Announce Type: replace-cross Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a promising paradigm for post-tra
ArXiv cs.AI
🧠 Large Language Models
3w ago
Truth as a Compression Artifact in Language Model Training
arXiv:2603.11749v3 Announce Type: replace-cross Abstract: Why do language models trained on contradictory data prefer correct answers? In controlled experiments
ArXiv cs.AI
🤖 AI Agents & Automation
3w ago
Security Considerations for Artificial Intelligence Agents
arXiv:2603.12230v2 Announce Type: replace-cross Abstract: This article, a lightly adapted version of Perplexity's response to NIST/CAISI Request for Information