📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (8687) ArXiv cs.AI Forbes Innovation OpenAI News Dev.to AI Hugging Face Blog Hackernoon

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2d ago

Low-Bitrate Video Compression through Semantic-Conditioned Diffusion

arXiv:2512.00408v2 Announce Type: replace-cross Abstract: Traditional video codecs optimized for pixel fidelity collapse at ultra-low bitrates and produce sever

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

ToG-Bench: Task-Oriented Spatio-Temporal Grounding in Egocentric Videos

arXiv:2512.03666v2 Announce Type: replace-cross Abstract: A core capability towards general embodied intelligence lies in localizing task-relevant objects from

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

MPCFormer: A physics-informed data-driven approach for explainable socially-aware autonomous driving

arXiv:2512.03795v2 Announce Type: replace-cross Abstract: Autonomous Driving (AD) vehicles still struggle to exhibit human-like behavior in highly dynamic and i

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 2d ago

ContextDrag: Precise Drag-Based Image Editing via Context-Preserving Token Injection and Position-Aligned Attention

arXiv:2512.08477v2 Announce Type: replace-cross Abstract: Drag-based image editing enables intuitive visual manipulation through point-based drag operations. Ex

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 2d ago

A fine-grained look at causal effects in causal spaces

arXiv:2512.11919v3 Announce Type: replace-cross Abstract: The notion of causal effect is fundamental across many scientific disciplines. Traditionally, quantita

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models

arXiv:2512.18388v2 Announce Type: replace-cross Abstract: Generative AI has democratized content creation, but popular chatbot-based interfaces often prioritize

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios

arXiv:2512.18470v5 Announce Type: replace-cross Abstract: Existing benchmarks for AI coding agents focus on isolated, single-issue tasks such as fixing a bug or

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Parallel Universes, Parallel Languages: A Comprehensive Study on LLM-based Multilingual Counterfactual Example Generation

arXiv:2601.00263v2 Announce Type: replace-cross Abstract: Counterfactuals refer to minimally edited inputs that cause a model's prediction to change, serving as

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Path Integral Solution for Dissipative Generative Dynamics

arXiv:2601.00860v2 Announce Type: replace-cross Abstract: Can purely mechanical systems generate intelligent language? We prove that dissipative quantum dynamic

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Bridging the Semantic Gap for Categorical Data Clustering via Large Language Models

arXiv:2601.01162v2 Announce Type: replace-cross Abstract: Categorical data are prevalent in domains such as healthcare, marketing, and bioinformatics, where clu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Projected Autoregression: Autoregressive Language Generation in Continuous State Space

arXiv:2601.04854v3 Announce Type: replace-cross Abstract: Standard autoregressive language models generate text by repeatedly selecting a discrete next token, c

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

Rewriting Video: Text-Driven Reauthoring of Video Footage

arXiv:2601.08565v2 Announce Type: replace-cross Abstract: Video is a powerful medium for communication and storytelling, yet reauthoring existing footage remain

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning

arXiv:2601.11109v3 Announce Type: replace-cross Abstract: Vision-as-inverse-graphics, the concept of reconstructing images into editable programs, remains chall

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2d ago

How AI Coding Agents Modify Code: A Large-Scale Study of GitHub Pull Requests

arXiv:2601.17581v3 Announce Type: replace-cross Abstract: AI coding agents are increasingly acting as autonomous contributors by generating and submitting pull

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 2d ago

Teaching Machine Learning Fundamentals with LEGO Robotics

arXiv:2601.19376v2 Announce Type: replace-cross Abstract: This paper presents the web-based platform Machine Learning with Bricks and an accompanying two-day co

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Self-Improving Pretraining: using post-trained models to pretrain better models

arXiv:2601.21343v3 Announce Type: replace-cross Abstract: Large language models are classically trained in stages: pretraining on raw text followed by post-trai

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Predicting Intermittent Job Failure Categories for Diagnosis Using Few-Shot Fine-Tuned Language Models

arXiv:2601.22264v2 Announce Type: replace-cross Abstract: In principle, Continuous Integration (CI) pipeline failures provide valuable feedback to developers on

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

InfoTok: Information-Theoretic Regularization for Capacity-Constrained Shared Visual Tokenization in Unified MLLMs

arXiv:2602.01554v2 Announce Type: replace-cross Abstract: Unified multimodal large language models (MLLMs) aim to unify image understanding and image generation

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

RASA: Routing-Aware Safety Alignment for Mixture-of-Experts Models

arXiv:2602.04448v2 Announce Type: replace-cross Abstract: Mixture-of-Experts (MoE) language models introduce unique challenges for safety alignment due to their

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 2d ago

ST-BiBench: Benchmarking Multi-Stream Multimodal Coordination in Bimanual Embodied Tasks for MLLMs

arXiv:2602.08392v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) have significantly advanced the landscape of embodied AI, yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

LLMs Encode Their Failures: Predicting Success from Pre-Generation Activations

arXiv:2602.09924v3 Announce Type: replace-cross Abstract: Running LLMs with extended reasoning on every problem is expensive, but determining which inputs actua

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2d ago

MoltNet: Understanding Social Behavior of AI Agents in the Agent-Native MoltBook

arXiv:2602.13458v2 Announce Type: replace-cross Abstract: Large-scale communities of AI agents are becoming increasingly prevalent, creating new environments fo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

WIMLE: Uncertainty-Aware World Models with IMLE for Sample-Efficient Continuous Control

arXiv:2602.14351v2 Announce Type: replace-cross Abstract: Model-based reinforcement learning promises strong sample efficiency but often underperforms in practi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2d ago

Explainable Token-level Noise Filtering for LLM Fine-tuning Datasets

arXiv:2602.14536v3 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have seen remarkable advancements, achieving state-of-the-art results in