📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours

No More Stale Feedback: Co-Evolving Critics for Open-World Agent Learning

arXiv:2601.06794v2 Announce Type: replace Abstract: Critique-guided reinforcement learning (RL) has emerged as a powerful paradigm for training LLM agents by au

ArXiv cs.AI 📄 Paper 2d ago

PrivacyReasoner: Can LLM Emulate a Human-like Privacy Mind?

arXiv:2601.09152v2 Announce Type: replace Abstract: Prior work on LLM-based privacy focuses on norm judgment over synthetic vignettes, rather than how people th

ArXiv cs.AI 📄 Paper 2d ago

LatentRefusal: Latent-Signal Refusal for Unanswerable Text-to-SQL Queries

arXiv:2601.10398v3 Announce Type: replace Abstract: In LLM-based text-to-SQL systems, unanswerable and underspecified user queries may generate not only incorre

ArXiv cs.AI 📄 Paper 2d ago

WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents

arXiv:2603.05044v2 Announce Type: replace Abstract: Current paradigms for training GUI agents are fundamentally limited by a reliance on either unsafe, non-repr

ArXiv cs.AI 📄 Paper 2d ago

WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces

arXiv:2603.05295v3 Announce Type: replace Abstract: We introduce WebChain, the largest open-source dataset of human-annotated trajectories on real-world website

ArXiv cs.AI 📄 Paper 2d ago

A Survey of Multimodal Mathematical Reasoning: From Perception, Alignment to Reasoning

arXiv:2603.08291v3 Announce Type: replace Abstract: Multimodal Mathematical Reasoning (MMR) has recently attracted increasing attention for its capability to so

ArXiv cs.AI 📄 Paper 2d ago

Reasoning Graphs: Self-Improving, Deterministic RAG through Evidence-Centric Feedback

arXiv:2604.07595v2 Announce Type: replace Abstract: Language model agents reason from scratch on every query, discarding their chain of thought after each run.

ArXiv cs.AI 📄 Paper 2d ago

Pictorial and apictorial polygonal jigsaw puzzles from arbitrary number of crossing cuts

arXiv:2008.07644v3 Announce Type: replace-cross Abstract: Jigsaw puzzle solving, the problem of constructing a coherent whole from a set of non-overlapping unor

ArXiv cs.AI 📄 Paper 2d ago

Prompt Evolution for Generative AI: A Classifier-Guided Approach

arXiv:2305.16347v2 Announce Type: replace-cross Abstract: Synthesis of digital artifacts conditioned on user prompts has become an important paradigm facilitati

ArXiv cs.AI 📄 Paper 2d ago

A2-DIDM: Privacy-preserving Accumulator-enabled Auditing for Distributed Identity of DNN Model

arXiv:2405.04108v2 Announce Type: replace-cross Abstract: Recent booming development of Generative Artificial Intelligence (GenAI) has facilitated model commerc

ArXiv cs.AI 📄 Paper 2d ago

OmniHands: Towards Robust 4D Hand Mesh Recovery via A Versatile Transformer

arXiv:2405.20330v4 Announce Type: replace-cross Abstract: In this paper, we introduce OmniHands, a universal approach to recovering interactive hand meshes and

ArXiv cs.AI 📄 Paper 2d ago

animal2vec and MeerKAT: A self-supervised transformer for rare-event raw audio input and a large-scale reference dataset for bioacoustics

arXiv:2406.01253v3 Announce Type: replace-cross Abstract: Bioacoustic research, vital for understanding animal behavior, conservation, and ecology, faces a monu

ArXiv cs.AI 📄 Paper 2d ago

AdaMCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Multilingual Chain-of-Thought

arXiv:2501.16154v4 Announce Type: replace-cross Abstract: Large language models (LLMs) have shown impressive multilingual capabilities through pretraining on di

ArXiv cs.AI 📄 Paper 2d ago

RegD: Hierarchical Embeddings via Dissimilarity between Arbitrary Euclidean Regions

arXiv:2501.17518v3 Announce Type: replace-cross Abstract: Hierarchical data is common in many domains like life sciences and e-commerce, and its embeddings ofte

ArXiv cs.AI 📄 Paper 2d ago

Large Language Models are Powerful Electronic Health Record Encoders

arXiv:2502.17403v5 Announce Type: replace-cross Abstract: Electronic Health Records (EHRs) offer considerable potential for clinical prediction, but their compl

ArXiv cs.AI 📄 Paper 2d ago

Siamese Foundation Models for Crystal Structure Prediction

arXiv:2503.10471v2 Announce Type: replace-cross Abstract: Predicting crystal structures from chemical compositions is a fundamental challenge in materials disco

ArXiv cs.AI 📄 Paper 2d ago

Fine-Tuning LLMs for Report Summarization: Analysis on Supervised and Unsupervised Data

arXiv:2503.10676v2 Announce Type: replace-cross Abstract: We study the efficacy of fine-tuning Large Language Models (LLMs) for the specific task of report (gov

ArXiv cs.AI 📄 Paper 2d ago

Characterizing higher-order representations through generative diffusion models explains human decoded neurofeedback performance

arXiv:2503.14333v4 Announce Type: replace-cross Abstract: Brains construct not only "first-order" representations of the environment but also "higher-order" rep

ArXiv cs.AI 📄 Paper 2d ago

On the Mathematical Relationship Between Layer Normalization and Dynamic Activation Functions

arXiv:2503.21708v4 Announce Type: replace-cross Abstract: Layer normalization (LN) is an essential component of modern neural networks. While many alternative t

ArXiv cs.AI 📄 Paper 2d ago

On the Geometry of Receiver Operating Characteristic and Precision-Recall Curves

arXiv:2504.02169v3 Announce Type: replace-cross Abstract: We study the geometry of Receiver Operating Characteristic (ROC) and Precision-Recall (PR) curves in b

ArXiv cs.AI 📄 Paper 2d ago

Joint Flashback Adaptation for Forgetting-Resistant Instruction Tuning

arXiv:2505.15467v2 Announce Type: replace-cross Abstract: Large language models have achieved remarkable success in various tasks. However, it is challenging fo

ArXiv cs.AI 📄 Paper 2d ago

SEW: Self-Evolving Agentic Workflows for Automated Code Generation

arXiv:2505.18646v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have demonstrated effectiveness in code generation tasks. To enable LLMs

ArXiv cs.AI 📄 Paper 2d ago

Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning

arXiv:2505.19261v2 Announce Type: replace-cross Abstract: Current text-to-image diffusion generation typically employs complete-text conditioning. Due to the in

ArXiv cs.AI 📄 Paper 2d ago

SpecBranch: Speculative Decoding via Hybrid Drafting and Rollback-Aware Branch Parallelism

arXiv:2506.01979v4 Announce Type: replace-cross Abstract: Recently, speculative decoding (SD) has emerged as a promising technique to accelerate LLM inference b