📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 9,436 articles · Updated every 3 hours · View all reads

arXiv:2604.00590v1 Announce Type: cross Abstract: In recent years, the scaling laws of recommendation models have attracted increasing attention, which govern t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Streaming Model Cascades for Semantic SQL

arXiv:2604.00660v1 Announce Type: cross Abstract: Modern data warehouses extend SQL with semantic operators that invoke large language models on each qualifying

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago

Procela: Epistemic Governance in Mechanistic Simulations Under Structural Uncertainty

arXiv:2604.00675v1 Announce Type: cross Abstract: Mechanistic simulations typically assume fixed ontologies: variables, causal relationships, and resolution pol

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago

Internal APIs Are All You Need: Shadow APIs, Shared Discovery, and the Case Against Browser-First Agent Architectures

arXiv:2604.00694v1 Announce Type: cross Abstract: Autonomous agents increasingly interact with the web, yet most websites remain designed for human browsers --

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Learning to Hint for Reinforcement Learning

arXiv:2604.00698v1 Announce Type: cross Abstract: Group Relative Policy Optimization (GRPO) is widely used for reinforcement learning with verifiable rewards, b

ArXiv cs.AI 🔐 Cybersecurity 📄 Paper ⚡ AI Lesson 1mo ago

AutoEG: Exploiting Known Third-Party Vulnerabilities in Black-Box Web Applications

arXiv:2604.00704v1 Announce Type: cross Abstract: Large-scale web applications are widely deployed with complex third-party components, inheriting security risk

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

To Memorize or to Retrieve: Scaling Laws for RAG-Considerate Pretraining

arXiv:2604.00715v1 Announce Type: cross Abstract: Retrieval-augmented generation (RAG) improves language model (LM) performance by providing relevant context at

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago

GRASP: Gradient Realignment via Active Shared Perception for Multi-Agent Collaborative Optimization

arXiv:2604.00717v1 Announce Type: cross Abstract: Non-stationarity arises from concurrent policy updates and leads to persistent environmental fluctuations. Exi

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago

A CEFR-Inspired Classification Framework with Fuzzy C-Means To Automate Assessment of Programming Skills in Scratch

arXiv:2604.00730v1 Announce Type: cross Abstract: Context: Schools, training platforms, and technology firms increasingly need to assess programming proficiency

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and Stiefel QR Retraction

arXiv:2604.00733v1 Announce Type: cross Abstract: The memory wall remains the primary bottleneck for training large language models on consumer hardware. We int

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

BioCOMPASS: Integrating Biomarkers into Transformer-Based Immunotherapy Response Prediction

arXiv:2604.00739v1 Announce Type: cross Abstract: Datasets used in immunotherapy response prediction are typically small in size, as well as diverse in cancer t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

IWP: Token Pruning as Implicit Weight Pruning in Large Vision Language Models

arXiv:2604.00757v1 Announce Type: cross Abstract: Large Vision Language Models show impressive performance across image and video understanding tasks, yet their

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Thinking Wrong in Silence: Backdoor Attacks on Continuous Latent Reasoning

arXiv:2604.00770v1 Announce Type: cross Abstract: A new generation of language models reasons entirely in continuous hidden states, producing no tokens and leav

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Scalable Pretraining of Large Mixture of Experts Language Models on Aurora Super Computer

arXiv:2604.00785v1 Announce Type: cross Abstract: Pretraining Large Language Models (LLMs) from scratch requires massive amount of compute. Aurora super compute

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Routing-Free Mixture-of-Experts

arXiv:2604.00801v1 Announce Type: cross Abstract: Standard Mixture-of-Experts (MoE) models rely on centralized routing mechanisms that introduce rigid inductive

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago

DVGT-2: Vision-Geometry-Action Model for Autonomous Driving at Scale

arXiv:2604.00813v1 Announce Type: cross Abstract: End-to-end autonomous driving has evolved from the conventional paradigm based on sparse perception into visio

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Emotion Entanglement and Bayesian Inference for Multi-Dimensional Emotion Understanding

arXiv:2604.00819v1 Announce Type: cross Abstract: Understanding emotions in natural language is inherently a multi-dimensional reasoning problem, where multiple

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

arXiv:2604.00830v1 Announce Type: cross Abstract: Test-Time Learning (TTL) enables language agents to iteratively refine their performance through repeated inte

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

KUET at StanceNakba Shared Task: StanceMoE: Mixture-of-Experts Architecture for Stance Detection

arXiv:2604.00878v1 Announce Type: cross Abstract: Actor-level stance detection aims to determine an author expressed position toward specific geopolitical actor

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding

arXiv:2604.00886v1 Announce Type: cross Abstract: Document understanding and GUI interaction are among the highest-value applications of Vision-Language Models

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago

Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time

arXiv:2604.00917v1 Announce Type: cross Abstract: The rise of large language models for code has reshaped software development. Autonomous coding agents, able t

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago

Representation Selection via Cross-Model Agreement using Canonical Correlation Analysis

arXiv:2604.00921v1 Announce Type: cross Abstract: Modern vision pipelines increasingly rely on pretrained image encoders whose representations are reused across

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago

Learning Quantised Structure-Preserving Motion Representations for Dance Fingerprinting

arXiv:2604.00927v1 Announce Type: cross Abstract: We present DANCEMATCH, an end-to-end framework for motion-based dance retrieval, the task of identifying seman

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

WARP: Guaranteed Inner-Layer Repair of NLP Transformers

arXiv:2604.00938v1 Announce Type: cross Abstract: Transformer-based NLP models remain vulnerable to adversarial perturbations, yet existing repair methods face