📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 9,436 articles · Updated every 3 hours

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Learning to Hint for Reinforcement Learning
arXiv:2604.00698v1 Announce Type: cross Abstract: Group Relative Policy Optimization (GRPO) is widely used for reinforcement learning with verifiable rewards, b
ArXiv cs.AI 🔐 Cybersecurity 📄 Paper ⚡ AI Lesson 1mo ago
AutoEG: Exploiting Known Third-Party Vulnerabilities in Black-Box Web Applications
arXiv:2604.00704v1 Announce Type: cross Abstract: Large-scale web applications are widely deployed with complex third-party components, inheriting security risk
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
To Memorize or to Retrieve: Scaling Laws for RAG-Considerate Pretraining
arXiv:2604.00715v1 Announce Type: cross Abstract: Retrieval-augmented generation (RAG) improves language model (LM) performance by providing relevant context at
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago
GRASP: Gradient Realignment via Active Shared Perception for Multi-Agent Collaborative Optimization
arXiv:2604.00717v1 Announce Type: cross Abstract: Non-stationarity arises from concurrent policy updates and leads to persistent environmental fluctuations. Exi
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago
A CEFR-Inspired Classification Framework with Fuzzy C-Means To Automate Assessment of Programming Skills in Scratch
arXiv:2604.00730v1 Announce Type: cross Abstract: Context: Schools, training platforms, and technology firms increasingly need to assess programming proficiency
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and Stiefel QR Retraction
arXiv:2604.00733v1 Announce Type: cross Abstract: The memory wall remains the primary bottleneck for training large language models on consumer hardware. We int
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
BioCOMPASS: Integrating Biomarkers into Transformer-Based Immunotherapy Response Prediction
arXiv:2604.00739v1 Announce Type: cross Abstract: Datasets used in immunotherapy response prediction are typically small in size, as well as diverse in cancer t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
IWP: Token Pruning as Implicit Weight Pruning in Large Vision Language Models
arXiv:2604.00757v1 Announce Type: cross Abstract: Large Vision Language Models show impressive performance across image and video understanding tasks, yet their
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Thinking Wrong in Silence: Backdoor Attacks on Continuous Latent Reasoning
arXiv:2604.00770v1 Announce Type: cross Abstract: A new generation of language models reasons entirely in continuous hidden states, producing no tokens and leav
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Scalable Pretraining of Large Mixture of Experts Language Models on Aurora Super Computer
arXiv:2604.00785v1 Announce Type: cross Abstract: Pretraining Large Language Models (LLMs) from scratch requires a massive amount of compute. Aurora super compute
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Routing-Free Mixture-of-Experts
arXiv:2604.00801v1 Announce Type: cross Abstract: Standard Mixture-of-Experts (MoE) models rely on centralized routing mechanisms that introduce rigid inductive
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago
DVGT-2: Vision-Geometry-Action Model for Autonomous Driving at Scale
arXiv:2604.00813v1 Announce Type: cross Abstract: End-to-end autonomous driving has evolved from the conventional paradigm based on sparse perception into visio
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Emotion Entanglement and Bayesian Inference for Multi-Dimensional Emotion Understanding
arXiv:2604.00819v1 Announce Type: cross Abstract: Understanding emotions in natural language is inherently a multi-dimensional reasoning problem, where multiple
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies
arXiv:2604.00830v1 Announce Type: cross Abstract: Test-Time Learning (TTL) enables language agents to iteratively refine their performance through repeated inte
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
KUET at StanceNakba Shared Task: StanceMoE: Mixture-of-Experts Architecture for Stance Detection
arXiv:2604.00878v1 Announce Type: cross Abstract: Actor-level stance detection aims to determine an author's expressed position toward specific geopolitical actor
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding
arXiv:2604.00886v1 Announce Type: cross Abstract: Document understanding and GUI interaction are among the highest-value applications of Vision-Language Models
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago
Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time
arXiv:2604.00917v1 Announce Type: cross Abstract: The rise of large language models for code has reshaped software development. Autonomous coding agents, able t
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago
Representation Selection via Cross-Model Agreement using Canonical Correlation Analysis
arXiv:2604.00921v1 Announce Type: cross Abstract: Modern vision pipelines increasingly rely on pretrained image encoders whose representations are reused across
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago
Learning Quantised Structure-Preserving Motion Representations for Dance Fingerprinting
arXiv:2604.00927v1 Announce Type: cross Abstract: We present DANCEMATCH, an end-to-end framework for motion-based dance retrieval, the task of identifying seman
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
WARP: Guaranteed Inner-Layer Repair of NLP Transformers
arXiv:2604.00938v1 Announce Type: cross Abstract: Transformer-based NLP models remain vulnerable to adversarial perturbations, yet existing repair methods face