7,966 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 7,966 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (20947) ArXiv cs.AIDev.to AIForbes InnovationMedium · AIMedium · ProgrammingMedium · Cybersecurity
ArXiv cs.AI 📄 Paper 2w ago
ADD for Multi-Bit Image Watermarking
arXiv:2604.11491v1 Announce Type: cross Abstract: As generative models enable rapid creation of high-fidelity images, societal concerns about misinformation and
ArXiv cs.AI 📄 Paper 2w ago
Quantization Dominates Rank Reduction for KV-Cache Compression
arXiv:2604.11501v1 Announce Type: cross Abstract: We compare two strategies for compressing the KV cache in transformer inference: rank reduction (discard dimen
ArXiv cs.AI 📄 Paper 2w ago
METER: Evaluating Multi-Level Contextual Causal Reasoning in Large Language Models
arXiv:2604.11502v1 Announce Type: cross Abstract: Contextual causal reasoning is a critical yet challenging capability for Large Language Models (LLMs). Existin
ArXiv cs.AI 📄 Paper 2w ago
Deep Learning for Sequential Decision Making under Uncertainty: Foundations, Frameworks, and Frontiers
arXiv:2604.11507v1 Announce Type: cross Abstract: Artificial intelligence (AI) is moving increasingly beyond prediction to support decisions in complex, uncerta
ArXiv cs.AI 📄 Paper 2w ago
Not All Forgetting Is Equal: Architecture-Dependent Retention Dynamics in Fine-Tuned Image Classifiers
arXiv:2604.11508v1 Announce Type: cross Abstract: Fine-tuning pretrained image classifiers is standard practice, yet which individual samples are forgotten duri
ArXiv cs.AI 📄 Paper 2w ago
Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization
arXiv:2604.11510v1 Announce Type: cross Abstract: To encourage diverse exploration in reinforcement learning (RL) for large language models (LLMs) without compr
ArXiv cs.AI 📄 Paper 2w ago
EdgeCIM: A Hardware-Software Co-Design for CIM-Based Acceleration of Small Language Models
arXiv:2604.11512v1 Announce Type: cross Abstract: The growing demand for deploying Small Language Models (SLMs) on edge devices, including laptops, smartphones,
ArXiv cs.AI 📄 Paper 2w ago
From Translation to Superset: Benchmark-Driven Evolution of a Production AI Agent from Rust to Python
arXiv:2604.11518v1 Announce Type: cross Abstract: Cross-language migration of large software systems is a persistent engineering challenge, particularly when th
ArXiv cs.AI 📄 Paper 2w ago
SVD-Prune: Training-Free Token Pruning For Efficient Vision-Language Models
arXiv:2604.11530v1 Announce Type: cross Abstract: Vision-Language Models (VLM) have revolutionized multimodal learning by jointly processing visual and textual
ArXiv cs.AI 📄 Paper 2w ago
CLAY: Conditional Visual Similarity Modulation in Vision-Language Embedding Space
arXiv:2604.11539v1 Announce Type: cross Abstract: Human perception of visual similarity is inherently adaptive and subjective, depending on the users' interests
ArXiv cs.AI 📄 Paper 2w ago
NovBench: Evaluating Large Language Models on Academic Paper Novelty Assessment
arXiv:2604.11543v1 Announce Type: cross Abstract: Novelty is a core requirement in academic publishing and a central focus of peer review, yet the growing volum
ArXiv cs.AI 📄 Paper 2w ago
Time is Not a Label: Continuous Phase Rotation for Temporal Knowledge Graphs and Agentic Memory
arXiv:2604.11544v1 Announce Type: cross Abstract: Structured memory representations such as knowledge graphs are central to autonomous agents and other long-liv
ArXiv cs.AI 📄 Paper 2w ago
FM-Agent: Scaling Formal Methods to Large Systems via LLM-Based Hoare-Style Reasoning
arXiv:2604.11556v1 Announce Type: cross Abstract: LLM-assisted software development has become increasingly prevalent, and can generate large-scale systems, suc
ArXiv cs.AI 📄 Paper 2w ago
bacpipe: a Python package to make bioacoustic deep learning models accessible
arXiv:2604.11560v1 Announce Type: cross Abstract: 1. Natural sounds have been recorded for millions of hours over the previous decades using passive acoustic mo
ArXiv cs.AI 📄 Paper 2w ago
Synthius-Mem: Brain-Inspired Hallucination-Resistant Persona Memory Achieving 94.4% Memory Accuracy and 99.6% Adversarial Robustness on LoCoMo
arXiv:2604.11563v1 Announce Type: cross Abstract: Providing AI agents with reliable long-term memory that does not hallucinate remains an open problem. Current
ArXiv cs.AI 📄 Paper 2w ago
Minimizing classical resources in variational measurement-based quantum computation for generative modeling
arXiv:2604.11578v1 Announce Type: cross Abstract: Measurement-based quantum computation (MBQC) is a framework for quantum information processing in which a comp
ArXiv cs.AI 📄 Paper 2w ago
A Triadic Suffix Tokenization Scheme for Numerical Reasoning
arXiv:2604.11582v1 Announce Type: cross Abstract: Standard subword tokenization methods fragment numbers inconsistently, causing large language models (LLMs) to
ArXiv cs.AI 📄 Paper 2w ago
Layerwise Dynamics for In-Context Classification in Transformers
arXiv:2604.11613v1 Announce Type: cross Abstract: Transformers can perform in-context classification from a few labeled examples, yet the inference-time algorit
ArXiv cs.AI 📄 Paper 2w ago
CUTEv2: Unified and Configurable Matrix Extension for Diverse CPU Architectures with Minimal Design Overhead
arXiv:2604.11615v1 Announce Type: cross Abstract: Matrix extensions have emerged as an essential feature in modern CPUs to address the surging demands of AI wor
ArXiv cs.AI 📄 Paper 2w ago
SCNO: Spiking Compositional Neural Operator -- Towards a Neuromorphic Foundation Model for Nuclear PDE Solving
arXiv:2604.11625v1 Announce Type: cross Abstract: Neural operators have emerged as powerful surrogates for partial differential equation (PDE) solvers, yet they