📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours · View all news

arXiv:2603.28813v1 Announce Type: cross Abstract: In multi-agent debate (MAD) systems, performance gains are often reported; however, because the debate protoco

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

SkillTester: Benchmarking Utility and Security of Agent Skills

arXiv:2603.28815v1 Announce Type: cross Abstract: This technical report presents SkillTester, a tool for evaluating the utility and security of agent skills. It

ArXiv cs.AI 🛠️ AI Tools & Apps 📄 Paper ⚡ AI Lesson 1w ago

ARTLAS: Mapping Art-Technology Institutions via Conceptual Axes, Text Embeddings, and Unsupervised Clustering

arXiv:2603.28816v1 Announce Type: cross Abstract: The global landscape of art-technology institutions, including festivals, biennials, research labs, conference

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

GUARD-SLM: Token Activation-Based Defense Against Jailbreak Attacks for Small Language Models

arXiv:2603.28817v1 Announce Type: cross Abstract: Small Language Models (SLMs) are emerging as efficient and economically viable alternatives to Large Language

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago

Time is Not Compute: Scaling Laws for Wall-Clock Constrained Training on Consumer GPUs

arXiv:2603.28823v1 Announce Type: cross Abstract: Scaling laws relate model quality to compute budget (FLOPs), but practitioners face wall-clock time constraint

ArXiv cs.AI 🛡️ AI Safety & Ethics 📄 Paper ⚡ AI Lesson 1w ago

SNEAKDOOR: Stealthy Backdoor Attacks against Distribution Matching-based Dataset Condensation

arXiv:2603.28824v1 Announce Type: cross Abstract: Dataset condensation aims to synthesize compact yet informative datasets that retain the training efficacy of

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

Incentives, Equilibria, and the Limits of Healthcare AI: A Game-Theoretic Perspective

arXiv:2603.28825v1 Announce Type: cross Abstract: Artificial intelligence (AI) is widely promoted as a promising technological response to healthcare capacity a

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

GMA-SAWGAN-GP: A Novel Data Generative Framework to Enhance IDS Detection Performance

arXiv:2603.28838v1 Announce Type: cross Abstract: Intrusion Detection System (IDS) is often calibrated to known attacks and generalizes poorly to unknown threat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

OneComp: One-Line Revolution for Generative AI Model Compression

arXiv:2603.28845v1 Announce Type: cross Abstract: Deploying foundation models is increasingly constrained by memory footprint, latency, and hardware costs. Post

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

OptiMer: Optimal Distribution Vector Merging Is Better than Data Mixing for Continual Pre-Training

arXiv:2603.28858v1 Announce Type: cross Abstract: Continual pre-training is widely used to adapt LLMs to target languages and domains, yet the mixture ratio of

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

OccSim: Multi-kilometer Simulation with Long-horizon Occupancy World Models

arXiv:2603.28887v1 Announce Type: cross Abstract: Data-driven autonomous driving simulation has long been constrained by its heavy reliance on pre-recorded driv

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

Robust Multi-Agent Reinforcement Learning for Small UAS Separation Assurance under GPS Degradation and Spoofing

arXiv:2603.28900v1 Announce Type: cross Abstract: We address robust separation assurance for small Unmanned Aircraft Systems (sUAS) under GPS degradation and sp

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Beta-Scheduling: Momentum from Critical Damping as a Diagnostic and Correction Tool for Neural Network Training

arXiv:2603.28921v1 Announce Type: cross Abstract: Standard neural network training uses constant momentum (typically 0.9), a convention dating to 1964 with limi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Theory of Mind and Self-Attributions of Mentality are Dissociable in LLMs

arXiv:2603.28925v1 Announce Type: cross Abstract: Safety fine-tuning in Large Language Models (LLMs) seeks to suppress potentially harmful forms of mind-attribu

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago

Differentiable Initialization-Accelerated CPU-GPU Hybrid Combinatorial Scheduling

arXiv:2603.28943v1 Announce Type: cross Abstract: This paper presents a hybrid CPU-GPU framework for solving combinatorial scheduling problems formulated as Int

ArXiv cs.AI 🧠 Large Language Models 📄 Paper 1w ago

Multi-Agent LLMs for Adaptive Acquisition in Bayesian Optimization

arXiv:2603.28959v1 Announce Type: cross Abstract: The exploration-exploitation trade-off is central to sequential decision-making and black-box optimization, ye

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

AutoWorld: Scaling Multi-Agent Traffic Simulation with Self-Supervised World Models

arXiv:2603.28963v1 Announce Type: cross Abstract: Multi-agent traffic simulation is central to developing and testing autonomous driving systems. Recent data-dr

ArXiv cs.AI 📄 Paper 1w ago

The Spectral Edge Thesis: A Mathematical Framework for Intra-Signal Phase Transitions in Neural Network Training

arXiv:2603.28964v1 Announce Type: cross Abstract: We develop the spectral edge thesis: phase transitions in neural network training -- grokking, capability gain

ArXiv cs.AI 🧠 Large Language Models 📄 Paper 1w ago

Privacy Guard & Token Parsimony by Prompt and Context Handling and LLM Routing

arXiv:2603.28972v1 Announce Type: cross Abstract: The large-scale adoption of Large Language Models (LLMs) forces a trade-off between operational cost (OpEx) an

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper 1w ago

Design Principles for the Construction of a Benchmark Evaluating Security Operation Capabilities of Multi-agent AI Systems

arXiv:2603.28998v1 Announce Type: cross Abstract: As Large Language Models (LLMs) and multi-agent AI systems are demonstrating increasing potential in cybersecu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper 1w ago

Understand and Accelerate Memory Processing Pipeline for Disaggregated LLM Inference

arXiv:2603.29002v1 Announce Type: cross Abstract: Modern large language models (LLMs) increasingly depends on efficient long-context processing and generation m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Improving Efficiency of GPU Kernel Optimization Agents using a Domain-Specific Language and Speed-of-Light Guidance

arXiv:2603.29010v1 Announce Type: cross Abstract: Optimizing GPU kernels with LLM agents is an iterative process over a large design space. Every candidate must

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Human-Like Lifelong Memory: A Neuroscience-Grounded Architecture for Infinite Interaction

arXiv:2603.29023v1 Announce Type: cross Abstract: Large language models lack persistent, structured memory for long-term interaction and context-sensitive retri

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning

arXiv:2603.29025v1 Announce Type: cross Abstract: Large language models systematically fail when a salient surface cue conflicts with an unstated feasibility co