3,344 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,344 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (18215) ArXiv cs.AIDev.to AIDev.to · FORUM WEBForbes InnovationMedium · ProgrammingMedium · AI
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Sparse Visual Thought Circuits in Vision-Language Models
arXiv:2603.25075v1 Announce Type: new Abstract: Sparse autoencoders (SAEs) improve interpretability in multimodal models, but it remains unclear whether SAE fea
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents
arXiv:2603.25097v1 Announce Type: new Abstract: Large Language Model based agents increasingly operate in high stakes, multi turn settings where factual groundi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
When Sensing Varies with Contexts: Context-as-Transform for Tactile Few-Shot Class-Incremental Learning
arXiv:2603.25115v1 Announce Type: new Abstract: Few-Shot Class-Incremental Learning (FSCIL) can be particularly susceptible to acquisition contexts with only a
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following
arXiv:2603.25133v1 Announce Type: new Abstract: Rubric-based evaluation has become a prevailing paradigm for evaluating instruction following in large language
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
UniAI-GraphRAG: Synergizing Ontology-Guided Extraction, Multi-Dimensional Clustering, and Dual-Channel Fusion for Robust Multi-Hop Reasoning
arXiv:2603.25152v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) systems face significant challenges in complex reasoning, multi-hop queries
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills
arXiv:2603.25158v1 Announce Type: new Abstract: Equipping Large Language Model (LLM) agents with domain-specific skills is critical for tackling complex tasks.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
The Competence Shadow: Theory and Bounds of AI Assistance in Safety Engineering
arXiv:2603.25197v1 Announce Type: new Abstract: As AI assistants become integrated into safety engineering workflows for Physical AI systems, a critical questio
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Probabilistic Abstract Interpretation on Neural Networks via Grids Approximation
arXiv:2603.25266v1 Announce Type: new Abstract: Probabilistic abstract interpretation is a theory used to extract particular properties of a computer program wh
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4w ago
Distribution and Clusters Approximations as Abstract Domains in Probabilistic Abstract Interpretation to Neural Network Analysis
arXiv:2603.25273v1 Announce Type: new Abstract: The probabilistic abstract interpretation framework of neural network analysis analyzes a neural network by anal
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4w ago
A Gait Foundation Model Predicts Multi-System Health Phenotypes from 3D Skeletal Motion
arXiv:2603.25283v1 Announce Type: new Abstract: Gait is increasingly recognized as a vital sign, yet current approaches treat it as a symptom of specific pathol
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
SliderQuant: Accurate Post-Training Quantization for LLMs
arXiv:2603.25284v1 Announce Type: new Abstract: In this paper, we address post-training quantization (PTQ) for large language models (LLMs) from an overlooked p
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4w ago
DAGverse: Building Document-Grounded Semantic DAGs from Scientific Papers
arXiv:2603.25293v1 Announce Type: new Abstract: Directed Acyclic Graphs (DAGs) are widely used to represent structured knowledge in scientific and technical dom
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Evaluating Language Models for Harmful Manipulation
arXiv:2603.25326v1 Announce Type: new Abstract: Interest in the concept of AI-driven harmful manipulation is growing, yet current approaches to evaluating it ar
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Macroscopic Characteristics of Mixed Traffic Flow with Deep Reinforcement Learning Based Automated and Human-Driven Vehicles
arXiv:2603.25328v1 Announce Type: new Abstract: Automated Vehicle (AV) control in mixed traffic, where AVs coexist with human-driven vehicles, poses significant
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Agentic Trust Coordination for Federated Learning through Adaptive Thresholding and Autonomous Decision Making in Sustainable and Resilient Industrial Networks
arXiv:2603.25334v1 Announce Type: new Abstract: Distributed intelligence in industrial networks increasingly integrates sensing, communication, and computation
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
4OPS: Structural Difficulty Modeling in Integer Arithmetic Puzzles
arXiv:2603.25356v1 Announce Type: new Abstract: Arithmetic puzzle games provide a controlled setting for studying difficulty in mathematical reasoning tasks, a
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4w ago
Does Structured Intent Representation Generalize? A Cross-Language, Cross-Model Empirical Study of 5W3H Prompting
arXiv:2603.25379v1 Announce Type: new Abstract: Does structured intent representation generalize across languages and models? We study PPS (Prompt Protocol Spec
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models
arXiv:2603.25412v1 Announce Type: new Abstract: Large language models (LLMs) increasingly rely on explicit chain-of-thought (CoT) reasoning to solve complex tas
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation
arXiv:2603.25415v1 Announce Type: new Abstract: Semantic world models enable embodied agents to reason about objects, relations, and spatial context beyond pure
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4w ago
Cross-Model Disagreement as a Label-Free Correctness Signal
arXiv:2603.25450v1 Announce Type: new Abstract: Detecting when a language model is wrong without ground truth labels is a fundamental challenge for safe deploym