📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 8,253 articles · Updated every 3 hours · View all reads
All
⚡ AI Lessons (21843)
ArXiv cs.AIDev.to AIMedium · AIMedium · ProgrammingForbes InnovationMedium · Machine Learning
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Similarity Field Theory: A Mathematical Framework for Intelligence
arXiv:2509.18218v5 Announce Type: replace Abstract: We posit that transforming similarity relations form the structural basis of comprehensible dynamic systems.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics
arXiv:2510.09901v2 Announce Type: replace Abstract: Computing has long served as a cornerstone of scientific discovery. Recently, a paradigm shift has emerged w
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3w ago
Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges
arXiv:2510.23883v3 Announce Type: replace Abstract: Agentic AI systems powered by large language models (LLMs) and endowed with planning, tool use, memory, and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
PRISM: Prompt-Refined In-Context System Modelling for Financial Retrieval
arXiv:2511.14130v2 Announce Type: replace Abstract: With the rapid progress of large language models (LLMs), financial information retrieval has become a critic
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3w ago
An Agent-Based Framework for the Automatic Validation of Mathematical Optimization Models
arXiv:2511.16383v2 Announce Type: replace Abstract: Recently, using Large Language Models (LLMs) to generate optimization models from natural language descripti
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3w ago
Agile Deliberation: Concept Deliberation for Subjective Visual Classification
arXiv:2512.10821v2 Announce Type: replace Abstract: From content moderation to content curation, applications requiring vision classifiers for visual concepts a
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3w ago
Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows
arXiv:2512.13168v4 Announce Type: replace Abstract: We introduce FinWorkBench (a.k.a. Finch), a benchmark for evaluating agents on real-world, enterprise-grade
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
The Drill-Down and Fabricate Test (DDFT): A Protocol for Measuring Epistemic Robustness in Language Models
arXiv:2512.23850v2 Announce Type: replace Abstract: Current language model evaluations measure what models know under ideal conditions but not how robustly they
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3w ago
HAG: Hierarchical Demographic Tree-based Agent Generation for Topic-Adaptive Simulation
arXiv:2601.05656v3 Announce Type: replace Abstract: High-fidelity agent initialization is crucial for credible Agent-Based Modeling across diverse domains. A ro
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Circuit Mechanisms for Spatial Relation Generation in Diffusion Transformers
arXiv:2601.06338v2 Announce Type: replace Abstract: Diffusion Transformers (DiTs) have greatly advanced text-to-image generation, but models still struggle to g
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
ConvoLearn: A Dataset for Fine-Tuning Dialogic AI Tutors
arXiv:2601.08950v2 Announce Type: replace Abstract: Despite their growing adoption in education, LLMs remain misaligned with the core principle of effective tut
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
The Paradox of Robustness: Decoupling Rule-Based Logic from Affective Noise in High-Stakes Decision-Making
arXiv:2601.21439v2 Announce Type: replace Abstract: While Large Language Models (LLMs) are widely documented to be sensitive to minor prompt perturbations and p
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
TSPO: Breaking the Double Homogenization Dilemma in Multi-turn Search Policy Optimization
arXiv:2601.22776v2 Announce Type: replace Abstract: Multi-turn tool-integrated reasoning enables Large Language Models (LLMs) to solve complex tasks through ite
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Enhancing Foundation VLM Robustness to Missing Modality: Scalable Diffusion for Bi-directional Feature Restoration
arXiv:2602.03151v2 Announce Type: replace Abstract: Vision Language Model (VLM) typically assume complete modality input during inference. However, their effect
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery
arXiv:2602.07943v2 Announce Type: replace Abstract: In the presence of confounding between an endogenous variable and the outcome, instrumental variables (IVs)
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3w ago
Voxtral Realtime
arXiv:2602.11298v3 Announce Type: replace Abstract: We introduce Voxtral Realtime, a natively streaming automatic speech recognition model that matches offline
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning
arXiv:2602.13218v2 Announce Type: replace Abstract: Reinforcement Learning from Verifiable Rewards (RLVR) is bottlenecked by data: existing synthesis pipelines
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
KLong: Training LLM Agent for Extremely Long-horizon Tasks
arXiv:2602.17547v2 Announce Type: replace Abstract: This paper introduces KLong, an open-source LLM agent trained to solve extremely long-horizon tasks. The pri
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3w ago
AI Runtime Infrastructure
arXiv:2603.00495v2 Announce Type: replace Abstract: We introduce AI Runtime Infrastructure, a distinct execution-time layer that operates above the model and be
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
DeepFact: Co-Evolving Benchmarks and Agents for Deep Research Factuality
arXiv:2603.05912v2 Announce Type: replace Abstract: Search-augmented LLM agents can produce deep research reports (DRRs), but verifying claim-level factuality r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation
arXiv:2603.08388v4 Announce Type: replace Abstract: We propose a Hierarchical Error-Corrective Graph FrameworkforAutonomousAgentswithLLM-BasedActionGeneration(H
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Collective AI can amplify tiny perturbations into divergent decisions
arXiv:2603.09127v2 Announce Type: replace Abstract: Large language models are increasingly deployed not as single assistants but as committees whose members del
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3w ago
A Self-Evolving Defect Detection Framework for Industrial Photovoltaic Systems
arXiv:2603.14869v2 Announce Type: replace Abstract: Reliable photovoltaic (PV) power generation requires timely detection of module defects that may reduce ener
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
3w ago
Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometric and Neuromorphic AI
arXiv:2603.18104v2 Announce Type: replace Abstract: Prevailing AI training infrastructure assumes reverse-mode automatic differentiation over IEEE-754 arithmeti
DeepCamp AI