3,169 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,169 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (8687) ArXiv cs.AIForbes InnovationOpenAI NewsDev.to AIHugging Face BlogHackernoon
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Enhancing Robustness of Federated Learning via Server Learning
arXiv:2604.03226v1 Announce Type: cross Abstract: This paper explores the use of server learning for enhancing the robustness of federated learning against mali
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
WiseMind: a knowledge-guided multi-agent framework for accurate and empathetic psychiatric diagnosis
arXiv:2502.20689v3 Announce Type: replace Abstract: Large Language Models (LLMs) offer promising opportunities to support mental healthcare workflows, yet they
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Learn to Relax with Large Language Models: Solving Constraint Optimization Problems via Bidirectional Coevolution
arXiv:2509.12643v4 Announce Type: replace Abstract: Large Language Model (LLM)-based optimization has recently shown promise for autonomous problem solving, yet
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
Glia: A Human-Inspired AI for Automated Systems Design and Optimization
arXiv:2510.27176v5 Announce Type: replace Abstract: Can AI autonomously design mechanisms for computer systems on par with the creativity and reasoning of human
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents
arXiv:2511.02734v2 Announce Type: replace Abstract: Current evaluations of Large Language Model (LLM) agents primarily emphasize task completion, often overlook
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection
arXiv:2512.16300v2 Announce Type: replace Abstract: Existing image forgery detection (IFD) methods either exploit low-level, semantics-agnostic artifacts or rel
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
ClinicalReTrial: Clinical Trial Redesign with Self-Evolving Agents
arXiv:2601.00290v2 Announce Type: replace Abstract: Clinical trials constitute a critical yet exceptionally challenging and costly stage of drug development (\$
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
AgenticRed: Evolving Agentic Systems for Red-Teaming
arXiv:2601.13518v3 Announce Type: replace Abstract: While recent automated red-teaming methods show promise for systematically exposing model vulnerabilities, m
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
From Abstract to Contextual: What LLMs Still Cannot Do in Mathematics
arXiv:2601.23048v3 Announce Type: replace Abstract: Large language models now solve many benchmark math problems at near-expert levels, yet this progress has no
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
From Virtual Environments to Real-World Trials: Emerging Trends in Autonomous Driving
arXiv:2603.17714v2 Announce Type: replace Abstract: Autonomous driving technologies have achieved significant advances in recent years, yet their real-world dep
ArXiv cs.AI 🛡️ AI Safety & Ethics 📄 Paper ⚡ AI Lesson 4d ago
When AI Gets it Wrong: Reliability and Risk in AI-Assisted Medication Decision Systems
arXiv:2604.01449v2 Announce Type: replace Abstract: Artificial intelligence (AI) systems are increasingly integrated into healthcare and pharmacy workflows, sup
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
OSCAR: Orchestrated Self-verification and Cross-path Refinement
arXiv:2604.01624v2 Announce Type: replace Abstract: Diffusion language models (DLMs) expose their denoising trajectories, offering a natural handle for inferenc
ArXiv cs.AI 🛠️ AI Tools & Apps 📄 Paper ⚡ AI Lesson 4d ago
Solving the Two-dimensional single stock size Cutting Stock Problem with SAT and MaxSAT
arXiv:2604.01732v2 Announce Type: replace Abstract: Cutting rectangular items from stock sheets to satisfy demands while minimizing waste is a central manufactu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Beyond the Assistant Turn: User Turn Generation as a Probe of Interaction Awareness in Language Models
arXiv:2604.02315v2 Announce Type: replace Abstract: Standard LLM benchmarks evaluate the assistant turn: the model generates a response to an input, a verifier
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Efficient Causal Graph Discovery Using Large Language Models
arXiv:2402.01207v5 Announce Type: replace-cross Abstract: We propose a novel framework that leverages LLMs for full causal graph discovery. While previous LLM-b
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Expressive Prompting: Improving Emotion Intensity and Speaker Consistency in Zero-Shot TTS
arXiv:2409.18512v2 Announce Type: replace-cross Abstract: Recent advancements in speech synthesis have enabled large language model (LLM)-based systems to perfo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
ForgeryGPT: A Multimodal LLM for Interpretable Image Forgery Detection and Localization
arXiv:2410.10238v3 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs), such as GPT4o, have shown strong capabilities in visual reas
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
S$^4$ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack
arXiv:2410.13891v3 Announce Type: replace-cross Abstract: Transferable Targeted Attacks (TTAs) face significant challenges due to severe overfitting to surrogat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Zero-shot Concept Bottleneck Models
arXiv:2502.09018v2 Announce Type: replace-cross Abstract: Concept bottleneck models (CBMs) are inherently interpretable and intervenable neural network models,
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 4d ago
LMask: Learn to Solve Constrained Routing Problems with Lazy Masking
arXiv:2505.17938v2 Announce Type: replace-cross Abstract: Routing problems are canonical combinatorial optimization tasks with wide-ranging applications in logi