7,014 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 7,014 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (19706) ArXiv cs.AIDev.to AIForbes InnovationDev.to · FORUM WEBMedium · ProgrammingMedium · AI
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Advancing AI Research Assistants with Expert-Involved Learning
arXiv:2505.04638v5 Announce Type: replace Abstract: Large language models (LLMs) and large multimodal models (LMMs) promise to accelerate biomedical discovery,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Beyond Syntax: Action Semantics Learning for App Agents
arXiv:2506.17697v3 Announce Type: replace Abstract: The recent development of Large Language Models (LLMs) enables the rise of App agents that interpret user in
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
URSA: The Universal Research and Scientific Agent
arXiv:2506.22653v2 Announce Type: replace Abstract: Large language models (LLMs) have moved far beyond their initial form as simple chatbots, now carrying out c
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
MedGemma Technical Report
arXiv:2507.05201v4 Announce Type: replace Abstract: Artificial intelligence (AI) has significant potential in healthcare applications, but its training and depl
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
Modelling Cascading Physical Climate Risk in Supply Chains with Adaptive Firms: A Spatial Agent-Based Framework
arXiv:2509.18633v4 Announce Type: replace Abstract: We present an open-source Python framework for modelling cascading physical climate risk in a spatial supply
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Multiplayer Nash Preference Optimization
arXiv:2509.23102v3 Announce Type: replace Abstract: Reinforcement learning from human feedback (RLHF) has emerged as the standard paradigm for aligning large la
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
arXiv:2509.25454v4 Announce Type: replace Abstract: Although RLVR has become an essential component for developing advanced reasoning skills in language models,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Hypothesis-Driven Feature Manifold Analysis in LLMs via Supervised Multi-Dimensional Scaling
arXiv:2510.01025v2 Announce Type: replace Abstract: The linear representation hypothesis states that language models (LMs) encode concepts as directions in thei
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
TS-Agent: Understanding and Reasoning Over Raw Time Series via Iterative Insight Gathering
arXiv:2510.07432v2 Announce Type: replace Abstract: Large language models (LLMs) exhibit strong symbolic and compositional reasoning, yet they struggle with tim
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
DRIFT: Decompose, Retrieve, Illustrate, then Formalize Theorems
arXiv:2510.10815v4 Announce Type: replace Abstract: Automating the formalization of mathematical statements for theorem proving remains a major challenge for La
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Toward Virtuous Reinforcement Learning: A Critique and Roadmap
arXiv:2512.04246v2 Announce Type: replace Abstract: This paper critiques common patterns in machine ethics for Reinforcement Learning (RL) and argues for a virt
ArXiv cs.AI 🛡️ AI Safety & Ethics 📄 Paper ⚡ AI Lesson 2w ago
Robust AI Security and Alignment: A Sisyphean Endeavor?
arXiv:2512.10100v2 Announce Type: replace Abstract: This manuscript establishes information-theoretic limitations for robustness of AI security and alignment by
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
EchoTrail-GUI: Building Actionable Memory for GUI Agents via Critic-Guided Self-Exploration
arXiv:2512.19396v2 Announce Type: replace Abstract: Contemporary GUI agents, while increasingly capable due to advances in Large Vision-Language Models (VLMs),
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
RL-VLA$^3$: A Flexible and Asynchronous Reinforcement Learning Framework for VLA Training
arXiv:2602.05765v2 Announce Type: replace Abstract: Reinforcement learning (RL) has emerged as a critical paradigm for post-training Vision-Language-Action (VLA
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Emergent Introspection in AI is Content-Agnostic
arXiv:2603.05414v2 Announce Type: replace Abstract: Introspection is a foundational cognitive ability, but its mechanism is not well understood. Recent work has
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling
arXiv:2603.21357v2 Announce Type: replace Abstract: LLM agents fail on the majority of real-world tasks -- GPT-4o succeeds on fewer than 15% of WebArena navigat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement
arXiv:2604.01591v2 Announce Type: replace Abstract: We introduce ThinkTwice, a simple two-phase framework that jointly optimizes LLMs to solve reasoning problem
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models
arXiv:2407.14971v3 Announce Type: replace-cross Abstract: Vision-Language Models (VLMs) rely heavily on pretrained vision encoders to support downstream tasks s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
TransAgent: Enhancing LLM-Based Code Translation via Fine-Grained Execution Alignment
arXiv:2409.19894v5 Announce Type: replace-cross Abstract: Code translation transforms code between programming languages while preserving functionality, which i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Cobblestone: A Divide-and-Conquer Approach for Automating Formal Verification
arXiv:2410.19940v4 Announce Type: replace-cross Abstract: Formal verification using proof assistants, such as Coq, is an effective way of improving software qua