3,169 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,169 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (8687) ArXiv cs.AIForbes InnovationOpenAI NewsDev.to AIHugging Face BlogHackernoon
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Polysemanticity or Polysemy? Lexical Identity Confounds Superposition Metrics
arXiv:2604.00443v1 Announce Type: cross Abstract: If the same neuron activates for both "lender" and "riverside," standard metrics attribute the overlap to supe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
First Logit Boosting: Visual Grounding Method to Mitigate Object Hallucination in Large Vision-Language Models
arXiv:2604.00455v1 Announce Type: cross Abstract: Recent Large Vision-Language Models (LVLMs) have demonstrated remarkable performance across various multimodal
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
Not My Truce: Personality Differences in AI-Mediated Workplace Negotiation
arXiv:2604.00464v1 Announce Type: cross Abstract: AI-driven conversational coaching is increasingly used to support workplace negotiation, yet prior work assume
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Executing as You Generate: Hiding Execution Latency in LLM Code Generation
arXiv:2604.00491v1 Announce Type: cross Abstract: Current LLM-based coding agents follow a serial execution paradigm: the model first generates the complete cod
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
A Reasoning-Enabled Vision-Language Foundation Model for Chest X-ray Interpretation
arXiv:2604.00493v1 Announce Type: cross Abstract: Chest X-rays (CXRs) are among the most frequently performed imaging examinations worldwide, yet rising imaging
ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago
Towards Initialization-dependent and Non-vacuous Generalization Bounds for Overparameterized Shallow Neural Networks
arXiv:2604.00505v1 Announce Type: cross Abstract: Overparameterized neural networks often show a benign overfitting property in the sense of achieving excellent
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
MOON3.0: Reasoning-aware Multimodal Representation Learning for E-commerce Product Understanding
arXiv:2604.00513v1 Announce Type: cross Abstract: With the rapid growth of e-commerce, exploring general representations rather than task-specific ones has attr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
MAESIL: Masked Autoencoder for Enhanced Self-supervised Medical Image Learning
arXiv:2604.00514v1 Announce Type: cross Abstract: Training deep learning models for three-dimensional (3D) medical imaging, such as Computed Tomography (CT), is
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
Toward Optimal Sampling Rate Selection and Unbiased Classification for Precise Animal Activity Recognition
arXiv:2604.00517v1 Announce Type: cross Abstract: With the rapid advancements in deep learning techniques, wearable sensor-aided animal activity recognition (AA
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding
arXiv:2604.00528v1 Announce Type: cross Abstract: 3D Visual Grounding (3D-VG) aims to localize objects in 3D scenes via natural language descriptions. While rec
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Optimsyn: Influence-Guided Rubrics Optimization for Synthetic Data Generation
arXiv:2604.00536v1 Announce Type: cross Abstract: Large language models (LLMs) achieve strong downstream performance largely due to abundant supervised fine-tun
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
MATHENA: Mamba-based Architectural Tooth Hierarchical Estimator and Holistic Evaluation Network for Anatomy
arXiv:2604.00537v1 Announce Type: cross Abstract: Dental diagnosis from Orthopantomograms (OPGs) requires coordination of tooth detection, caries segmentation (
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
HabitatAgent: An End-to-End Multi-Agent System for Housing Consultation
arXiv:2604.00556v1 Announce Type: cross Abstract: Housing selection is a high-stakes and largely irreversible decision problem. We study housing consultation as
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
UniMixer: A Unified Architecture for Scaling Laws in Recommendation Systems
arXiv:2604.00590v1 Announce Type: cross Abstract: In recent years, the scaling laws of recommendation models have attracted increasing attention, which govern t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Streaming Model Cascades for Semantic SQL
arXiv:2604.00660v1 Announce Type: cross Abstract: Modern data warehouses extend SQL with semantic operators that invoke large language models on each qualifying
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
Procela: Epistemic Governance in Mechanistic Simulations Under Structural Uncertainty
arXiv:2604.00675v1 Announce Type: cross Abstract: Mechanistic simulations typically assume fixed ontologies: variables, causal relationships, and resolution pol
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
Internal APIs Are All You Need: Shadow APIs, Shared Discovery, and the Case Against Browser-First Agent Architectures
arXiv:2604.00694v1 Announce Type: cross Abstract: Autonomous agents increasingly interact with the web, yet most websites remain designed for human browsers --
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Learning to Hint for Reinforcement Learning
arXiv:2604.00698v1 Announce Type: cross Abstract: Group Relative Policy Optimization (GRPO) is widely used for reinforcement learning with verifiable rewards, b
ArXiv cs.AI 🔐 Cybersecurity 📄 Paper ⚡ AI Lesson 1w ago
AutoEG: Exploiting Known Third-Party Vulnerabilities in Black-Box Web Applications
arXiv:2604.00704v1 Announce Type: cross Abstract: Large-scale web applications are widely deployed with complex third-party components, inheriting security risk
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
To Memorize or to Retrieve: Scaling Laws for RAG-Considerate Pretraining
arXiv:2604.00715v1 Announce Type: cross Abstract: Retrieval-augmented generation (RAG) improves language model (LM) performance by providing relevant context at