📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours

arXiv:2603.29219v1 Announce Type: cross Abstract: Sign language is the primary approach of communication for the Deaf and Hard-of-Hearing (DHH) community. While

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Derived Fields Preserve Fine-Scale Detail in Budgeted Neural Simulators

arXiv:2603.29224v1 Announce Type: cross Abstract: Fine-scale-faithful neural simulation under fixed storage budgets remains challenging. Many existing methods r

ArXiv cs.AI 📄 Paper 1w ago

Long-Document QA with Chain-of-Structured-Thought and Fine-Tuned SLMs

arXiv:2603.29232v1 Announce Type: cross Abstract: Large language models (LLMs) are widely applied to data analytics over documents, yet direct reasoning over lo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

MemRerank: Preference Memory for Personalized Product Reranking

arXiv:2603.29247v1 Announce Type: cross Abstract: LLM-based shopping agents increasingly rely on long purchase histories and multi-turn interactions for persona

ArXiv cs.AI 📄 Paper 1w ago

Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism

arXiv:2603.29252v1 Announce Type: cross Abstract: Long video understanding is a key challenge that plagues the advancement of Multimodal Large language Mo

ArXiv cs.AI 📄 Paper 1w ago

Omni-NegCLIP: Enhancing CLIP with Front-Layer Contrastive Fine-Tuning for Comprehensive Negation Understanding

arXiv:2603.29258v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) have demonstrated strong capabilities across a wide range of multimodal tasks. H

ArXiv cs.AI 📄 Paper 1w ago

Monodense Deep Neural Model for Determining Item Price Elasticity

arXiv:2603.29261v1 Announce Type: cross Abstract: Item Price Elasticity is used to quantify the responsiveness of consumer demand to changes in item prices, ena

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

PRISM: A Multi-View Multi-Capability Retail Video Dataset for Embodied Vision-Language Models

arXiv:2603.29281v1 Announce Type: cross Abstract: A critical gap exists between the general-purpose visual understanding of state-of-the-art physical AI models

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Sima AIunty: Caste Audit in LLM-Driven Matchmaking

arXiv:2603.29288v1 Announce Type: cross Abstract: Social and personal decisions in relational domains such as matchmaking are deeply entwined with cultural norm

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

Downsides of Smartness Across Edge-Cloud Continuum in Modern Industry

arXiv:2603.29289v1 Announce Type: cross Abstract: The fast pace of modern AI is rapidly transforming traditional industrial systems into vast, intelligent and p

ArXiv cs.AI 🎨 Image & Video AI 📄 Paper ⚡ AI Lesson 1w ago

MELT: Improve Composed Image Retrieval via the Modification Frequentation-Rarity Balance Network

arXiv:2603.29291v1 Announce Type: cross Abstract: Composed Image Retrieval (CIR) uses a reference image and a modification text as a query to retrieve a target

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Self-Improving Code Generation via Semantic Entropy and Behavioral Consensus

arXiv:2603.29292v1 Announce Type: cross Abstract: Improving the code generation capabilities of large language models (LLMs) typically relies on supervised fine

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

IMPASTO: Integrating Model-Based Planning with Learned Dynamics Models for Robotic Oil Painting Reproduction

arXiv:2603.29315v1 Announce Type: cross Abstract: Robotic reproduction of oil paintings using soft brushes and pigments requires force-sensitive control of defo

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

Real-Time Band-Grouped Vocal Denoising Using Sigmoid-Driven Ideal Ratio Masking

arXiv:2603.29326v1 Announce Type: cross Abstract: Real-time, deep learning-based vocal denoising has seen significant progress over the past few years, demonstr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Beyond Corner Patches: Semantics-Aware Backdoor Attack in Federated Learning

arXiv:2603.29328v1 Announce Type: cross Abstract: Backdoor attacks on federated learning (FL) are most often evaluated with synthetic corner patches or out-of-d

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

Scaling Whole-Body Human Musculoskeletal Behavior Emulation for Specificity and Diversity

arXiv:2603.29332v1 Announce Type: cross Abstract: The embodied learning of human motor control requires whole-body neuro-actuated musculoskeletal dynamics, whil

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

CIPHER: Counterfeit Image Pattern High-level Examination via Representation

arXiv:2603.29356v1 Announce Type: cross Abstract: The rapid progress of generative adversarial networks (GANs) and diffusion models has enabled the creation of

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper 1w ago

Deep Learning-Based Anomaly Detection in Spacecraft Telemetry on Edge Devices

arXiv:2603.29375v1 Announce Type: cross Abstract: Spacecraft anomaly detection is critical for mission safety, yet deploying sophisticated models on-board prese

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

PromptForge-350k: A Large-Scale Dataset and Contrastive Framework for Prompt-Based AI Image Forgery Localization

arXiv:2603.29386v1 Announce Type: cross Abstract: The rapid democratization of prompt-based AI image editing has recently exacerbated the risks associated with

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1w ago

Extend3D: Town-Scale 3D Generation

arXiv:2603.29387v1 Announce Type: cross Abstract: In this paper, we propose Extend3D, a training-free pipeline for 3D scene generation from a single image, buil

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Security in LLM-as-a-Judge: A Comprehensive SoK

arXiv:2603.29403v1 Announce Type: cross Abstract: LLM-as-a-Judge (LaaJ) is a novel paradigm in which powerful language models are used to assess the quality, sa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Hallucination-aware intermediate representation edit in large vision-language models

arXiv:2603.29405v1 Announce Type: cross Abstract: Large Vision-Language Models have demonstrated exceptional performance in multimodal reasoning and complex sce

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

Hybrid Quantum-Classical Spatiotemporal Forecasting for 3D Cloud Fields

arXiv:2603.29407v1 Announce Type: cross Abstract: Accurate forecasting of three-dimensional (3D) cloud fields is important for atmospheric analysis and short-ra

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

AGFT: Alignment-Guided Fine-Tuning for Zero-Shot Adversarial Robustness of Vision-Language Models

arXiv:2603.29410v1 Announce Type: cross Abstract: Pre-trained vision-language models (VLMs) exhibit strong zero-shot generalization but remain vulnerable to adv