📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours · View all news
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
SyriSign: A Parallel Corpus for Arabic Text to Syrian Arabic Sign Language Translation
arXiv:2603.29219v1 Announce Type: cross Abstract: Sign language is the primary approach of communication for the Deaf and Hard-of-Hearing (DHH) community. While
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Derived Fields Preserve Fine-Scale Detail in Budgeted Neural Simulators
arXiv:2603.29224v1 Announce Type: cross Abstract: Fine-scale-faithful neural simulation under fixed storage budgets remains challenging. Many existing methods r
ArXiv cs.AI
📄 Paper
1w ago
Long-Document QA with Chain-of-Structured-Thought and Fine-Tuned SLMs
arXiv:2603.29232v1 Announce Type: cross Abstract: Large language models (LLMs) are widely applied to data analytics over documents, yet direct reasoning over lo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
MemRerank: Preference Memory for Personalized Product Reranking
arXiv:2603.29247v1 Announce Type: cross Abstract: LLM-based shopping agents increasingly rely on long purchase histories and multi-turn interactions for persona
ArXiv cs.AI
📄 Paper
1w ago
Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism
arXiv:2603.29252v1 Announce Type: cross Abstract: Long video understanding is a key challenge that plagues the advancement of \emph{Multimodal Large language Mo
ArXiv cs.AI
📄 Paper
1w ago
Omni-NegCLIP: Enhancing CLIP with Front-Layer Contrastive Fine-Tuning for Comprehensive Negation Understanding
arXiv:2603.29258v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) have demonstrated strong capabilities across a wide range of multimodal tasks. H
ArXiv cs.AI
📄 Paper
1w ago
Monodense Deep Neural Model for Determining Item Price Elasticity
arXiv:2603.29261v1 Announce Type: cross Abstract: Item Price Elasticity is used to quantify the responsiveness of consumer demand to changes in item prices, ena
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
PRISM: A Multi-View Multi-Capability Retail Video Dataset for Embodied Vision-Language Models
arXiv:2603.29281v1 Announce Type: cross Abstract: A critical gap exists between the general-purpose visual understanding of state-of-the-art physical AI models
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Sima AIunty: Caste Audit in LLM-Driven Matchmaking
arXiv:2603.29288v1 Announce Type: cross Abstract: Social and personal decisions in relational domains such as matchmaking are deeply entwined with cultural norm
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Downsides of Smartness Across Edge-Cloud Continuum in Modern Industry
arXiv:2603.29289v1 Announce Type: cross Abstract: The fast pace of modern AI is rapidly transforming traditional industrial systems into vast, intelligent and p
ArXiv cs.AI
🎨 Image & Video AI
📄 Paper
⚡ AI Lesson
1w ago
MELT: Improve Composed Image Retrieval via the Modification Frequentation-Rarity Balance Network
arXiv:2603.29291v1 Announce Type: cross Abstract: Composed Image Retrieval (CIR) uses a reference image and a modification text as a query to retrieve a target
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Self-Improving Code Generation via Semantic Entropy and Behavioral Consensus
arXiv:2603.29292v1 Announce Type: cross Abstract: Improving the code generation capabilities of large language models (LLMs) typically relies on supervised fine
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
IMPASTO: Integrating Model-Based Planning with Learned Dynamics Models for Robotic Oil Painting Reproduction
arXiv:2603.29315v1 Announce Type: cross Abstract: Robotic reproduction of oil paintings using soft brushes and pigments requires force-sensitive control of defo
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Real-Time Band-Grouped Vocal Denoising Using Sigmoid-Driven Ideal Ratio Masking
arXiv:2603.29326v1 Announce Type: cross Abstract: Real-time, deep learning-based vocal denoising has seen significant progress over the past few years, demonstr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Beyond Corner Patches: Semantics-Aware Backdoor Attack in Federated Learning
arXiv:2603.29328v1 Announce Type: cross Abstract: Backdoor attacks on federated learning (FL) are most often evaluated with synthetic corner patches or out-of-d
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Scaling Whole-Body Human Musculoskeletal Behavior Emulation for Specificity and Diversity
arXiv:2603.29332v1 Announce Type: cross Abstract: The embodied learning of human motor control requires whole-body neuro-actuated musculoskeletal dynamics, whil
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
CIPHER: Counterfeit Image Pattern High-level Examination via Representation
arXiv:2603.29356v1 Announce Type: cross Abstract: The rapid progress of generative adversarial networks (GANs) and diffusion models has enabled the creation of
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
1w ago
Deep Learning-Based Anomaly Detection in Spacecraft Telemetry on Edge Devices
arXiv:2603.29375v1 Announce Type: cross Abstract: Spacecraft anomaly detection is critical for mission safety, yet deploying sophisticated models on-board prese
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
PromptForge-350k: A Large-Scale Dataset and Contrastive Framework for Prompt-Based AI Image Forgery Localization
arXiv:2603.29386v1 Announce Type: cross Abstract: The rapid democratization of prompt-based AI image editing has recently exacerbated the risks associated with
ArXiv cs.AI
💻 AI-Assisted Coding
📄 Paper
⚡ AI Lesson
1w ago
Extend3D: Town-Scale 3D Generation
arXiv:2603.29387v1 Announce Type: cross Abstract: In this paper, we propose Extend3D, a training-free pipeline for 3D scene generation from a single image, buil
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Security in LLM-as-a-Judge: A Comprehensive SoK
arXiv:2603.29403v1 Announce Type: cross Abstract: LLM-as-a-Judge (LaaJ) is a novel paradigm in which powerful language models are used to assess the quality, sa
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Hallucination-aware intermediate representation edit in large vision-language models
arXiv:2603.29405v1 Announce Type: cross Abstract: Large Vision-Language Models have demonstrated exceptional performance in multimodal reasoning and complex sce
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Hybrid Quantum-Classical Spatiotemporal Forecasting for 3D Cloud Fields
arXiv:2603.29407v1 Announce Type: cross Abstract: Accurate forecasting of three-dimensional (3D) cloud fields is important for atmospheric analysis and short-ra
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
AGFT: Alignment-Guided Fine-Tuning for Zero-Shot Adversarial Robustness of Vision-Language Models
arXiv:2603.29410v1 Announce Type: cross Abstract: Pre-trained vision-language models (VLMs) exhibit strong zero-shot generalization but remain vulnerable to adv
DeepCamp AI