📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (8687) ArXiv cs.AI Forbes Innovation OpenAI News Dev.to AI Hugging Face Blog Hackernoon

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Adversarial Prompt Injection Attack on Multimodal Large Language Models

arXiv:2603.29418v1 Announce Type: cross Abstract: Although multimodal large language models (MLLMs) are increasingly deployed in real-world applications, their

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

RAAP: Retrieval-Augmented Affordance Prediction with Cross-Image Action Alignment

arXiv:2603.29419v1 Announce Type: cross Abstract: Understanding object affordances is essential for enabling robots to perform purposeful and fine-grained inter

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1w ago

NeoNet: An End-to-End 3D MRI-Based Deep Learning Framework for Non-Invasive Prediction of Perineural Invasion via Generation-Driven Classification

arXiv:2603.29449v1 Announce Type: cross Abstract: Minimizing invasive diagnostic procedures to reduce the risk of patient injury and infection is a central goal

ArXiv cs.AI 🧠 Large Language Models 📄 Paper 1w ago

Few-shot Writer Adaptation via Multimodal In-Context Learning

arXiv:2603.29450v1 Announce Type: cross Abstract: While state-of-the-art Handwritten Text Recognition (HTR) models perform well on standard benchmarks, they fre

ArXiv cs.AI 🧠 Large Language Models 📄 Paper 1w ago

An Isotropic Approach to Efficient Uncertainty Quantification with Gradient Norms

arXiv:2603.29466v1 Announce Type: cross Abstract: Existing methods for quantifying predictive uncertainty in neural networks are either computationally intracta

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

M-MiniGPT4: Multilingual VLLM Alignment via Translated Data

arXiv:2603.29467v1 Announce Type: cross Abstract: This paper presents a Multilingual Vision Large Language Model, named M-MiniGPT4. Our model exhibits strong vi

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1w ago

iPoster: Content-Aware Layout Generation for Interactive Poster Design via Graph-Enhanced Diffusion Models

arXiv:2603.29469v1 Announce Type: cross Abstract: We present iPoster, an interactive layout generation framework that empowers users to guide content-aware post

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

MemFactory: Unified Inference & Training Framework for Agent Memory

arXiv:2603.29493v1 Announce Type: cross Abstract: Memory-augmented Large Language Models (LLMs) are essential for developing capable, long-term AI agents. Recen

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

Target-Aligned Reinforcement Learning

arXiv:2603.29501v1 Announce Type: cross Abstract: Many reinforcement learning algorithms rely on target networks - lagged copies of the online network - to stab

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Impact of enriched meaning representations for language generation in dialogue tasks: A comprehensive exploration of the relevance of tasks, corpora and metrics

arXiv:2603.29518v1 Announce Type: cross Abstract: Conversational systems should generate diverse language forms to interact fluently and accurately with users.

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

TrafficMoE: Heterogeneity-aware Mixture of Experts for Encrypted Traffic Classification

arXiv:2603.29520v1 Announce Type: cross Abstract: Encrypted traffic classification is a critical task for network security. While deep learning has advanced thi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Baby Scale: Investigating Models Trained on Individual Children's Language Input

arXiv:2603.29522v1 Announce Type: cross Abstract: Modern language models (LMs) must be trained on many orders of magnitude more words of training data than huma

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

Quantization with Unified Adaptive Distillation to enable multi-LoRA based one-for-all Generative Vision Models on edge

arXiv:2603.29535v1 Announce Type: cross Abstract: Generative Artificial Intelligence (GenAI) features such as image editing, object removal, and prompt-guided i

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper 1w ago

Mean Masked Autoencoder with Flow-Mixing for Encrypted Traffic Classification

arXiv:2603.29537v1 Announce Type: cross Abstract: Network traffic classification using self-supervised pre-training models based on Masked Autoencoders (MAE) ha

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper 1w ago

Reducing Complexity for Quantum Approaches in Train Load Optimization

arXiv:2603.29543v1 Announce Type: cross Abstract: Efficiently planning container loads onto trains is a computationally challenging combinatorial optimization p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper 1w ago

Bringing Up a Bilingual BabyLM: Investigating Multilingual Language Acquisition Using Small-Scale Models

arXiv:2603.29552v1 Announce Type: cross Abstract: Multilingualism is incredibly common around the world, leading to many important theoretical and practical que

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1w ago

Generating Key Postures of Bharatanatyam Adavus with Pose Estimation

arXiv:2603.29570v1 Announce Type: cross Abstract: Preserving intangible cultural dances rooted in centuries of tradition and governed by strict structural and s

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

Turbo4DGen: Ultra-Fast Acceleration for 4D Generation

arXiv:2603.29572v1 Announce Type: cross Abstract: 4D generation, or dynamic 3D content generation, integrates spatial, temporal, and view dimensions to model re

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

Learn2Fold: Structured Origami Generation with World Model Planning

arXiv:2603.29585v1 Announce Type: cross Abstract: The ability to transform a flat sheet into a complex three-dimensional structure is a fundamental test of phys

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

FigAgent: Towards Automatic Method Illustration Figure Generation for AI Scientific Papers

arXiv:2603.29590v1 Announce Type: cross Abstract: Method illustration figures (MIFs) play a crucial role in conveying the core ideas of scientific papers, yet t

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper 1w ago

IMAGAgent: Orchestrating Multi-Turn Image Editing via Constraint-Aware Planning and Reflection

arXiv:2603.29602v1 Announce Type: cross Abstract: Existing multi-turn image editing paradigms are often confined to isolated single-step execution. Due to a lac

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago

Convergent Representations of Linguistic Constructions in Human and Artificial Neural Systems

arXiv:2603.29617v1 Announce Type: cross Abstract: Understanding how the brain processes linguistic constructions is a central challenge in cognitive neuroscienc

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago

An Empirical Study of Multi-Agent Collaboration for Automated Research

arXiv:2603.29632v1 Announce Type: cross Abstract: As AI agents evolve, the community is rapidly shifting from single Large Language Models (LLMs) to Multi-Agent

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1w ago

MacTok: Robust Continuous Tokenization for Image Generation

arXiv:2603.29634v1 Announce Type: cross Abstract: Continuous image tokenizers enable efficient visual generation, and those based on variational frameworks can