📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 3,273 articles · Updated every 3 hours · View all news
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Adversarial Prompt Injection Attack on Multimodal Large Language Models
arXiv:2603.29418v1 Announce Type: cross Abstract: Although multimodal large language models (MLLMs) are increasingly deployed in real-world applications, their
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
RAAP: Retrieval-Augmented Affordance Prediction with Cross-Image Action Alignment
arXiv:2603.29419v1 Announce Type: cross Abstract: Understanding object affordances is essential for enabling robots to perform purposeful and fine-grained inter
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
1w ago
NeoNet: An End-to-End 3D MRI-Based Deep Learning Framework for Non-Invasive Prediction of Perineural Invasion via Generation-Driven Classification
arXiv:2603.29449v1 Announce Type: cross Abstract: Minimizing invasive diagnostic procedures to reduce the risk of patient injury and infection is a central goal
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
1w ago
Few-shot Writer Adaptation via Multimodal In-Context Learning
arXiv:2603.29450v1 Announce Type: cross Abstract: While state-of-the-art Handwritten Text Recognition (HTR) models perform well on standard benchmarks, they fre
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
1w ago
An Isotropic Approach to Efficient Uncertainty Quantification with Gradient Norms
arXiv:2603.29466v1 Announce Type: cross Abstract: Existing methods for quantifying predictive uncertainty in neural networks are either computationally intracta
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
M-MiniGPT4: Multilingual VLLM Alignment via Translated Data
arXiv:2603.29467v1 Announce Type: cross Abstract: This paper presents a Multilingual Vision Large Language Model, named M-MiniGPT4. Our model exhibits strong vi
ArXiv cs.AI
💻 AI-Assisted Coding
📄 Paper
⚡ AI Lesson
1w ago
iPoster: Content-Aware Layout Generation for Interactive Poster Design via Graph-Enhanced Diffusion Models
arXiv:2603.29469v1 Announce Type: cross Abstract: We present iPoster, an interactive layout generation framework that empowers users to guide content-aware post
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
MemFactory: Unified Inference & Training Framework for Agent Memory
arXiv:2603.29493v1 Announce Type: cross Abstract: Memory-augmented Large Language Models (LLMs) are essential for developing capable, long-term AI agents. Recen
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Target-Aligned Reinforcement Learning
arXiv:2603.29501v1 Announce Type: cross Abstract: Many reinforcement learning algorithms rely on target networks - lagged copies of the online network - to stab
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Impact of enriched meaning representations for language generation in dialogue tasks: A comprehensive exploration of the relevance of tasks, corpora and metrics
arXiv:2603.29518v1 Announce Type: cross Abstract: Conversational systems should generate diverse language forms to interact fluently and accurately with users.
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
TrafficMoE: Heterogeneity-aware Mixture of Experts for Encrypted Traffic Classification
arXiv:2603.29520v1 Announce Type: cross Abstract: Encrypted traffic classification is a critical task for network security. While deep learning has advanced thi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Baby Scale: Investigating Models Trained on Individual Children's Language Input
arXiv:2603.29522v1 Announce Type: cross Abstract: Modern language models (LMs) must be trained on many orders of magnitude more words of training data than huma
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Quantization with Unified Adaptive Distillation to enable multi-LoRA based one-for-all Generative Vision Models on edge
arXiv:2603.29535v1 Announce Type: cross Abstract: Generative Artificial Intelligence (GenAI) features such as image editing, object removal, and prompt-guided i
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
1w ago
Mean Masked Autoencoder with Flow-Mixing for Encrypted Traffic Classification
arXiv:2603.29537v1 Announce Type: cross Abstract: Network traffic classification using self-supervised pre-training models based on Masked Autoencoders (MAE) ha
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
1w ago
Reducing Complexity for Quantum Approaches in Train Load Optimization
arXiv:2603.29543v1 Announce Type: cross Abstract: Efficiently planning container loads onto trains is a computationally challenging combinatorial optimization p
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
1w ago
Bringing Up a Bilingual BabyLM: Investigating Multilingual Language Acquisition Using Small-Scale Models
arXiv:2603.29552v1 Announce Type: cross Abstract: Multilingualism is incredibly common around the world, leading to many important theoretical and practical que
ArXiv cs.AI
💻 AI-Assisted Coding
📄 Paper
⚡ AI Lesson
1w ago
Generating Key Postures of Bharatanatyam Adavus with Pose Estimation
arXiv:2603.29570v1 Announce Type: cross Abstract: Preserving intangible cultural dances rooted in centuries of tradition and governed by strict structural and s
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Turbo4DGen: Ultra-Fast Acceleration for 4D Generation
arXiv:2603.29572v1 Announce Type: cross Abstract: 4D generation, or dynamic 3D content generation, integrates spatial, temporal, and view dimensions to model re
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
Learn2Fold: Structured Origami Generation with World Model Planning
arXiv:2603.29585v1 Announce Type: cross Abstract: The ability to transform a flat sheet into a complex three-dimensional structure is a fundamental test of phys
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
FigAgent: Towards Automatic Method Illustration Figure Generation for AI Scientific Papers
arXiv:2603.29590v1 Announce Type: cross Abstract: Method illustration figures (MIFs) play a crucial role in conveying the core ideas of scientific papers, yet t
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
1w ago
IMAGAgent: Orchestrating Multi-Turn Image Editing via Constraint-Aware Planning and Reflection
arXiv:2603.29602v1 Announce Type: cross Abstract: Existing multi-turn image editing paradigms are often confined to isolated single-step execution. Due to a lac
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Convergent Representations of Linguistic Constructions in Human and Artificial Neural Systems
arXiv:2603.29617v1 Announce Type: cross Abstract: Understanding how the brain processes linguistic constructions is a central challenge in cognitive neuroscienc
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1w ago
An Empirical Study of Multi-Agent Collaboration for Automated Research
arXiv:2603.29632v1 Announce Type: cross Abstract: As AI agents evolve, the community is rapidly shifting from single Large Language Models (LLMs) to Multi-Agent
ArXiv cs.AI
💻 AI-Assisted Coding
📄 Paper
⚡ AI Lesson
1w ago
MacTok: Robust Continuous Tokenization for Image Generation
arXiv:2603.29634v1 Announce Type: cross Abstract: Continuous image tokenizers enable efficient visual generation, and those based on variational frameworks can
DeepCamp AI