✕ Clear all filters
94 articles

📰 ArXiv cs.AI

94 articles · Updated every 3 hours · View all reads

All Articles 81,277Blog Posts 104,942Tech Tutorials 19,794Research Papers 17,820News 13,834 ⚡ AI Lessons
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
Vision-Guided Iterative Refinement for Frontend Code Generation
arXiv:2604.05839v1 Announce Type: new Abstract: Code generation with large language models often relies on multi-stage human-in-the-loop refinement, which is ef
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
Architecture Without Architects: How AI Coding Agents Shape Software Architecture
arXiv:2604.04990v1 Announce Type: cross Abstract: AI coding agents select frameworks, scaffold infrastructure, and wire integrations, often in seconds. These ar
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
ID-Sim: An Identity-Focused Similarity Metric
arXiv:2604.05039v1 Announce Type: cross Abstract: Humans have remarkable selective sensitivity to identities -- easily distinguishing between highly similar ide
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
AutoLALA: Automatic Loop Algebraic Locality Analysis for AI and HPC Kernels
arXiv:2604.05066v1 Announce Type: cross Abstract: Data movement is the primary bottleneck in modern computing systems. For loop-based programs common in high-pe
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
LSRM: High-Fidelity Object-Centric Reconstruction via Scaled Context Windows
arXiv:2604.05182v1 Announce Type: cross Abstract: We introduce the Large Sparse Reconstruction Model to study how scaling transformer context windows impacts fe
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
3DTurboQuant: Training-Free Near-Optimal Quantization for 3D Reconstruction Models
arXiv:2604.05366v1 Announce Type: cross Abstract: Every existing method for compressing 3D Gaussian Splatting, NeRF, or transformer-based 3D reconstructors requ
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
Human Interaction-Aware 3D Reconstruction from a Single Image
arXiv:2604.05436v1 Announce Type: cross Abstract: Reconstructing textured 3D human models from a single image is fundamental for AR/VR and digital human applica
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
Evaluation of Randomization through Style Transfer for Enhanced Domain Generalization
arXiv:2604.05616v1 Announce Type: cross Abstract: Deep learning models for computer vision often suffer from poor generalization when deployed in real-world set
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
Semantic-Topological Graph Reasoning for Language-Guided Pulmonary Screening
arXiv:2604.05620v1 Announce Type: cross Abstract: Medical image segmentation driven by free-text clinical instructions is a critical frontier in computer-aided
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
On the Robustness of Diffusion-Based Image Compression to Bit-Flip Errors
arXiv:2604.05743v1 Announce Type: cross Abstract: Modern image compression methods are typically optimized for the rate--distortion--perception trade-off, where
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
Graph-PiT: Enhancing Structural Coherence in Part-Based Image Synthesis via Graph Priors
arXiv:2604.06074v1 Announce Type: cross Abstract: Achieving fine-grained and structurally sound controllability is a cornerstone of advanced visual generation.
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
DiffHDR: Re-Exposing LDR Videos with Video Diffusion Models
arXiv:2604.06161v1 Announce Type: cross Abstract: Most digital videos are stored in 8-bit low dynamic range (LDR) formats, where much of the original high dynam
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
BulletGen: Improving 4D Reconstruction with Bullet-Time Generation
arXiv:2506.18601v2 Announce Type: replace-cross Abstract: Transforming casually captured, monocular videos into fully immersive dynamic experiences is a highly
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
Aleatoric Uncertainty Medical Image Segmentation Estimation via Flow Matching
arXiv:2507.22418v3 Announce Type: replace-cross Abstract: Quantifying aleatoric uncertainty in medical image segmentation is critical since it is a reflection o
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
Cross-Domain Few-Shot Learning for Hyperspectral Image Classification Based on Mixup Foundation Model
arXiv:2601.22581v2 Announce Type: replace-cross Abstract: Although cross-domain few-shot learning (CDFSL) for hyper-spectral image (HSI) classification has attr
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
R3G: A Reasoning--Retrieval--Reranking Framework for Vision-Centric Answer Generation
arXiv:2602.00104v2 Announce Type: replace-cross Abstract: Vision-centric retrieval for VQA requires retrieving images to supply missing visual cues and integrat
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
Automatic Image-Level Morphological Trait Annotation for Organismal Images
arXiv:2604.01619v2 Announce Type: replace-cross Abstract: Morphological traits are physical characteristics of biological organisms that provide vital clues on
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
MoViD: View-Invariant 3D Human Pose Estimation via Motion-View Disentanglement
arXiv:2604.03299v1 Announce Type: cross Abstract: 3D human pose estimation is a key enabling technology for applications such as healthcare monitoring, human-ro
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
TreeGaussian: Tree-Guided Cascaded Contrastive Learning for Hierarchical Consistent 3D Gaussian Scene Segmentation and Understanding
arXiv:2604.03309v1 Announce Type: cross Abstract: 3D Gaussian Splatting (3DGS) has emerged as a real-time, differentiable representation for neural scene unders
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2mo ago
StoryBlender: Inter-Shot Consistent and Editable 3D Storyboard with Spatial-temporal Dynamics
arXiv:2604.03315v1 Announce Type: cross Abstract: Storyboarding is a core skill in visual storytelling for film, animation, and games. However, automating this