PR-MaGIC: Prompt Refinement Via Mask Decoder Gradient Flow For In-Context Segmentation
📰 ArXiv cs.AI
arXiv:2604.12113v1 Announce Type: cross Abstract: Visual Foundation Models (VFMs) such as the Segment Anything Model (SAM) have significantly advanced broad use of image segmentation. However, SAM and its variants necessitate substantial manual effort for prompt generation and additional training for specific applications. Recent approaches address these limitations by integrating SAM into in-context (one/few shot) segmentation, enabling auto-prompting through semantic alignment between query an
DeepCamp AI