
Premium content
Access to this content requires a subscription. You must be a premium user to view this content.

Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.
keywords:
cv
segmentation
The Segment Anything Model (SAM) has proven to be a powerful foundation model for segmentation, demonstrating robust zero-shot generalization through prompt engineering. However, relying on manual prompts is impractical for real-world applications, especially on edge devices that require rapid prompt provision and resource efficiency. In this paper, we propose the Automation of Prompts for SAM (AoP-SAM), a novel approach that learns to predict the locations of essential prompts and is integrated with the Adaptive Sampling and Filtering (ASF) technique. This method automatically generates essential prompts for segmentation, thereby eliminating the need for manual prompt provision. We employ a simple yet efficient model that detects essential entities from entire images and identifies the optimal locations of potential prompts for segmentation, leveraging SAM’s image embeddings while ensuring its zero-shot generalization without fine-tuning. Moreover, we propose ASF to collect essential prompts in a coarse-to-fine manner, significantly improving prompt and mask generation efficiency by greatly reducing redundant mask refinements. We evaluate AoP-SAM on three datasets and demonstrate that our approach substantially enhances prompt and mask generation efficiency compared to previous prompting methods while matching the performance of segmentation. This improves the accuracy and efficiency of SAM in segmentation applications on edge devices.