Instance segmentor first using sam model to get all obj's mask of the input image. Second using clip model to classify each mask with both image features and your ...
⭐ If SSDiff is helpful to your paper or project, please consider star this repo or cite our paper. Thanks! 🤗 2025.12.07: Codes are relased. 2025.12.03: Checkpoints and scripts are relased. 2025.12.02 ...