Abstract: As a fundamental and challenging task in bridging language and vision domains, Image-Text Retrieval (ITR) aims at searching for the target instances that are semantically relevant to the ...
Our uncertainty-aware interactive segmentation model, SPA, efficiently achieves segmentations whose decisions on uncertain pixels are aligned with users preferences. This is achieved by modeling ...
Abstract: We propose PortraitACG, a novel framework for user-guided portrait image editing that leverages an asymmetric conditional generative adversarial network (GAN), which supports the ...