We propose MaskCut approach to generate pseudo-masks for multiple objects in an image. CutLER can learn unsupervised object detectors and instance segmentors solely on ImageNet-1K. CutLER exhibits ...
OV-DQUO is an open-vocabulary detection framework that learns from open-world unknown objects through wildcard matching and contrastive denoising training methods, mitigating performance degradation ...