DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
DINO improves DETR-like object detectors by introducing contrastive denoising training, mixed query selection for anchor initialization, and a look-forward-twice box prediction scheme, achieving state-of-the-art results on COCO with significantly reduced model and data requirements.