Vision

Computer vision methods that are useful background for modern multimodal systems.

Research category

Papers: 1

Resource links: 2

Latest month: 2022.03

1 paper

Object Detection

2022.03 Object Detection

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

DINO improves DETR-like object detectors with three techniques: contrastive denoising training, mixed query selection for anchor initialization, and a look-forward-twice scheme for box prediction. It reports state-of-the-art results on COCO with a smaller model and less pre-training data than prior leading detectors.

Paper Code
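
To make the contrastive denoising idea in the summary concrete, here is a minimal sketch of how a positive and a negative denoising query might be built from one ground-truth box. It is an illustration under stated assumptions, not the authors' implementation: the function name `make_cdn_queries`, the threshold values for `lambda1` and `lambda2`, and the choice to perturb only the box center (leaving width, height, and class label untouched) are all hypothetical simplifications.

```python
import random


def make_cdn_queries(gt_box, lambda1=0.4, lambda2=0.8, rng=None):
    """Return (positive, negative) noised copies of a (cx, cy, w, h) box.

    Hypothetical helper: lambda1 and lambda2 are illustrative noise
    thresholds with lambda1 < lambda2, not values taken from the paper.
    """
    rng = rng or random.Random()
    cx, cy, w, h = gt_box

    def jitter(low, high):
        # Shift the center along each axis by a fraction of the half
        # extent, with the fraction drawn between low and high and a
        # random sign. Simplification: width and height are not noised.
        def offset(extent):
            magnitude = rng.uniform(low, high)
            sign = rng.choice((-1.0, 1.0))
            return sign * magnitude * extent / 2.0

        return (cx + offset(w), cy + offset(h), w, h)

    # Small noise: the model is trained to reconstruct the original box.
    positive = jitter(0.0, lambda1)
    # Larger noise: the model is trained to predict "no object".
    negative = jitter(lambda1, lambda2)
    return positive, negative


if __name__ == "__main__":
    pos, neg = make_cdn_queries((0.5, 0.5, 0.2, 0.4), rng=random.Random(0))
    print("positive query:", pos)
    print("negative query:", neg)
```

The point of the pairing is that both queries sit near the same object, so the detector has to learn that only the closer one should claim it, which is intended to discourage duplicate predictions.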