Awesome LLM Research Collections
  • Home
  • Papers
    • Attention
    • LLMs
    • Multimodal LLMs
    • Embeddings
    • SFT
    • Training
    • Reinforcement Learning
    • Agents Application
    • Vision
    • Auto-Prompt
  • Notes
  • Blogs
  • English
  • 中文

Vision

Computer vision methods that are useful background for modern multimodal systems.
中文

Research category

Computer vision methods that are useful background for modern multimodal systems.

1Papers
2Resource links
2022.03Latest month
Object Detection

1 paper

Object Detection

2022.03 Object Detection

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

DINO improves DETR-like object detectors by introducing contrastive denoising training, mixed query selection for anchor initialization, and a look-forward-twice box prediction scheme, achieving state-of-the-art results on COCO with significantly reduced model and data requirements.

Paper Code
  • View source
  • Report an issue