2022 | Computer Vision | Object Detection | Vision Transformer

OWL-ViT: Open-Vocabulary Object Detection using Vision Transformers

Open-vocabulary object detection using vision transformers with text prompts.
