2022Computer VisionObject DetectionVision Transformer

OWL-ViT: Open-Vocabulary Object Detection using Vision Transformers

Open-vocabulary object detection using vision transformers with text prompts.

Original Source

Paper Preview

Select text on the current page to save it as a note. Saved notes stay highlighted.

Notes

Select text from the current page. Saving it will persist both the note and the highlight.

Login to unlock note-taking and persistent markers.

Current selection

Select text from the page preview to save it as a note.
Notes lockedSign in to create, view, and revisit saved annotations for this paper.