Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Updated Sep 5, 2024 - Jupyter Notebook
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
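Pipelines like Grounded SAM chain a text-prompted detector with a promptable segmenter: Grounding DINO returns boxes for a phrase, and those boxes are fed to SAM as box prompts. A minimal sketch of the glue step, assuming (as Grounding DINO does) that the detector emits boxes in normalized (cx, cy, w, h) format while SAM expects absolute (x1, y1, x2, y2) pixel coordinates; the function name is illustrative, not from either library:

```python
import numpy as np

def cxcywh_norm_to_xyxy(boxes, img_w, img_h):
    """Convert normalized (cx, cy, w, h) boxes to absolute (x1, y1, x2, y2).

    boxes: array of shape (N, 4) with values in [0, 1].
    Returns an (N, 4) array in pixel coordinates, ready to use as SAM box prompts.
    """
    boxes = np.asarray(boxes, dtype=float)
    cx, cy = boxes[:, 0] * img_w, boxes[:, 1] * img_h
    w, h = boxes[:, 2] * img_w, boxes[:, 3] * img_h
    return np.stack([cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2], axis=1)
```

For example, a centered box covering half the image, `[0.5, 0.5, 0.5, 0.5]`, on a 200x100 image maps to `[50, 25, 150, 75]`.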
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection, and Referring Expression Comprehension. Updated frequently; pull requests welcome.
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).
[CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
[WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"
[CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding."
Deploying GroundingDINO open-world object detection with ONNX Runtime, with both C++ and Python implementations.
Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.
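Regardless of the detector (YOLO-World, GroundingDINO, or another open-vocabulary model), ONNX inference outputs are typically post-processed with score thresholding and non-maximum suppression before use. A generic NumPy sketch of greedy NMS on (x1, y1, x2, y2) boxes; this is a standard algorithm, not code from the repos listed above:

```python
import numpy as np

def nms(boxes, scores, iou_thresh=0.5):
    """Greedy non-maximum suppression.

    boxes: (N, 4) array of (x1, y1, x2, y2); scores: (N,) confidences.
    Returns indices of kept boxes, highest score first.
    """
    order = scores.argsort()[::-1]  # process boxes from highest score down
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(int(i))
        if order.size == 1:
            break
        rest = order[1:]
        # Intersection of the top box with each remaining box.
        x1 = np.maximum(boxes[i, 0], boxes[rest, 0])
        y1 = np.maximum(boxes[i, 1], boxes[rest, 1])
        x2 = np.minimum(boxes[i, 2], boxes[rest, 2])
        y2 = np.minimum(boxes[i, 3], boxes[rest, 3])
        inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
        area_i = (boxes[i, 2] - boxes[i, 0]) * (boxes[i, 3] - boxes[i, 1])
        area_r = (boxes[rest, 2] - boxes[rest, 0]) * (boxes[rest, 3] - boxes[rest, 1])
        iou = inter / (area_i + area_r - inter)
        # Drop boxes that overlap the kept box too much.
        order = rest[iou < iou_thresh]
    return keep
```

Two heavily overlapping detections of the same object collapse to the higher-scoring one, while well-separated boxes survive.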
A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)
🦕 Official Code for "Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"
[AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation