Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.
-
Updated
Aug 7, 2024 - Python
Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.
GroundedSAM Base Model plugin for Autodistill
This project focuses on generating a diverse and realistic dataset for computer vision training using ChatGPT and a realistic vision image generation model. The process involves dynamically creating prompts, utilizing ChatGPT to generate image descriptions, and generating images based on those descriptions.
Prompt based automatic annotation
AI image masker with Grounded SAM using a Python GUI desktop app.
A Cross-Frame Multimodal Retrieval Augmented Generation (CFM-RAG) for Video Intelligence. It retrieves the most relevant multimodal evidence and empowers LLMs to deliver context-rich answers.
Add a description, image, and links to the grounded-sam topic page so that developers can more easily learn about it.
To associate your repository with the grounded-sam topic, visit your repo's landing page and select "manage topics."