Spaces: Brightsun10/instance-segmentation-demo

Brightsun10 committed 12 days ago

Commit cdd7eee (verified) · 1 parent: 1b1e5a0

Update README.md

Files changed (1)

README.md +59 -5

@@ -1,8 +1,62 @@
-title: Advanced Instance Segmentation
-emoji: 🖼️✨
-colorFrom: blue
-colorTo: green
 sdk: gradio
 app_file: app.py
 pinned: false
-Advanced Instance Segmentation with Mask2FormerThis Hugging Face Space provides an interactive demo for Instance Segmentation, a computer vision task that locates and delineates each distinct object of interest in an image.This application uses the powerful Mask2Former model (facebook/mask2former-swin-large-coco-instance), a state-of-the-art architecture for panoptic, instance, and semantic segmentation.How to UseUpload an image using the panel on the left. You can also drag and drop a file.If you don't have an image, simply click one of the example images provided below the upload box.The model will process the image and display the output on the right. Each detected object will have:A colored mask overlay.A bounding box.A label with its confidence score.Target ClassesThe model is configured to specifically detect the following classes:Vehicles: car, truck, busPeople: personAnimals: cat, dogLimitationsBuilding Detection: The COCO dataset, on which this model was trained, does not have a generic "building" class. Therefore, buildings will not be segmented. To detect buildings, the model would need to be fine-tuned on a dataset that includes them (e.g., ADE20K).Performance: This is a large model. Processing on free CPU hardware can take 20-40 seconds. For real-time performance, upgrading the Space to GPU hardware is recommended.

+---
+title: Instance Segmentation Demo
+emoji: 🖼️
+colorFrom: pink
+colorTo: purple
 sdk: gradio
+sdk_version: "4.24.0"
 app_file: app.py
 pinned: false
+---
+# 🖼️ Instance Segmentation with Mask2Former
+This demo performs **advanced instance segmentation** using [Mask2Former](https://huggingface.co/facebook/mask2former-swin-large-coco-instance) from Facebook AI. It identifies and highlights individual objects in an image with:
+- **Colored masks**
+- **Bounding boxes**
+- **Class labels and confidence scores**
+## 🚀 How It Works
+- Input an image via upload or example selection.
+- The app uses the `facebook/mask2former-swin-large-coco-instance` model to detect objects.
+- Only the following classes are visualized:
+  - `cat`, `dog`, `car`, `truck`, `bus`, `person`
+- Results are drawn on the image and displayed along with a status message.
+## 🧠 Model
+- **Architecture:** Mask2Former with Swin-Large backbone
+- **Dataset:** COCO Instance
+- **Framework:** Hugging Face Transformers + PyTorch
+## 💻 Technologies Used
+- Python 🐍
+- [Gradio](https://gradio.app) for UI
+- Hugging Face Transformers
+- PIL & NumPy for image manipulation
+## 📷 Example Images
+Try the demo with example images such as:
+- Cats vs. Dogs
+- Street scenes with vehicles and people
+You can also upload your own images!
+## 📌 Notes
+- Detection is limited to high-confidence predictions (`score > 0.9`)
+- This demo is optimized for CPU; inference may take up to 30 seconds.
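Review note: the README describes two filters, the class allow-list under "How It Works" and the `score > 0.9` cutoff in the notes, but this commit does not show how `app.py` applies them. A minimal sketch of that filtering step, assuming each detection is a dict with `id`, `label_id`, and `score` keys; the helper name `filter_segments` and the toy label mapping are hypothetical and not taken from the Space's code:

```python
# Hypothetical helper illustrating the filtering the README describes;
# this is not the Space's actual app.py.

# Class allow-list and score cutoff as stated in the README.
TARGET_CLASSES = {"cat", "dog", "car", "truck", "bus", "person"}
SCORE_THRESHOLD = 0.9


def filter_segments(segments_info, id2label,
                    classes=TARGET_CLASSES, threshold=SCORE_THRESHOLD):
    """Keep detections whose class is allowed and whose score exceeds the threshold."""
    kept = []
    for seg in segments_info:
        label = id2label[seg["label_id"]]
        # Strict comparison, matching the README's `score > 0.9`.
        if label in classes and seg["score"] > threshold:
            kept.append({"id": seg["id"], "label": label, "score": seg["score"]})
    return kept


if __name__ == "__main__":
    # Toy data for illustration only; the ids are not real model output.
    id2label = {0: "person", 1: "car", 2: "bench"}
    segments = [
        {"id": 1, "label_id": 0, "score": 0.97},  # kept
        {"id": 2, "label_id": 1, "score": 0.62},  # dropped: low score
        {"id": 3, "label_id": 2, "score": 0.99},  # dropped: class not in the list
    ]
    for seg in filter_segments(segments, id2label):
        print(seg["id"], seg["label"], seg["score"])
```

In a Transformers-based app, the input list would plausibly be the `segments_info` entry returned by the image processor's `post_process_instance_segmentation`, with `id2label` taken from `model.config.id2label`; whether `app.py` does it this way is an assumption.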
+---
+## 🛠️ Developer Notes
+This app uses the following Gradio configuration:
+```yaml
+sdk: gradio
+sdk_version: "4.24.0"
+app_file: app.py