|
--- |
|
language: |
|
- en |
|
library_name: CountGD |
|
license: mit |
|
tags: |
|
- computer-vision |
|
- counting |
|
- grounding-dino |
|
- model_hub_mixin |
|
- multi-modal |
|
- open-vocabulary |
|
- pytorch_model_hub_mixin |
|
- transformers |
|
--- |
|
|
|
# CountGD |
|
|
|
A Multi-Modal Open-World Counting Model for counting objects in an image with text and image prompts. |
|
For more details, please check out the following links |
|
|
|
- Project page: https://www.robots.ox.ac.uk/~vgg/research/countgd/ |
|
- Code: https://github.com/niki-amini-naieni/CountGD |
|
- Demo: https://huggingface.co/spaces/nikigoli/countgd |
|
- Paper: https://arxiv.org/pdf/2407.04619 |
|
|
|
 |
|
|
|
## Architecture |
|
|
|
 |
|
|
|
|
|
## Citation |
|
|
|
``` |
|
@article{AminiNaieni24, |
|
author = "Amini-Naieni, N. and Han, T. and Zisserman, A.", |
|
title = "CountGD: Multi-Modal Open-World Counting", |
|
booktitle = "arxiv", |
|
year = "2024", |
|
} |
|
``` |