---
license: agpl-3.0
pipeline_tag: zero-shot-object-detection
tags:
- model_hub_mixin
- pytorch_model_hub_mixin
---
### Model Description
[YOLOE: Real-Time Seeing Anything](https://arxiv.org/abs/2503.07465)
- arXiv: https://arxiv.org/abs/2503.07465
- GitHub: https://github.com/THU-MIG/yoloe
### Installation
```
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/CLIP
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/ml-mobileclip
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/lvis-api
pip install git+https://github.com/THU-MIG/yoloe.git
wget https://docs-assets.developer.apple.com/ml-research/datasets/mobileclip/mobileclip_blt.pt
```
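To confirm that the steps above took effect, a quick check along the following lines may help. It is a minimal sketch, not part of the YOLOE repository: the top-level module names (`clip`, `mobileclip`, `lvis`, `ultralytics`) are assumptions about what each package installs, and the helper `find_missing` is a hypothetical name introduced here. It also assumes `mobileclip_blt.pt` was downloaded into the current working directory.

```python
import importlib.util
from pathlib import Path

# Assumed top-level module names for the four pip installs above;
# adjust them if the packages expose different import names.
ASSUMED_MODULES = ["clip", "mobileclip", "lvis", "ultralytics"]

# The wget command above saves the checkpoint under this file name.
CHECKPOINT = Path("mobileclip_blt.pt")


def find_missing(modules):
    """Return the module names that cannot be located on the import path."""
    missing = []
    for name in modules:
        try:
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # find_spec can raise for broken or partially installed packages.
            found = False
        if not found:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = find_missing(ASSUMED_MODULES)
    print("missing modules:", ", ".join(missing) if missing else "none")
    print("checkpoint present:", CHECKPOINT.is_file())
```

The check only looks modules up on the import path without importing them, so it runs quickly and does not load any model weights.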
### Prediction
Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#prediction.
### Transferring
Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#transferring.
### Validation
Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#validation.
### Training
Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#training.
### BibTeX Entry and Citation Info
```
@misc{wang2025yoloerealtimeseeing,
title={YOLOE: Real-Time Seeing Anything},
author={Ao Wang and Lihao Liu and Hui Chen and Zijia Lin and Jungong Han and Guiguang Ding},
year={2025},
eprint={2503.07465},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2503.07465},
}
```