yayafengzi's picture
Update README.md
7dd8c7d verified
---
license: apache-2.0
base_model:
- OpenGVLab/InternVL2_5-8B
pipeline_tag: mask-generation
---
<div align="center">
# HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model
[\[πŸ“‚ GitHub\]](https://github.com/yayafengzi/LMM-HiMTok)
[\[πŸ“œ Paper\]](https://arxiv.org/abs/2503.13026)
</div>
This is InternVL2_5-HiMTok-8B model fine-tuned on the refcoco series train dataset.
If you find this project useful in your research, please consider citing:
```BibTeX
@article{wang2025himtok,
title={HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model},
author={Wang, Tao and Cheng, Changxu and Wang, Lingfeng and Chen, Senda and Zhao, Wuyue},
journal={arXiv preprint arXiv:2503.13026},
year={2025}
}
```