|
--- |
|
language: multilingual |
|
tags: |
|
- fasttext |
|
datasets: |
|
- wikipedia |
|
- tatoeba |
|
- setimes |
|
license: cc-by-sa-3.0 |
|
--- |
|
|
|
## FastText model for language identification |
|
|
|
#### ♻️ Imported from https://fasttext.cc/docs/en/language-identification.html |
|
|
|
> [1] A. Joulin, E. Grave, P. Bojanowski, T. Mikolov, Bag of Tricks for Efficient Text Classification |
|
|
|
```bibtex |
|
@article{joulin2016bag, |
|
title={Bag of Tricks for Efficient Text Classification}, |
|
author={Joulin, Armand and Grave, Edouard and Bojanowski, Piotr and Mikolov, Tomas}, |
|
journal={arXiv preprint arXiv:1607.01759}, |
|
year={2016} |
|
} |
|
``` |
|
|
|
> [2] A. Joulin, E. Grave, P. Bojanowski, M. Douze, H. Jégou, T. Mikolov, FastText.zip: Compressing text classification models |
|
|
|
```bibtex |
|
@article{joulin2016fasttext, |
|
title={FastText.zip: Compressing text classification models}, |
|
author={Joulin, Armand and Grave, Edouard and Bojanowski, Piotr and Douze, Matthijs and J{\'e}gou, H{\'e}rve and Mikolov, Tomas}, |
|
journal={arXiv preprint arXiv:1612.03651}, |
|
year={2016} |
|
} |
|
``` |