dice-research/lola_v1
Tags: Text Generation, Transformers, Safetensors, uonlp/CulturaX, lola_v1, multilingual, Mixture of Experts, custom_code
arxiv: 2409.11272
License: cc-by-4.0
1 contributor
History: 13 commits
Latest commit: neo-nlp-dev, "updating config for aux loss coefficient" (c2502f1, verified), 6 months ago
| File | Scan | Size | Storage | Last commit message | Updated |
|---|---|---|---|---|---|
| .gitattributes | Safe | 1.52 kB | | initial commit | about 1 year ago |
| README.md | | 3.97 kB | | Update README.md | 8 months ago |
| config.json | | 977 Bytes | | Update config.json | about 1 year ago |
| configuration_lola_gpt2.py | | 2.7 kB | | updating config for aux loss coefficient | 6 months ago |
| generation_config.json | Safe | 121 Bytes | | Upload model | about 1 year ago |
| merges.txt | Safe | 1.2 MB | | Upload tokenizer | about 1 year ago |
| model-00001-of-00006.safetensors | Safe | 5 GB | LFS | Upload model | about 1 year ago |
| model-00002-of-00006.safetensors | Safe | 4.97 GB | LFS | Upload model | about 1 year ago |
| model-00003-of-00006.safetensors | Safe | 4.97 GB | LFS | Upload model | about 1 year ago |
| model-00004-of-00006.safetensors | Safe | 4.97 GB | LFS | Upload model | about 1 year ago |
| model-00005-of-00006.safetensors | Safe | 4.97 GB | LFS | Upload model | about 1 year ago |
| model-00006-of-00006.safetensors | Safe | 4.97 GB | LFS | Upload model | about 1 year ago |
| model.safetensors.index.json | Safe | 85.1 kB | | Upload model | about 1 year ago |
| modeling_lola_gpt2.py | Safe | 29.2 kB | | Upload model | about 1 year ago |
| special_tokens_map.json | Safe | 834 Bytes | | Upload tokenizer | about 1 year ago |
| tokenizer.json | Safe | 4.8 MB | | Upload tokenizer | about 1 year ago |
| tokenizer_config.json | Safe | 1.46 kB | | Upload tokenizer | about 1 year ago |
| vocab.json | Safe | 1.89 MB | | Upload tokenizer | about 1 year ago |
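
The six safetensors shards listed above account for nearly all of the repository's size. As a rough illustration, the following sketch totals the shard sizes exactly as the page displays them. The name `SHARD_SIZES_GB` and the helper `total_size_gb` are hypothetical, and because the page rounds each size, the result is only an approximation of the real download size.

```python
# Shard sizes in GB, copied from the file listing.
# Assumption: the page shows rounded values, so the total is approximate.
SHARD_SIZES_GB = {
    "model-00001-of-00006.safetensors": 5.0,
    "model-00002-of-00006.safetensors": 4.97,
    "model-00003-of-00006.safetensors": 4.97,
    "model-00004-of-00006.safetensors": 4.97,
    "model-00005-of-00006.safetensors": 4.97,
    "model-00006-of-00006.safetensors": 4.97,
}


def total_size_gb(sizes):
    """Return the sum of the listed shard sizes, rounded to two decimals."""
    return round(sum(sizes.values()), 2)


total_gb = total_size_gb(SHARD_SIZES_GB)
print(f"{len(SHARD_SIZES_GB)} shards, approximately {total_gb} GB in total")
```

The tokenizer and configuration files add only a few megabytes on top of this figure.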