File size: 1,358 Bytes
a686184 0e4071b a686184 c5b3871 a686184 c5b3871 a686184 c5b3871 a686184 c5b3871 a686184 c5b3871 a686184 c5b3871 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 |
---
library_name: transformers
license: llama3.1
---
### Lugha-Llama: Adapting Large Language Models for African Languages
Authors: [Happy Buzaaba](https://buzaabah.github.io/), [Alexander Wettig](https://www.cs.princeton.edu/~awettig/), [David Ifeoluwa Adelani](https://dadelani.github.io/), [Christiane Fellbaum](https://www.cs.princeton.edu/people/profile/fellbaum)
Low-resource african languages remain underrepresented in the large training datasets of large language models (LLMs) and, as a result, LLMs struggle to understand these languages.
We are releasing three African-centric **Lugha-Llama** models based on [Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B), which achieve the *best performance* among open-source models on
[IrokoBench](https://arxiv.org/abs/2406.03368), a challenging African languages benchmark and [AfriQA](https://arxiv.org/abs/2305.06897), a cross-lingual open-retrieval question answering dataset for African languages (Lugha is the Kiswahili word for "language").
<!-- Provide a quick summary of what the model is/does. -->
All Lugha-Llama models are available on 🤗 [**huggingface hub**](https://huggingface.co/Lugha-Llama).
For the details and findings check this [Lugha-Llama blog post](https://blog.ai.princeton.edu/2025/04/22/lugha-llama-adapting-large-language-models-for-african-languages/). |