Lugha-Llama: Adapting Large Language Models for African Languages

Authors: Happy Buzaaba, Alexander Wettig, David Ifeoluwa Adelani, Christiane Fellbaum

Low-resource african languages remain underrepresented in the large training datasets of large language models (LLMs) and, as a result, LLMs struggle to understand these languages. We are releasing three African-centric Lugha-Llama models based on Llama-3.1-8B, which achieve the best performance among open-source models on IrokoBench, a challenging African languages benchmark and AfriQA, a cross-lingual open-retrieval question answering dataset for African languages (Lugha is the Kiswahili word for "language").

All Lugha-Llama models are available on ๐Ÿค— huggingface hub.

For the details and findings check this Lugha-Llama blog post.

Downloads last month
42
Safetensors
Model size
8.03B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support