MaLA-LM: Massive Language Adaptation of Large Language Models

Welcome to MaLA-LM (Massive Language Adaptation of Large Language Models)! 🌍

MaLA-LM focuses on adapting large language models to support hundreds of languages, including many underrepresented ones. Our models are multilingual, scalable, and optimized for diverse linguistic tasks. We work on data construction (e.g., the MaLA corpus and PolyWrite), continual pretraining (e.g., EMMA-500, MaLA-500, and MixCPT), instruction fine-tuning (e.g., monolingual vs. multilingual Alpaca, and Lucky 52), and evaluation (e.g., GlotEval).

Featured 🗣️

Check out our multilingual LLM collections, featuring models trained to handle 500+ languages, ideal for global, multilingual applications.

Dive into the HuggingFace collections: EMMA-500 | MaLA corpus | MaLA-500
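
The models in these collections can be loaded with the standard Hugging Face `transformers` API. Below is a minimal sketch; the model id `MaLA-LM/emma-500-llama2-7b` and the example prompt are assumptions for illustration, so substitute the exact repository id from the collection you want to use.

```python
# Minimal sketch: load a MaLA-LM model from the Hugging Face Hub and generate text.
# The model id below is an assumption; check the EMMA-500 collection for the exact id.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "MaLA-LM/emma-500-llama2-7b"  # assumed id, replace as needed

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "Translate to Swahili: Good morning, friends."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```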

Continual Pretraining 📜

EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models

MaLA-500: Massive Language Adaptation of Large Language Models

Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources

GlotEval 🛠️

GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models