news

Oct 15, 2025 We released BabyBabelLM: a multilingual benchmark of developmentally plausible training data for 45 languages! Find more here: babylm.github.io/babybabellm