BabyBabelLM

We release BabyBabelLM: a multilingual benchmark of developmentally plausible training data for 45 languages! Find out more here: babylm.github.io/babybabellm