[2603.29552] Bringing Up a Bilingual BabyLM: Investigating Multilingual Language Acquisition Using Small-Scale Models
Computer Science > Computation and Language
arXiv:2603.29552 (cs)
[Submitted on 31 Mar 2026]

Title: Bringing Up a Bilingual BabyLM: Investigating Multilingual Language Acquisition Using Small-Scale Models
Authors: Linda Zeng, Steven Y. Feng, Michael C. Frank

Abstract: Multilingualism is incredibly common around the world, leading to many important theoretical and practical questions about how children learn multiple languages at once. For example, does multilingual acquisition lead to delays in learning? Are there better and worse ways to structure multilingual input? Many correlational studies address these questions, but it is surprisingly difficult to get definitive answers because children cannot be randomly assigned to be multilingual and data are typically not matched between languages. We use language model training as a method for simulating a variety of highly controlled exposure conditions, and create matched 100M-word mono- and bilingual datasets using synthetic data and machine translation. We train GPT-2 models on monolingual and bilingual data organized to reflect a range of exposure regimes, and evaluate their performance on perplexity, grammaticality, and semantic knowledge. Across model scales and measures, bilingual models perform similarly to monolingual models...