Plus, making sense of the Baltic cable incidents and state takeovers of school districts.
View in browser
Brookings Brief

October 30, 2023

People are seen writing various messages in Bengali on a road to commemorate International Mother Language Day
How language gaps constrain generative AI development

 

There are over 7,000 languages spoken worldwide, yet content on the internet is primarily written in English and a small group of other languages. This will have an impact on the development of generative artificial intelligence (AI) tools, which are often trained on internet data. Without interventions to mitigate this, access to these tools may be limited to those who speak a few data-rich languages like English, Spanish, and Mandarin.

 

“The language we speak determines how we engage with the world, as well as which worlds we can participate in,” write Regina Ta and Nicol Turner Lee. They argue that building inclusive digital ecosystems will require bridging the digital language divide by ensuring greater linguistic representation in AI training data.

Read more
 

More research and commentary

 

Making sense of the Baltic cable incidents. Bruce Jones highlights two events that show the importance of constant monitoring of vital undersea infrastructure, and of early attribution of attacks or disruptive actions.

 

When states take the reins in school districts. State takeovers of school districts have happened in all major U.S. regions, but they disproportionately affect some types of communities more than others. Beth Schueler, Melissa Arnold Lyon, and Joshua Bleiberg examine whether these takeovers are effective.

 
Twitter
Facebook
Instagram
LinkedIn
Brookings

The Brookings Institution,1775 Massachusetts Ave NW, Washington, DC 20036

Unsubscribe | Manage newsletter subscriptions 

The conclusions and recommendations of any Brookings publication are solely those of its author(s), and do not reflect the views of the Institution, its management, or its other scholars.