TechCrunch CENTER (0.0) Reliability: 7.5/10

MLCommons and Hugging Face team up to release massive speech dataset for AI research

✍️ Kyle Wiggers, Ai Editor, Marina Temkin, Connie Loizos, Sarah Perez, --C-Author-Card-Image-Size Align-Items Center Display Flex Gap Var, Media, Min-Width, --C-Author-Card-Image-Size, Img.Wp-Block-Tc_Author-Card__Image Height Var --C-Author-Card-Image-Size 📅 January 31, 2025 12:00 AM 🕐 Scraped: October 01, 2025

Summary: The nonprofit AI safety org MLCommons has teamed up with Hugging Face to release a public domain dataset of speech recordings.

MLCommons, a nonprofit AI safety working group, has teamed up with AI dev platform Hugging Face to release one of the world’s largest collections of public domain voice recordings for AI research.

The dataset, called Unsupervised People’s Speech, contains more than a million hours of audio spanning at least 89 languages. MLCommons says it was motivated to create it by a desire to support R&D in “various areas of speech technology.”

“Supporting broader natural language processing research for languages other than English helps bring communication technologies to more people globally,” the organization wrote in a b

📰

Continue Reading on TechCrunch

This preview shows approximately 15% of the article. Read the full story on the publisher's website to support quality journalism.

Read Full Article →

How do you feel about this article?

Read Original Article →

📝 Article Information

Author

Kyle Wiggers, Ai Editor, Marina Temkin, Connie Loizos, Sarah Perez, --C-Author-Card-Image-Size Align-Items Center Display Flex Gap Var, Media, Min-Width, --C-Author-Card-Image-Size, Img.Wp-Block-Tc_Author-Card__Image Height Var --C-Author-Card-Image-Size

Original Publisher

🌍 TechCrunch

CENTER Bias (0.0)

Published Date

January 31, 2025 at 12:00 AM

ℹ️

Disclaimer

ArkforgeAI.com - Newsource is a news aggregation platform. We do not create, write, or produce original news content. All articles are sourced from their respective publishers and remain the property of their original creators. We analyze and aggregate news from multiple sources to provide diverse perspectives across the political spectrum.

The content displayed here is for informational purposes only. For the complete and original article, please visit the source's website using the "Read Original Article" link above.

Desktop Only