Changelog Master Feed

Your one-stop shop for all Changelog podcasts. Weekly shows about software development, developer culture, open source, building startups, artificial intelligence, shipping code to production, and the people involved. Yes, we focus on the people. Everything else is an implementation detail.

https://changelog.com/master

subscribe
share






Democratizing ML for speech (Practical AI #164)


You might know about MLPerf, a benchmark from MLCommons that measures how fast systems can train models to a target quality metric. However, MLCommons is working on so much more! David Kanter joins us in this episode to discuss two new speech datasets that are democratizing machine learning for speech via data scale and language/speaker diversity.

Leave us a comment

Changelog++ members save 3 minutes on this episode because they made the ads disappear. Join today!

Sponsors:

  • Changelog++ – You love our content and you want to take it to the next level by showing your support. We’ll take you closer to the metal with no ads, extended episodes, outtakes, bonus content, a deep discount in our merch store (soon), and more to come. Let’s do this!
  • The Brave Browser – Browse the web up to 8x faster than Chrome and Safari, block ads and trackers by default, and reward your favorite creators with the built-in Basic Attention Token. Download Brave for free and give tipping a try right here on changelog.com.
  • Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com

Featuring:

  • David Kanter – Twitter, GitHub
  • Chris Benson – Twitter, GitHub, LinkedIn, Website
  • Daniel Whitenack – Twitter, GitHub, Website

Show Notes:

  • Press Release about MLCommons datasets: MLCommons™ Association Unveils Open Datasets and Tools to Drive Democratization of Machine Learning
  • NeurIPS Papers:
    • People’s Speech Dataset
    • Multilingual Spoken Words Corpus (MSWC)
  • Gradient article: New Datasets to Democratize Speech Recognition Technology
  • Blog posts for more insight:
    • People’s Speech
    • Multilingual Spoken Words Corpus (MSWC)
  • Downloads:
    • People’s Speech
    • Multilingual Spoken Words Corpus

Something missing or broken? PRs welcome!


fyyd: Podcast Search Engine
share








 January 19, 2022  44m