Changelog Master Feed

Your one-stop shop for all Changelog podcasts. Weekly shows about software development, developer culture, open source, building startups, artificial intelligence, shipping code to production, and the people involved. Yes, we focus on the people. Everything else is an implementation detail.

https://changelog.com/master

subscribe
share






Speech tech and Common Voice at Mozilla (Practical AI #104)


Many people are excited about creating usable speech technology. However, most of the audio data used by large companies isn’t available to the majority of people, and that data is often biased in terms of language, accent, and gender. Jenny, Josh, and Remy from Mozilla join us to discuss how Mozilla is building an open-source voice database that anyone can use to make innovative apps for devices and the web (Common Voice). They also discuss efforts through Mozilla fellowship program to develop speech tech for African languages and understand bias in data sets.

Leave us a comment

Changelog++ members get a bonus 2 minutes at the end of this episode and zero ads. Join today!

Sponsors:

  • Linode – Our cloud of choice and the home of Changelog.com. Deploy a fast, efficient, native SSD cloud server for only $5/month. Get 4 months free using the code changelog2019 OR changelog2020. To learn more and get started head to linode.com/changelog.
  • Pace.dev – Minimalist web based management tool for your teams. Async by default communication and simplistic task management gives you everything you need to build your next thing. Brought to you by Go Time panelist Mat Ryer. Try it out today!
  • Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com.
  • Rollbar – We move fast and fix things because of Rollbar. Resolve errors in minutes. Deploy with confidence. Learn more at rollbar.com/changelog.

Featuring:

  • Jenny Zhang – Twitter, Website
  • Remy Muhire – Twitter, GitHub
  • Josh Meyer – Twitter, GitHub
  • Chris Benson – Twitter, GitHub, LinkedIn, Website
  • Daniel Whitenack – Twitter, GitHub, Website

Show Notes:

  • Mozilla Common Voice
  • Announcement of Josh and Remy’s fellowship work on speech tech for African languages
  • Artie Bias Corpus
  • Readings on Demographic Bias in ASR:
    • Voice recognition still has significant race and gender biases
    • Gender and Dialect Bias in YouTube’s Automatic Captions
    • Racial disparities in automated speech recognition
  • Common Voice LREC Paper
  • Common Voice + DeepSpeech collaborators for Low-resource languages:
    • Digital Umuganda
    • AI Lab, Makerere University
    • Language Technologies Unit, Bangor University
    • Linguistics Department, Indiana University Bloomington
  • “under-sampled majority” is a quote from Joy Boulamwini (see this article)

Something missing or broken? PRs welcome!


fyyd: Podcast Search Engine
share








 September 9, 2020  58m