Introduction

👥 📕 Mozilla Common Voice Community Playbook V1.2

Welcome to Common Voice Community 🥳

Mozilla Common Voice is an initiative to help teach machines how real people talk.

Everyone can take part in Common Voice! You don't need technical experience, though if you do, that's great too!

Playbook Chapters:

  • Localization: Translating project tools and material to be understood by contributors in their language

  • Text Corpus: Gathering, validating and processing public domain sentences

  • Voice Corpus: Recording and validating voice clips to create a public domain dataset

  • Language Communities: Learn more about the Communities of Common Voice and how to start your own

What is Common Voice?

Voice is natural, voice is human. That’s why we’re excited about creating usable voice technology for our machines. But to create voice systems, developers need an extremely large amount of voice data.

Most of the data used by large companies isn’t available to the majority of people. We think that stifles innovation. So we’ve launched Common Voice, a project to help make voice recognition open and accessible to everyone.

Language communities advance a language from not having a presence in Common Voice at all to being able to generate a functional Speech to text (STT) model which is able to understand how people speak. Once the Common Voice website is localized by a language community, the process involves the 5 following steps.

  1. Collecting and validating public domain sentences

  2. Recording and validating the recordings of the sentences

  3. Repeating this process to grow the size of the data

  4. Generating a dataset

  5. and using machine learning to train speech-to-text models using this dataset

Goals

Our community playbook is a living document of our communities history and knowledge. After reading the playbook chapters, you will understand:

  • The goals and ethos of the Common Voice project

  • The journey of language onto Common Voice

  • How to set up and maintain a language community as part of Common Voice

TLDR: Too Long Don't Read

To help you quickly navigate the playbook, this list provides you with possible information you would like to have and the associated content.

I would like to....

As you read the playbook....

🔨 Make sure you check the required skills for each section and look for people who can fit.

💬 Check the channels section to learn how to set up your local forums and chat to communicate with other people in your language.

Each chapter covers; Purpose - Who we are - Success - How to join - What we do - Roles - Channels

Common Voice Ecosystem

Participation Guidelines

Common Voice communities are governed by Mozilla's code of conduct and etiquette guidelines, we take this very seriously and no violations are tolerated.

We encourage you to please read Mozilla Community Participation Guidelines before contributing to this project.

For more information on how to report violations of the Community Participation Guidelines, please read our 'How to Report' page.

Governance

Mozilla Foundation stewards the overall Common Voice project and is the ultimate decision-maker for its direction and goals. It also oversees the development of some tools and channels described here to support our communities.

Common Voice language communities are self-organized, and you don’t need to ask for permission to participate or mobilize any of these communities in your language. All the data generated by communities is published under open licences.

Some community roles exist formally and informally, and they all should follow the Mozilla leadership shared agreements.

The Common Voice Team at Mozilla Foundation, share weekly updates to the community on discourse.

Common Voice Communities 🗺

Common Voice has a variety of communities that support the project in different important areas, they are usually grouped by language.

Language Communities

👥 A language's journey onto Common Voice is made possible with communities of multidisciplinary teams of committed people. Roles vary from no coding needed to organizing roles. Our community mobilization resource and Community page can connect you to resources and existing language communities that can support you.

ℹ️ Note: Mozilla welcomes small and minority language communities, and we understand some of these goals may seem out of reach. In that case, feel free to share with us how they are different for you, and we will try to help. Connect with the Common Voice Team on discourse or github issues.

Last updated