All Voices is a community-driven platform for collecting and validating speech data across African languages. It enables native speakers, researchers, and institutions to contribute high-quality voice recordings that help train AI systems like Mansa.


Many African languages lack the datasets required to build reliable language technologies. By recording speech, transcribing audio, and validating samples, contributors help create the data needed to train these technologies.

All Voices enables contributors to collect and validate speech and text data for African languages, creating datasets for AI training. Key features:
Structured Data Collection: Record speech, translate text, transcribe audio, and add speaker or dialect info.
Language Tasks: Transcribe speech and translate text to generate additional data.
Validation & Quality: Review and rate contributions to ensure accuracy.
Validated data supports speech recognition, translation, and transcription systems.
Contributors listen to recordings and convert them into text.
Users type translations of short prompts in their selected language.
Users read and record short prompts in their language.
Users review text or audio samples and rate their quality or accuracy.
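
The review step above can be illustrated with a minimal sketch of how reviewer ratings might decide whether a sample counts as validated. Everything here is an assumption made for illustration: the `Contribution` fields, the 1 to 5 rating scale, and the `min_reviews` and `min_mean` thresholds are hypothetical and do not describe how All Voices actually stores or scores data.

```python
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class Contribution:
    """One contributed sample (hypothetical schema, not the platform's own)."""
    task: str        # assumed task names: "record", "transcribe", "translate"
    language: str    # language code, e.g. "yo"
    content: str     # transcript or translation text, or a path to an audio clip
    ratings: list[int] = field(default_factory=list)  # reviewer scores, assumed 1-5


def is_validated(c: Contribution, min_reviews: int = 2, min_mean: float = 4.0) -> bool:
    """Accept a sample once enough reviewers rated it and the mean score is high enough.

    Both thresholds are illustrative defaults, not documented platform values.
    """
    return len(c.ratings) >= min_reviews and mean(c.ratings) >= min_mean


sample = Contribution(task="transcribe", language="yo", content="Bawo ni")
sample.ratings.extend([5, 4])  # two reviewers rate the transcript
print(is_validated(sample))
```

Requiring more than one review before accepting a sample is one simple way to keep a single mistaken rating from admitting or rejecting data on its own.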