Google has partnered with leading African research institutions to launch WAXAL, a large-scale open speech dataset designed to expand access to artificial intelligence (AI) for more than 100 million people across Sub-Saharan Africa.
Developed over three years with Google’s support, WAXAL includes 1,250 hours of transcribed natural speech and more than 20 hours of studio-quality recordings, enabling the creation of accurate voice recognition systems and synthetic speech tools.
The initiative aims to close a major digital gap by providing foundational speech data for 21 African languages, including Hausa, Yoruba, Luganda, and Acholi.
Despite the rapid global growth of voice-enabled technologies, most African languages remain underserved due to limited high-quality speech datasets, leaving millions unable to access digital tools in their native tongues.
Read also: Google Maps goes hands-free with Gemini for walkers, cyclists
Aisha Walcott-Bryant, head of Google Research Africa, stated that the project is designed to empower African students, researchers, and entrepreneurs to build technologies in local languages and unlock economic opportunities across the continent.
The data collection was led by African institutions, including Makerere University (Uganda), University of Ghana, and Digital Umuganda (Rwanda), which retain full ownership of the dataset setting a new standard for equitable and community-led AI development.
Read also: Google unveils tool for seamless Gmail address changes without data compromise
By supporting research, education, and innovation, WAXAL is expected to accelerate the creation of voice-enabled tools in areas such as education, healthcare, financial services, and agriculture by helping millions better access essential digital services.
Join BusinessDay whatsapp Channel, to stay up to date
Open In Whatsapp
