WAXAL is an open-source speech database designed to support the development of voice-based artificial intelligence for African languages.WAXAL is an open-source speech database designed to support the development of voice-based artificial intelligence for African languages.

Google joins push to localise AI for African languages with speech database

3 min read

Google has collaborated with African universities and research institutions to launch WAXAL, an open-source speech database designed to support the development of voice-based artificial intelligence for African languages. 

African institutions, including Makerere University in Uganda, the University of Ghana, Digital Umuganda in Rwanda, and the African Institute for Mathematical Sciences (AIMS), participated in the data collection for this initiative. The dataset provides foundational data for 21 Sub-Saharan African languages, including Hausa, Luganda, Yoruba, and Acholi.

WAXAL is designed to support the development of speech recognition systems, voice assistants, text-to-speech tools, and other voice-enabled applications across sectors such as education, healthcare, agriculture, and public services.

“This dataset provides the critical foundation for students, researchers, and entrepreneurs to build technology on their own terms, in their own languages,” said Aisha Walcott-Bryantt, Head of Google Research Africa

WAXAL’s launch comes amid growing efforts across Africa to develop language technologies that reflect local cultures and realities. 

In September 2025, the Nigerian government unveiled N-ATLAS, an open-source language model capable of recognising and transcribing spoken words and generating text, in Yoruba, Hausa, Igbo, and Nigerian-accented English. 

Similar initiatives are emerging in the private sector, where startups such as  South Africa’s Lelapa AI are building tools like Vulavula, which offers speech recognition, translation, and sentiment analysis. 

By making this speech dataset openly accessible, WAXAL provides the fuel for a growing wave of homegrown efforts to bring African languages into the digital age.

Although Sub-Saharan Africa is home to more than 2,000 languages, reports suggest that fewer than 5% of those languages have the resources needed for Natural Language Processing (NLP), which allows computers to understand and comprehend human language. This lack of representation in training datasets limits the effectiveness of speech recognition and text-to-speech systems for African users.  

Developed over three years with funding and technical support from Google, WAXAL addresses a major gap in global AI development.

WAXAL provides speech data for 21 Sub-Saharan African languages, including Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Swahili, and Yoruba. The dataset contains more than 11,000 hours of speech drawn from nearly two million individual recordings. 

Under the project’s partnership model, contributing institutions retain ownership of the data they collected, while making it openly available to researchers and developers worldwide.

“For AI to have a real impact in Africa, it must speak our languages and understand our contexts,” Joyce Nakatumba-Nabende, Senior Lecturer at Makerere University’s School of Computing and Information Technology, said. 

“The WAXAL dataset gives our researchers the high-quality data they need to build speech technologies that reflect our unique communities.”

Get The Best African Tech Newsletters In Your Inbox

Subscribe
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact service@support.mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Markets await Fed’s first 2025 cut, experts bet “this bull market is not even close to over”

Markets await Fed’s first 2025 cut, experts bet “this bull market is not even close to over”

Will the Fed’s first rate cut of 2025 fuel another leg higher for Bitcoin and equities, or does September’s history point to caution? First rate cut of 2025 set against a fragile backdrop The Federal Reserve is widely expected to…
Share
Crypto.news2025/09/18 00:27
Trump Owns $870 Million Bitcoin Amid Crypto Market Meltdown

Trump Owns $870 Million Bitcoin Amid Crypto Market Meltdown

The post Trump Owns $870 Million Bitcoin Amid Crypto Market Meltdown appeared on BitcoinEthereumNews.com. President Donald Trump has quietly become one of the world’s largest Bitcoin (BTC) holders, even as the crypto market faces a historic meltdown. The revelation comes as Bitcoin and the broader crypto market struggle through one of their steepest declines in recent years. Trump Media’s $2 Billion Bitcoin Bet Makes President A Major Investors According to a Forbes report, Trump’s indirect Bitcoin exposure is now valued at around $870 million, placing him among the biggest investors in the digital asset space. Despite the crash, Trump’s holdings remain strong, showing his business’ growing ties to the crypto market. Forbes found that Trump’s holdings are not listed in any official government filings or financial disclosures. Instead, his exposure comes through his 41% stake in Trump Media and Technology Group, the parent company of Truth Social. Earlier this year, Trump Media raised $2.3 billion through debt and stock sales, using most of the proceeds to buy $2 billion worth of Bitcoin. The move aligns with MicroStrategy’s renewed interest in buying Bitcoin after not buying any last week. That move gave Trump a massive indirect stake in the world’s largest cryptocurrency. Trump Media’s Bitcoin Strategy Shows Trump’s Shift From Crypto Disbelief When the company chose to start holding BTC on its balance sheet, it represented a radical turning point from just being a social media company. Through the adoption of the same corporate treasury technique popularized by Michael Saylor’s Strategy Inc., Trump Media has become a U.S. company holding large amounts of Bitcoin. This shift mirrors the growing wave of institutional adoption. Recently, trillion-dollar asset manager Morgan Stanley opened crypto investments to all its wealth clients. According to Forbes, the company’s overall evaluation has fallen since its Bitcoin purchase. However, its Bitcoin reserves now make up the strongest part of its balance sheet. Trump’s…
Share
BitcoinEthereumNews2025/10/13 05:12
Trump Denies Involvement in $500M Abu Dhabi WLFI Stake

Trump Denies Involvement in $500M Abu Dhabi WLFI Stake

The post Trump Denies Involvement in $500M Abu Dhabi WLFI Stake appeared on BitcoinEthereumNews.com. US President Donald Trump has denied knowledge of a reported
Share
BitcoinEthereumNews2026/02/03 23:26