Leverage High-Quality Children Speech Data to Train AI Models

Recently, researchers tested the speech recognition capabilities of several voice assistants on the market. They found that voice assistants, including Amazon Echo, Google Home, and other devices, made recognition errors when interacting with children.

Unlike adult speech, children’s speech poses inherent technical difficulties because of its voice and pronunciation characteristics. More importantly, children are not good at interacting with voice assistants in a way that machines can understand. Whether a product offers a friendlier interactive interface or a more intelligent voice assistant, the recognition results remain unsatisfactory.

High-quality children’s speech data is clearly essential for training a smarter voice assistant. As a professional AI data services provider, Datatang has accumulated 4,000 hours of high-quality children’s speech data to support the research and application of children’s voice interaction products.

Chinese Children Speech Data

Audio data of Chinese children captured by mobile phone, with a total duration of 3,255 hours. The 9,780 speakers are children aged 6 to 12, with accents covering seven dialect areas. The recorded text contains language common among children, such as essays, stories, numbers, and interactions in cars, at home, and with voice assistants, closely matching actual application scenarios.

Chinese Children Speaking English Speech Data

Audio data of children reading English, covering ages from preschool (3–5 years old) to school age (6–12 years old). The recordings reflect children’s speech characteristics, and the content closely matches real scenarios in which children speak English. The data supports children’s smart home products, automatic speech recognition, and oral assessment in intelligent education scenarios.

American Children Speech Data

Recorded by 219 American children who are native speakers. The recording texts are mainly storybooks, children’s songs, spoken expressions, etc., with 350 sentences per speaker. Each sentence contains 4.5 words on average and is repeated 2.1 times on average.
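To get a rough sense of scale from the figures above, the following Python sketch estimates the total number of recorded sentences and words. It assumes every speaker recorded exactly 350 sentences and that the 4.5-word average holds across all of them; the function name `estimate_corpus_size` is made up for illustration and is not part of any Datatang tooling.

```python
# Figures quoted in the American Children Speech Data description.
SPEAKERS = 219
SENTENCES_PER_SPEAKER = 350
AVG_WORDS_PER_SENTENCE = 4.5


def estimate_corpus_size(speakers, sentences_per_speaker, avg_words):
    """Return (total_sentences, approx_total_words) for a read-speech corpus.

    Assumption: every speaker recorded the same number of sentences and the
    average sentence length applies uniformly to all of them.
    """
    total_sentences = speakers * sentences_per_speaker
    approx_total_words = total_sentences * avg_words
    return total_sentences, approx_total_words


total_sentences, approx_words = estimate_corpus_size(
    SPEAKERS, SENTENCES_PER_SPEAKER, AVG_WORDS_PER_SENTENCE
)
print(f"{total_sentences:,} recorded sentences, roughly {approx_words:,.0f} words")
# prints: 76,650 recorded sentences, roughly 344,925 words
```

These are back-of-the-envelope estimates only; the actual delivered dataset may differ.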

British Children Speech Data

Collected from 201 British children. The recording texts are mainly children’s textbooks and storybooks. The average sentence length is 4.68 words, and each sentence is repeated 6.6 times on average. The data was recorded with a high-fidelity microphone.

If the data above does not meet the needs of your current research, Datatang also provides data customization services for specific groups of people, specific scenarios, and specific languages to meet customers’ diverse data needs.


If you need data services, please feel free to contact us: info@datatang.com

Off-the-shelf AI training data, on-demand data collection & annotation services
