Leverage High-Quality Children’s Speech Data to Train AI Models
Recently, researchers tested the speech recognition capabilities of several voice assistants on the market. They found that voice assistants, including Amazon Echo, Google Home, and other devices, made recognition errors when interacting with children.
Unlike adult speech, children’s speech poses inherent technical difficulties because of its vocal and pronunciation characteristics. More importantly, children are not good at interacting with voice assistants in ways that machines can understand. Even with a friendlier interface or a more intelligent voice assistant, recognition results remain unsatisfactory.
High-quality children’s speech data is therefore essential for training a smarter voice assistant. As a professional AI data services provider, Datatang has accumulated 4,000 hours of high-quality children’s speech data to support the research and application of children’s voice-interactive products.
Audio data of Chinese children captured on mobile phones, with a total duration of 3,255 hours. The 9,780 speakers are children aged 6 to 12, with accents covering seven dialect areas. The recorded text covers common children’s language such as essays, stories, and numbers, as well as interactions in cars, at home, and with voice assistants, closely matching actual application scenarios.
Audio data of children reading English, covering ages from preschool (3–5 years old) to school age (6–12 years old). The recordings reflect children’s speech characteristics, and the content closely matches the real situations in which children speak English. The data supports children’s smart home products, automatic speech recognition, and oral assessment in intelligent education scenarios.
One set was recorded by 219 American children who are native speakers. The recording texts are mainly storybooks, children’s songs, spoken expressions, etc., with 350 sentences per speaker. Each sentence contains 4.5 words on average and is repeated 2.1 times on average.
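As a rough sense of scale, the figures quoted for the American recordings can be multiplied out. This is only a back-of-the-envelope sketch in Python: it assumes every speaker recorded the full 350 sentences and that the 4.5-word average applies across all of them. The function name `corpus_size` is hypothetical and not part of any Datatang tooling.

```python
# Figures quoted in the text for the American children's recordings.
SPEAKERS = 219
SENTENCES_PER_SPEAKER = 350
AVG_WORDS_PER_SENTENCE = 4.5


def corpus_size(speakers, sentences_per_speaker, avg_words):
    """Estimate total recorded sentences and words.

    Assumption: every speaker recorded the full sentence count and the
    average sentence length holds across the whole set.
    """
    total_sentences = speakers * sentences_per_speaker
    total_words = total_sentences * avg_words
    return total_sentences, total_words


total_sentences, total_words = corpus_size(
    SPEAKERS, SENTENCES_PER_SPEAKER, AVG_WORDS_PER_SENTENCE
)
print(f"{total_sentences:,} recorded sentences, about {total_words:,.0f} words")
```

Under these assumptions the set comes to roughly 76,650 recorded sentences. The actual totals may differ if some speakers recorded fewer sentences.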
Another set was collected from 201 British children. The recording texts are mainly children’s textbooks and storybooks. The average sentence length is 4.68 words, and each sentence is repeated 6.6 times on average. This data was recorded with a high-fidelity microphone.
If the above data cannot meet the needs of your current research, Datatang also provides data customization services for specific groups of people, specific scenarios, and specific languages to meet customers’ diversified data needs.
If you need data services, please feel free to contact us: firstname.lastname@example.org