We constructed UY/CH-CHILD, a new speech database consisting of 29,061 Chinese samples spoken by 106 Uyghur children.
106 children
We collected speech production data from 106 children aged 4 to 8 in Ili Prefecture, Xinjiang Uyghur Autonomous Region, China. The participating children come from native Uyghur families and attend kindergartens or primary schools where Chinese is the primary language.
29,061 samples
The recorded speech was uploaded to the annotation platform for phonetic labeling. We invited 13 students from the International Cultural Exchange College of Xinjiang University to carry out the labeling. All of the students are native Chinese speakers with considerable experience in and knowledge of Chinese pronunciation. After phonetic labeling, there were 29,061 valid samples in total.
The database is available to universities and research institutes for research purposes only.
To request a copy of the database, please send an email to Prof. Dong Wang.
All the resources contained in the dataset are free for research institutes and individuals. The copyright remains with the original owners of the audio/video. No commercial use is permitted.