Audiobook annotated pinyin audio data, with duration of 35 hours; 5 speakers are recorded including 3 males and 2 females; Chinese characters and pinyin are annotated, including the tone of pinyin; this data set can be used for automatic speech recognition, machine translation, and voiceprint recognition.
For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/243?source=Github
44.1kHz, 16bit, uncompressed wav, mono channel
Relatively quiet environment
Audio books, including five categories like beautiful essays, novel, logical thinking, children's story, and Twenty Years in Late Qing Dynasty.
5 people in total and 3 males and 2 females
Mandarin
Voice Recognition, Voice Print Recognition
Annotating audio data with Chinese and Pinyin.
Commercial License