35-Hours-Pinyin-Annotation-Speech-Data-of-Audio-Book-Text

Description

Audiobook annotated pinyin audio data, with duration of 35 hours; 5 speakers are recorded including 3 males and 2 females; Chinese characters and pinyin are annotated, including the tone of pinyin; this data set can be used for automatic speech recognition, machine translation, and voiceprint recognition.

For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/243?source=Github

Format

44.1kHz, 16bit, uncompressed wav, mono channel

Environment

Relatively quiet environment

Recording Content

Audio books, including five categories like beautiful essays, novel, logical thinking, children's story, and Twenty Years in Late Qing Dynasty.

People

5 people in total and 3 males and 2 females

Language

Mandarin

Application Scenario

Voice Recognition, Voice Print Recognition

Annotation Feature

Annotating audio data with Chinese and Pinyin.

Licensing Information

Commercial License

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
G0001S00001.txt		G0001S00001.txt
G0001S00001.wav		G0001S00001.wav
G0001S00002.txt		G0001S00002.txt
G0001S00002.wav		G0001S00002.wav
G0001S00003.txt		G0001S00003.txt
G0001S00003.wav		G0001S00003.wav
G0001S00004.txt		G0001S00004.txt
G0001S00004.wav		G0001S00004.wav
G0001S00005.txt		G0001S00005.txt
G0001S00005.wav		G0001S00005.wav
G0001S00006.txt		G0001S00006.txt
G0001S00006.wav		G0001S00006.wav
G0001S00007.txt		G0001S00007.txt
G0001S00007.wav		G0001S00007.wav
G0002S00001.txt		G0002S00001.txt
G0002S00001.wav		G0002S00001.wav
G0002S00002.txt		G0002S00002.txt
G0002S00002.wav		G0002S00002.wav
G0002S00003.txt		G0002S00003.txt
G0002S00003.wav		G0002S00003.wav
G0002S00004.txt		G0002S00004.txt
G0002S00004.wav		G0002S00004.wav
G0002S00005.txt		G0002S00005.txt
G0002S00005.wav		G0002S00005.wav
G0002S00006.txt		G0002S00006.txt
G0002S00006.wav		G0002S00006.wav
G0002S00007.txt		G0002S00007.txt
G0002S00007.wav		G0002S00007.wav
G0002S00008.txt		G0002S00008.txt
G0002S00008.wav		G0002S00008.wav
G0004S00001.txt		G0004S00001.txt
G0004S00001.wav		G0004S00001.wav
G0004S00002.txt		G0004S00002.txt
G0004S00002.wav		G0004S00002.wav
G0004S00003.txt		G0004S00003.txt
G0004S00003.wav		G0004S00003.wav
G0004S00004.txt		G0004S00004.txt
G0004S00004.wav		G0004S00004.wav
G0004S00005.txt		G0004S00005.txt
G0004S00005.wav		G0004S00005.wav
G0004S00006.txt		G0004S00006.txt
G0004S00006.wav		G0004S00006.wav
G0004S00007.txt		G0004S00007.txt
G0004S00007.wav		G0004S00007.wav
G0005S00001.txt		G0005S00001.txt
G0005S00001.wav		G0005S00001.wav
G0005S00002.txt		G0005S00002.txt
G0005S00002.wav		G0005S00002.wav
G0005S00003.txt		G0005S00003.txt
G0005S00003.wav		G0005S00003.wav
G0005S00004.txt		G0005S00004.txt
G0005S00004.wav		G0005S00004.wav
G0005S00005.txt		G0005S00005.txt
G0005S00005.wav		G0005S00005.wav
G0005S00006.txt		G0005S00006.txt
G0005S00006.wav		G0005S00006.wav
G0005S00007.txt		G0005S00007.txt
G0005S00007.wav		G0005S00007.wav
G0005S00008.txt		G0005S00008.txt
G0005S00008.wav		G0005S00008.wav
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

35-Hours-Pinyin-Annotation-Speech-Data-of-Audio-Book-Text

Description

Format

Environment

Recording Content

People

Language

Application Scenario

Annotation Feature

Licensing Information

About

Releases

Packages

Contributors 2

Nexdata-AI/35-Hours-Pinyin-Annotation-Speech-Data-of-Audio-Book-Text

Folders and files

Latest commit

History

Repository files navigation

35-Hours-Pinyin-Annotation-Speech-Data-of-Audio-Book-Text

Description

Format

Environment

Recording Content

People

Language

Application Scenario

Annotation Feature

Licensing Information

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages