Frame Level Speech Recognition

In this project we apply feedforward neural networks to speech recognition. The dataset consists of audio recordings (utterances) with a phoneme state (subphoneme) label for each frame. The recordings are Wall Street Journal (WSJ) articles that were read aloud and labeled using the original text. The task is to predict the phoneme state label for every frame in the test dataset. Note that utterances vary in length.
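
Because utterances vary in length while a feedforward network expects fixed-size inputs, one common approach is to treat every frame as an independent training example. The following is a minimal sketch of that idea, not the code in this repository: the function name `flatten_utterances`, the toy feature dimension of 2, and the label values are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

# One feature vector per frame; the real feature dimension is not stated
# in this README, so 2 is used below purely for illustration.
Frame = Sequence[float]


def flatten_utterances(
    utterances: List[List[Frame]],
    labels: List[List[int]],
) -> Tuple[List[Frame], List[int]]:
    """Turn variable-length utterances into flat, aligned frame/label lists."""
    flat_frames: List[Frame] = []
    flat_labels: List[int] = []
    for utt_idx, (frames, frame_labels) in enumerate(zip(utterances, labels)):
        # Each frame must have exactly one phoneme state label.
        if len(frames) != len(frame_labels):
            raise ValueError(
                f"utterance {utt_idx}: {len(frames)} frames "
                f"but {len(frame_labels)} labels"
            )
        flat_frames.extend(frames)
        flat_labels.extend(frame_labels)
    return flat_frames, flat_labels


# Toy data (hypothetical): two utterances of different lengths.
utterances = [
    [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]],  # 3 frames
    [[0.7, 0.8]],                          # 1 frame
]
labels = [[5, 5, 6], [2]]

frames, targets = flatten_utterances(utterances, labels)
print(len(frames), targets)
```

After flattening, the length of an utterance no longer matters to the model: each entry of `frames` is one fixed-size input and the matching entry of `targets` is its phoneme state label.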

Files:
README.md
dataloaders.py
main.py
models.py
utils.py

moayad-hsn/frame_level_asr

Folders and files

Latest commit

History

Repository files navigation

Frame Level Speech Recognition

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages