Name		Name	Last commit message	Last commit date
parent directory ..
local		local
pruned_transducer_stateless5		pruned_transducer_stateless5
README.md		README.md
RESULTS.md		RESULTS.md
prepare.sh		prepare.sh
shared		shared

README.md

Introduction

This recipe contains some various ASR models trained with Aishell4 (including S, M and L three subsets).

The AISHELL-4 is a sizable real-recorded Mandarin speech dataset collected by 8-channel circular microphone array for speech processing in conference scenarios. The dataset consists of 211 recorded meeting sessions, each containing 4 to 8 speakers, with a total length of 120 hours. This dataset aims to bridge the advanced research on multi-speaker processing and the practical application scenario in three aspects. With real recorded meetings, AISHELL-4 provides realistic acoustics and rich natural speech characteristics in conversation such as short pause, speech overlap, quick speaker turn, noise, etc. Meanwhile, the accurate transcription and speaker voice activity are provided for each meeting in AISHELL-4. This allows the researchers to explore different aspects in meeting processing, ranging from individual tasks such as speech front-end processing, speech recognition and speaker diarization, to multi-modality modeling and joint optimization of relevant tasks.

(From Open Speech and Language Resources)

./RESULTS.md contains the latest results.

Transducers

There are various folders containing the name transducer in this folder. The following table lists the differences among them.

	Encoder	Decoder	Comment
`pruned_transducer_stateless5`	Conformer(modified)	Embedding + Conv1d	Using k2 pruned RNN-T loss

The decoder in transducer_stateless is modified from the paper Rnn-Transducer with Stateless Prediction Network. We place an additional Conv1d layer right after the input embedding layer.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ASR

ASR

README.md

Introduction

Transducers

Files

ASR

Directory actions

More options

Directory actions

More options

Latest commit

History

ASR

Folders and files

parent directory

README.md

Introduction

Transducers