Questions regarding feature preprocessing, number of CTC outputs #1
Comments
Q2: Just added chars.txt file |
Thanks for your comment and quick reply :) |
@zxie, could you let me know the answer to Q1? I'm stuck at the same step as in @gmkim90's question. ctc/dataLoader.py requires three files: keysOO.txt, alisOO.txt, and featsOO.bin. It would be a great help if you could describe their format. Thank you! |
Hi @jinserk. Yeah, we forgot some Stanford processing scripts after the Kaldi pipeline that makes things easier to just load into numpy. I just e-mailed you a link that has an example of each of the data files. We can also add the code (egs/swbd/s5b/stanford-utils, src/stanford-bin) excluding the main Kaldi codebase, pipeline is tricky to piece together though. @amaas, thoughts? Edit: For future reference can also find some example files using TIMIT training data at: http://deeplearning.stanford.edu/lexfree/timit/ -- just edited README accordingly. |
Hi @zxie and @amaas, I really appreciate your reply and the link. The Kaldi example directories are well organized, so you could add your utils (binaries or bash scripts, I guess) that generate the input files from the Kaldi data to your own stanford-ctc repository, with the necessary Kaldi path specified. I've downloaded the files from the link you mentioned and copied them into
Could you comment on this error? |
Just checked flags w/ some more recently computed features and they seem to match the ones in timit file. Looks like you're loading the model from As fallback, hack, but could just use |
@zxie, thank you for your kind reply. There exist two directories I guess the |
Yes, I'd go with the Thanks for letting us know about other bug; |
@zxie I'd be fine with flipping our Kaldi add-ons public but there's not a On Mon, Sep 21, 2015 at 7:14 PM, zxie [email protected] wrote:
|
@zxie, sorry for the late reply.
|
@zxie, I would also like to know which source code generates feat19.bin. I have downloaded key19.txt and ali19.txt; only feat19.bin is missing. Thank you very much, and I look forward to your reply. |
Hello,
First of all, thanks for providing BLSTM + CTC as open source. I will cite your work in my future paper.
I successfully installed the dependencies (i.e. cudamat, gnumpy, npmat) and built the executables in the ctc folder.
Since TIMIT is a small enough dataset to start with, I tried to run runTimit.sh but got the error message below:
[kenkim@node3 ctc]$ ./runTimit.sh
models/nesterov_layers_2048,2048,2048,2048_step_1e-4_mom_.95_anneal.bin
gnumpy: failed to use gpu_lock. Using board #0 without knowing whether it is in use or not.
Using nesterov..
Traceback (most recent call last):
File "runNNet.py", line 171, in <module>
run()
File "runNNet.py", line 93, in run
data_dict,alis,keys,sizes = loader.loadDataFileDict(i)
File "/home/kenkim/stanford-ctc-master/ctc/dataLoader.py", line 48, in loadDataFileDict
data_mat, alis, keys, sizes = self.loadDataFile(filenum)
File "/home/kenkim/stanford-ctc-master/ctc/dataLoader.py", line 30, in loadDataFile
data = np.fromfile(datafile,np.float32).reshape(-1,self.rawsize)
IOError: [Errno 2] No such file or directory: '/home/kenkim/kaldi-trunk/egs/timit/s5/exp/nomral_nn_train/feats12.bin'
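For anyone hitting the same error: the failing line reads a flat binary file of float32 values and reshapes it into one row per frame. The sketch below reproduces that read on synthetic data so the expected file layout is easier to see. The helper name `load_feats`, the file name `feats_demo.bin`, and `rawsize=4` are illustrative assumptions, not values from the repository, and the existence check is an addition that is not in dataLoader.py.

```python
import os
import tempfile

import numpy as np


def load_feats(path, rawsize):
    """Read a flat float32 file and view it as (num_frames, rawsize).

    Hypothetical helper: it mirrors the np.fromfile(...).reshape(-1, rawsize)
    call in the traceback, plus two explicit sanity checks.
    """
    if not os.path.isfile(path):
        raise IOError("feature file not found: %s" % path)
    data = np.fromfile(path, dtype=np.float32)
    if data.size % rawsize != 0:
        raise ValueError("file size is not a multiple of rawsize=%d" % rawsize)
    return data.reshape(-1, rawsize)


# Round trip with synthetic data; rawsize=4 is an arbitrary illustrative value.
tmp_dir = tempfile.mkdtemp()
demo_path = os.path.join(tmp_dir, "feats_demo.bin")
frames = np.arange(12, dtype=np.float32).reshape(3, 4)
frames.tofile(demo_path)  # raw values only, no header

loaded = load_feats(demo_path, rawsize=4)
print(loaded.shape)
```

Because such a file carries no header, the feature dimension has to be known in advance, and a missing file fails exactly as in the traceback above.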
Here are my questions.
Q1) Which source code generates feats12.bin?
Q2) In your paper, you use 33 characters as CTC output symbols.
Can you clarify exactly what they are? (i.e. alphabet(26) + blank(1) + something(6))
Q3) How did you preprocess non-character tokens in the Switchboard transcriptions?
For example, 747, &, 20/20, _1_1, etc.
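To make Q2 and Q3 concrete, here is a minimal sketch of one way to map a transcript to CTC label indices. The symbol set below (lowercase letters, space, apostrophe) is purely a hypothetical choice for illustration; it is not the 33-symbol set from the paper, and dropping unsupported characters is just one possible policy, not necessarily what the authors did.

```python
import string

BLANK = 0  # index reserved for the CTC blank in this sketch

# Hypothetical symbol set: 26 lowercase letters, space, apostrophe.
SYMBOLS = list(string.ascii_lowercase) + [" ", "'"]
CHAR_TO_ID = {ch: i + 1 for i, ch in enumerate(SYMBOLS)}


def encode(transcript):
    """Lowercase the text and keep only characters present in CHAR_TO_ID.

    Digits and symbols such as '&' or '/' are silently dropped here; a real
    pipeline might instead spell them out or map them to dedicated symbols.
    """
    return [CHAR_TO_ID[ch] for ch in transcript.lower() if ch in CHAR_TO_ID]


labels = encode("It's 20/20 & fine")
print(labels)
```

With this policy, dropped tokens leave runs of consecutive spaces in the label sequence, which is one reason the exact handling of tokens like 747 or 20/20 matters.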