Skip to content

tshrinivasan/voice-recorder-for-tawictionary

Repository files navigation

Voice Recorder for Tamil Wiktionary
===================================

Wiktionary.org is a wikipedia project to provide the dictionary service for words.

http://ta.wiktionary.org provides Tamil meaning for English words.


Check this page:
http://ta.wiktionary.org/wiki/tiny

It shows the word, gives its meaning in tamil and has an audio file link, on how to pronounce it.
We can click the file and play it to hear the pronounciation.

Tamil Wiktionary has around one lakh words.
only few words have the audio files to hear them.

To add an audio file in the wiktionary page,
1. see the word
2. read it and record in ogg format
3. upload to wikipedia commons
4. link in the same page for the audio file.

Doing these will be a boring task to do them all manually.

Think of a system, 
which shows you a word, 
you spell it.
it records it as ogg.
uploads to commons
links it to the wiktionary page.

Your only task is to spell it.


We can do this system using python.

Here is the small application which does the above said.

see in action.

Tamil - http://www.youtube.com/watch?v=7a5Iv_ol1So
English - http://www.youtube.com/watch?v=zd4KNbNX4_Y



INSTALL:
========

Read the INSTALL file for installing in ubuntu 14.04 and above or mac.


How to Execute:
==============

1. Fill the words you want to recoed in a textfile named "file"
2. run the below command

python voice-record.py 2> err

this will record the sounds in ogg and wav formats.

3.
To upload all the ogg files to Wikimedia Commons

3a. Edit the file mediawiki-uploader.py 
Fill the commons api url, username and password

3b. run the below command
python mediawiki-uploader.py







Todo:

Link back to the tamil wiktionary page is still pending.





Contact:
========
Shrinivasan T
[email protected]
http://goinggnu.wordpress.com