Skip to content

Scribal is an online education platform that also crowd sources transcriptions for the hearing impaired.

Notifications You must be signed in to change notification settings

wysesean/Scribal

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scribal

Scribal is an online education platform that rehosts videos taken down by a recent lawsuit against Berkeley for not being ADA compliant. Scribal not only rehosts them but makes them ADA compliant by transcribing the educational videos on the platform through crowd-sourcing.

screenshot2

How it works:

On the front end:
Main Features:
  • Users can pick and enroll in a course from a categorized selection
  • Before users are allowed to watch a video for that course, a popup comes up forcing them to transcribe a randomly selected 7 second clip.
  • Once the clip is transcribed it gets stored in a database where it will then be analyzed.
  • The user can then proceed to their video.
  • Process repeats for all other videos.
Other Features:
  • Users can track their videos watched.
  • Transcriptions are always available to be used even if they are incomplete. Just check the options for a video and use them if you'd like. A completion % and confidence % is provided. screenshot1
On the back end:
Clips
  • Clips are 7 second consecutive segments for the length of the video. So if a video is 35 seconds long, it will have 5 clips.
    • First clip covering an offset of 0 to 7 seconds.
    • Second clip covering an offset of 7 to 14 seconds.
    • etc...
  • To ensure that transcriptions cover words that are cut off at the boundary of clips, a second set of clips is generated. This set is offset by 2 seconds.
    • First clip in second set covers an offset of 2 to 9 seconds.
    • Second clip in second set covers an offset of 9 to 16 seconds.
    • etc
  • When it comes time to generate transcriptions, the first clipset is merged with the second clip set accordingly.
Input Filter
  • User inputs for random clip transcriptions are stored in a database.
  • A random clip can be served up multiple times to different users and each different user's input is stored.
  • With a collection of inputs for a clip, the levenshtein distance (or edit distance) is compared across and agasint all inputs.
  • The input which has the lowest average levenshtein distance is used in the final transcription.
  • If none of that made any sense, essentially the filter looks at a collection of strings and returns the most "average" looking string for that clip.

Tech

Scribal uses a number of open source projects to work properly:

Installation

git clone https://github.com/wysesean/Scribal.git
cd Scribal
npm install
mongod (in a new tab)
npm run go

open localhost:3000 in the browser

License

MIT

Free Software, Hell Yeah!

About

Scribal is an online education platform that also crowd sources transcriptions for the hearing impaired.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published