This is the most basic use case I had in mind: being able to capture any text on the screen and translate it. The main benefits of this approach are:
- You can translate text of any kind, whether on a website, in an image, or in a video, including places that Google's website translation can't get to, fails on, or isn't implemented for.
- There is no downloading or uploading of images anywhere, and you don't need to pull out your phone; you just select exactly what you want to translate.
The core idea is that you can translate anything, anywhere on your desktop, with the click of a button. A neat addition is that this does not trigger the security blackout during capture that you get with traditional software like Snipping Tool on websites using EME for DRM: everything on the screen is unaffected even though you are still capturing screenshots.
The video player is still black in the final image, but since we only want the subtitles, this actually works to our benefit.
The primary thing I will be focusing on is improving the basic screenshot translation and polishing it into a complete basic feature. This will include making sure the translator window is completely invisible when it captures an image, polishing some of the basic UI so it becomes more intuitive to use, and adding some basic preprocessing for both the image and the OCR'd text.
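
To make the text-preprocessing step a bit more concrete, here is a minimal sketch of what cleaning up OCR'd text before translation could look like. It is only an illustration, not the project's actual code: the function name `clean_ocr_text` and the specific cleanup rules (Unicode normalization, rejoining words hyphenated across line breaks, collapsing whitespace, merging wrapped lines) are my own assumptions about what "basic preprocessing" might cover. It uses only the Python standard library.

```python
import re
import unicodedata


def clean_ocr_text(raw: str) -> str:
    """Tidy raw OCR output before handing it to a translator.

    Hypothetical helper: the rules below are illustrative assumptions,
    not the project's confirmed preprocessing steps.
    """
    # Normalize Unicode so ligatures and full-width forms become plain characters.
    text = unicodedata.normalize("NFKC", raw)
    # Rejoin words that were hyphenated across a line break: "trans-\nlate" -> "translate".
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)
    # Collapse runs of whitespace inside each line.
    lines = [" ".join(line.split()) for line in text.splitlines()]
    # Drop lines that are empty after collapsing.
    lines = [line for line in lines if line]
    # Subtitles are often wrapped over several lines; merge them into one string.
    return " ".join(lines)


if __name__ == "__main__":
    sample = "Hello   wor-\nld \n\n  this is\ta test "
    print(clean_ocr_text(sample))
```

One trade-off of this sketch is that the hyphen rule also joins genuine compounds that happen to break at the hyphen (for example "well-" followed by "known" on the next line), so a real implementation would probably need a smarter check.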