Rust bindings to https://github.com/k2-fsa/sherpa-onnx
-
Updated
Jul 6, 2024 - Rust
Rust bindings to https://github.com/k2-fsa/sherpa-onnx
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Speech Recognization App source Code
On-device Inference of Whisper Speech Recognition Models for Apple Silicon
Simple voice assistant made with Qt5
MuskanAi is a personal Digital Assistant which is capable of performing all Automation task whether it is Controlling your Devices, Browsing the Internet and Emotional Understanding..
Archival Intelligences. (TBA)
On-device speech-to-text engine powered by deep learning
Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
A PyTorch-based Speech Toolkit
Chrome/Edge BROWSER EXTENSION that can RECOGNIZE any live audio/video streaming then TRANSLATE it for FREE (using unofficial online Google Translate API) then display it as LIVE CAPTION / LIVE SUBTITLE!
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A set of bash scripts designed to use a whisper speech to text model offline for voicetyping on linux quickly and effeciently
Results of Dutch ASR models, collected by the community
Faster Whisper transcription with CTranslate2
HTML Web template that can RECOGNIZE any live audio/video streaming (using Chrome webkitSpeechRecognition API) then TRANSLATE it for FREE (using unofficial online Google Translate API) then display it as LIVE CAPTION / LIVE SUBTITLE
Achieve your goals and keep your data private with Lotti. This life tracking app is designed to help you stay motivated and on track, all while keeping your personal information safe and secure. Now with on-device speech recognition.
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."