speech-recognition

Star

Here are 4,699 public repositories matching this topic...

thewh1teagle / sherpa-rs

Sponsor

Star

Rust bindings to https://github.com/k2-fsa/sherpa-onnx

audio rust embeddings speech-recognition sherpa diarization

Updated Jul 6, 2024
Rust

DmitryRyumin / ICASSP-2023-24-Papers

Star

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated Jul 6, 2024
Python

huggingface / transformers

Star

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Updated Jul 6, 2024
Python

openvinotoolkit / openvino

Star

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

nlp natural-language-processing ai computer-vision deep-learning transformers inference speech-recognition yolo recommendation-system performance-boost good-first-issue openvino diffusion-models stable-diffusion generative-ai llm-inference optimize-ai deploy-ai

Updated Jul 6, 2024
C++

cybera3s / speech_recognization_app

Star

Speech Recognization App source Code

flask speech-recognition voice-to-text

Updated Jul 5, 2024
HTML

argmaxinc / WhisperKit

Star

On-device Inference of Whisper Speech Recognition Models for Apple Silicon

macos swift ios watchos transformers inference speech-recognition pretrained-models whisper visionos

Updated Jul 5, 2024
Swift

janinainfa / mex-assistant

Star

Simple voice assistant made with Qt5

python qt5 assistant speech-recognition

Updated Jul 5, 2024
Python

4darsh-Dev / MuskanAi

Sponsor

Star

MuskanAi is a personal Digital Assistant which is capable of performing all Automation task whether it is Controlling your Devices, Browsing the Internet and Emotional Understanding..

python machine-learning natural-language-processing django web-development deep-learning django-rest-framework speech-recognition trending-repositories html-css-javascript natural-language-understanding digital-assistant emotional-analysis pytorch-nlp aritificalintelligence

Updated Jul 5, 2024
Jupyter Notebook

heypoom / archival-intelligences

Star

Archival Intelligences. (TBA)

typescript ai vue speech-recognition stable-diffusion

Updated Jul 5, 2024
TypeScript

Picovoice / leopard

Star

On-device speech-to-text engine powered by deep learning

voice-recognition speech-recognition automatic-speech-recognition speech-to-text transcription stt asr voice-to-text on-device

Updated Jul 5, 2024
Python

Sharrnah / whispering-ui

Star

Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)

translator ai tts speech-recognition translate transcribe whisper-ai

Updated Jul 5, 2024
Go

harvard-edge / multilingual_kws

Star

Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus

speech-recognition keyword-spotting wake-word-detection query-by-example kws keyword-search few-shot-learning

Updated Jul 5, 2024
Jupyter Notebook

speechbrain / speechbrain

Star

A PyTorch-based Speech Toolkit

Updated Jul 5, 2024
Python

botbahlul / crx-live-translate

Star

Chrome/Edge BROWSER EXTENSION that can RECOGNIZE any live audio/video streaming then TRANSLATE it for FREE (using unofficial online Google Translate API) then display it as LIVE CAPTION / LIVE SUBTITLE!

javascript chrome edge voice-recognition speech-recognition browser-extension speech-to-text google-translate-api webkitspeechrecognition auto-caption auto-subtitle webkit-speech-recognition

Updated Jul 5, 2024
JavaScript

modelscope / FunASR

Star

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jul 5, 2024
Python

jessemcg / geek-dictation

Star

A set of bash scripts designed to use a whisper speech to text model offline for voicetyping on linux quickly and effeciently

linux bash speech-recognition speech-to-text whisper voice-typing whisper-cpp ggml

Updated Jul 5, 2024
Shell

opensource-spraakherkenning-nl / ASR_NL_results

Star

Results of Dutch ASR models, collected by the community

speech-recognition asr-benchmark dutch-language

Updated Jul 5, 2024
SCSS

SYSTRAN / faster-whisper

Star

Faster Whisper transcription with CTranslate2

deep-learning inference transformer speech-recognition openai speech-to-text quantization whisper

Updated Jul 5, 2024
Python

botbahlul / js-live-audio-video-translate

Star

HTML Web template that can RECOGNIZE any live audio/video streaming (using Chrome webkitSpeechRecognition API) then TRANSLATE it for FREE (using unofficial online Google Translate API) then display it as LIVE CAPTION / LIVE SUBTITLE

javascript html web voice-recognition speech-recognition google-translate web-template google-translate-api webkitspeechrecognition auto-caption auto-subtitle webkit-speech-recognition

Updated Jul 5, 2024
JavaScript

matthiasn / lotti

Sponsor

Star

Achieve your goals and keep your data private with Lotti. This life tracking app is designed to help you stay motivated and on track, all while keeping your personal information safe and secure. Now with on-device speech recognition.

windows macos ios journal health speech-recognition time-tracker speech-to-text android-app flutter linux-app fitness-app

Updated Jul 5, 2024
Dart

Improve this page

Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-recognition

Here are 4,699 public repositories matching this topic...

thewh1teagle / sherpa-rs

DmitryRyumin / ICASSP-2023-24-Papers

huggingface / transformers

openvinotoolkit / openvino

cybera3s / speech_recognization_app

argmaxinc / WhisperKit

janinainfa / mex-assistant

4darsh-Dev / MuskanAi

heypoom / archival-intelligences

Picovoice / leopard

Sharrnah / whispering-ui

harvard-edge / multilingual_kws

speechbrain / speechbrain

botbahlul / crx-live-translate

modelscope / FunASR

jessemcg / geek-dictation

opensource-spraakherkenning-nl / ASR_NL_results

SYSTRAN / faster-whisper

botbahlul / js-live-audio-video-translate

matthiasn / lotti

Improve this page

Add this topic to your repo