pypi.org "speech-to-text" keyword
View the packages on the pypi.org package registry that are tagged with the "speech-to-text" keyword.
myproject101 1.0
Speech recognition module for Python, supporting several engines and APIs, online and offline.1 version - Latest release: about 4 years ago - 1 dependent repositories - 60 downloads last month - 8,691 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
17 versions - Latest release: 12 days ago - 32 dependent packages - 102 dependent repositories - 1.32 million downloads last month - 7,821 stars on GitHub - 2 maintainers
speechbrain 1.0.3
All-in-one speech toolkit in pure Python and Pytorch17 versions - Latest release: 12 days ago - 32 dependent packages - 102 dependent repositories - 1.32 million downloads last month - 7,821 stars on GitHub - 2 maintainers
voicegain-speech 1.118.0
Voicegain Speech-to-Text Python SDK158 versions - Latest release: 3 days ago - 1 dependent repositories - 3.55 thousand downloads last month - 3 stars on GitHub - 1 maintainer
piper-whistle 1.6.253
CLI tool to manage piper voices.26 versions - Latest release: over 1 year ago - 853 downloads last month - 3 stars on gitlab.com - 1 maintainer
asrecognition 0.0.4 💰
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.4 versions - Latest release: over 3 years ago - 1 dependent repositories - 171 downloads last month - 51 stars on GitHub - 1 maintainer
Top 0.8% on pypi.org
65 versions - Latest release: 27 days ago - 61 dependent packages - 908 dependent repositories - 1.45 million downloads last month - 8,019 stars on GitHub - 2 maintainers
speechrecognition 3.14.2
Library for performing speech recognition, with support for several engines and APIs, online and ...65 versions - Latest release: 27 days ago - 61 dependent packages - 908 dependent repositories - 1.45 million downloads last month - 8,019 stars on GitHub - 2 maintainers
Top 9.9% on pypi.org
45 versions - Latest release: 10 months ago - 1 dependent repositories - 840 downloads last month - 965 stars on GitHub - 1 maintainer
tensorflowasr 2.1.0
Almost State-of-the-art Automatic Speech Recognition using Tensorflow 245 versions - Latest release: 10 months ago - 1 dependent repositories - 840 downloads last month - 965 stars on GitHub - 1 maintainer
asrt-sdk 1.2.0
A python sdk for ASRT Speech Recognition Toolkit5 versions - Latest release: almost 3 years ago - 1 dependent repositories - 187 downloads last month - 51 stars on GitHub - 1 maintainer
webspeechrecognition 0.1.4
A Python library for speech-to-text integration using Selenium WebDriver.1 version - Latest release: 4 months ago - 102 downloads last month - 1 maintainer
Top 1.6% on pypi.org
19 versions - Latest release: 4 months ago - 53 dependent packages - 35 dependent repositories - 1.38 million downloads last month - 9,301 stars on GitHub - 2 maintainers
faster-whisper 1.1.1
Faster Whisper transcription with CTranslate219 versions - Latest release: 4 months ago - 53 dependent packages - 35 dependent repositories - 1.38 million downloads last month - 9,301 stars on GitHub - 2 maintainers
Top 1.3% on pypi.org
100 versions - Latest release: over 4 years ago - 6 dependent packages - 240 dependent repositories - 17.2 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer
deepspeech 0.9.3
A library for running inference on a DeepSpeech model100 versions - Latest release: over 4 years ago - 6 dependent packages - 240 dependent repositories - 17.2 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer
ef-sherpa-onnx 1.9.21.dev1
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen K...1 version - Latest release: 12 months ago - 33 downloads last month - 5,645 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
132 versions - Latest release: 16 days ago - 1 dependent repositories - 58.1 thousand downloads last month - 5,645 stars on GitHub - 2 maintainers
sherpa-onnx 1.11.3
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen K...132 versions - Latest release: 16 days ago - 1 dependent repositories - 58.1 thousand downloads last month - 5,645 stars on GitHub - 2 maintainers
ppasr 3.0.2
Automatic speech recognition toolkit on PaddlePaddle43 versions - Latest release: 1 day ago - 1 dependent repositories - 894 downloads last month - 852 stars on GitHub - 1 maintainer
masr 3.0.2
Automatic speech recognition toolkit on Pytorch36 versions - Latest release: 1 day ago - 1 dependent repositories - 552 downloads last month - 588 stars on GitHub - 1 maintainer
vernacular-ai-speech 0.1.2
Vernacular Speech API python client3 versions - Latest release: over 4 years ago - 1 dependent repositories - 173 downloads last month - 21 stars on GitHub - 1 maintainer
vaani-speech-to-text 0.1.2
Vaani is an open-source, AI-powered speech-to-text desktop app. Vaani (वाणी) refers to "speech" o...3 versions - Latest release: 2 days ago - 174 downloads last month - 0 stars on GitHub - 1 maintainer
automatic-speech-recognition 1.0.4
Distill the Automatic Speech Recognition (TensorFlow)3 versions - Latest release: about 5 years ago - 1 dependent repositories - 533 downloads last month - 224 stars on GitHub - 1 maintainer
voskintentvoiceconverthanzi 0.3.32
Offline open source speech recognition API based on Kaldi and Vosk1 version - Latest release: over 3 years ago - 1 dependent repositories - 32 downloads last month - 9,253 stars on GitHub - 1 maintainer
topai-faster-whisper 1.0.4
Faster Whisper transcription with CTranslate25 versions - Latest release: 6 months ago - 163 downloads last month - 15,431 stars on GitHub - 1 maintainer
unhallucinated-faster-whisper 1.0.3
Faster Whisper transcription with CTranslate25 versions - Latest release: 3 months ago - 240 downloads last month - 15,431 stars on GitHub - 1 maintainer
airunner 4.1.3 💰
Run local opensource AI models (Stable Diffusion, LLMs, TTS, STT, chatbots) in a lightweight Pyth...167 versions - Latest release: 1 day ago - 11.1 thousand downloads last month - 237 stars on GitHub - 1 maintainer
pvoctopus 2.0.2
⚠️ DEPRECATED: This package is no longer maintained.15 versions - Latest release: 3 days ago - 1 dependent package - 1 dependent repositories - 387 downloads last month - 36 stars on GitHub - 1 maintainer
funcodec 0.2.0
FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec1 version - Latest release: over 1 year ago - 1 dependent package - 298 downloads last month - 396 stars on GitHub - 1 maintainer
openlrc 1.6.0
Transcribe (whisper) and translate (gpt) voice into LRC file.31 versions - Latest release: 4 months ago - 1.09 thousand downloads last month - 407 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
22 versions - Latest release: 3 months ago - 43 dependent packages - 1,125 dependent repositories - 826 thousand downloads last month - 549 stars on GitHub - 1 maintainer
jiwer 3.1.0
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)22 versions - Latest release: 3 months ago - 43 dependent packages - 1,125 dependent repositories - 826 thousand downloads last month - 549 stars on GitHub - 1 maintainer
fastrtc 0.0.31
The realtime communication library for Python74 versions - Latest release: 2 months ago - 18.5 thousand downloads last month - 3,432 stars on GitHub - 1 maintainer
live-translation 0.5.0
A real-time translation tool using Whisper & Opus-MT7 versions - Latest release: 2 days ago - 673 downloads last month - 3 stars on GitHub - 1 maintainer
whisperx-numpy2-compatibility 0.1.1 💰
A compatibility fix to allow whisperx to work with other packages that require numpy>2. Should no...2 versions - Latest release: 5 months ago - 200 downloads last month - 14,976 stars on GitHub - 1 maintainer
slikts-whisperx 3.3.1 💰
Time-Accurate Automatic Speech Recognition using Whisper.1 version - Latest release: 3 months ago - 53 downloads last month - 14,976 stars on GitHub - 1 maintainer
whisperx-karaoke 0.1.1 💰
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)2 versions - Latest release: 12 months ago - 102 downloads last month - 14,976 stars on GitHub - 1 maintainer
whisperx 3.3.2 💰
Time-Accurate Automatic Speech Recognition using Whisper.10 versions - Latest release: 9 days ago - 3 dependent packages - 118 thousand downloads last month - 14,976 stars on GitHub - 2 maintainers
mediacatch-s2t 2.0.1
Upload a media file and get the transcription link.17 versions - Latest release: over 1 year ago - 273 downloads last month - 0 stars on GitHub - 1 maintainer
srtvoiceext 0.1.2
A command line interface to combine text information from subtitles with voice data in the video.6 versions - Latest release: over 7 years ago - 1 dependent repositories - 133 downloads last month - 19 stars on GitHub - 1 maintainer
pvoctopusdemo 2.0.2
⚠️ DEPRECATED: This package is no longer maintained.15 versions - Latest release: 3 days ago - 1 dependent repositories - 387 downloads last month - 36 stars on GitHub - 1 maintainer
nonocaptcha 2.0.1
An asynchronized Python library to automate solving ReCAPTCHA v2 by audio84 versions - Latest release: about 6 years ago - 1 dependent repositories - 3.22 thousand downloads last month - 895 stars on GitHub - 1 maintainer
phonexia-transcription-normalization-client 1.0.1
Transcription Normalization Client1 version - Latest release: 3 days ago - 1 maintainer
transcription-normalization-client 1.0.1
Transcription Normalization Client1 version - Latest release: 3 days ago - 1 maintainer
Top 4.6% on pypi.org
41 versions - Latest release: over 4 years ago - 2 dependent packages - 4 dependent repositories - 4.82 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer
deepspeech-tflite 0.9.3
A library for running inference on a DeepSpeech model41 versions - Latest release: over 4 years ago - 2 dependent packages - 4 dependent repositories - 4.82 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
98 versions - Latest release: over 4 years ago - 12 dependent repositories - 7.27 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer
deepspeech-gpu 0.9.3
A library for running inference on a DeepSpeech model98 versions - Latest release: over 4 years ago - 12 dependent repositories - 7.27 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer
mlx-audio 0.0.4 💰
MLX-Audio is a package for inference of text-to-speech (TTS) and speech-to-speech (STS) models lo...4 versions - Latest release: 8 days ago - 2.59 thousand downloads last month - 474 stars on GitHub - 1 maintainer
commoncorrections 1.0.12
A small python implementation of common ASR corrections13 versions - Latest release: almost 3 years ago - 1 dependent repositories - 440 downloads last month - 3 stars on GitHub - 1 maintainer
esperanto 1.2.1
A unified interface for various AI model providers43 versions - Latest release: 7 days ago - 1.63 thousand downloads last month - 5 stars on GitHub - 1 maintainer
faster-whisper-hotkey 0.1.4
Push-to-talk transcription using faster-whisper5 versions - Latest release: 4 days ago - 419 downloads last month - 3 stars on GitHub - 1 maintainer
verbatim 1.1.0
high quality multi-lingual speech to text10 versions - Latest release: 2 months ago - 383 downloads last month - 18 stars on GitHub - 1 maintainer
pafts 1.0.1
Library That Preprocessing Audio For TTS.4 versions - Latest release: 5 months ago - 211 downloads last month - 8 stars on GitHub - 1 maintainer
pywer 0.1.1
A simple Python package to calculate word error rate (WER).2 versions - Latest release: about 4 years ago - 1 dependent repositories - 139 downloads last month - 5 stars on GitHub - 1 maintainer
livekit-plugins-gladia 1.0.12
Agent Framework plugin for services using Gladia's API.10 versions - Latest release: 4 days ago - 799 downloads last month - 5,538 stars on GitHub - 1 maintainer
mseep-elevenlabs-mcp 0.2.1
ElevenLabs MCP Server1 version - Latest release: 4 days ago - 1 maintainer
Top 8.9% on pypi.org
8 versions - Latest release: over 4 years ago - 1 dependent repositories - 440 downloads last month - 180 stars on GitHub - 1 maintainer
goodbyecaptcha 2.4.2
An asynchronized Python library to automate solving ReCAPTCHA v2 by images/audio8 versions - Latest release: over 4 years ago - 1 dependent repositories - 440 downloads last month - 180 stars on GitHub - 1 maintainer
ms-funcodec 0.2.0
FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec1 version - Latest release: about 1 month ago - 1.25 thousand downloads last month - 392 stars on GitHub - 1 maintainer
rev-reverb 0.1.0
A simplified python packge to interact with the reverb models1 version - Latest release: 5 months ago - 67 downloads last month - 389 stars on GitHub - 1 maintainer
melissa 1000.0.34
A lovely virtual assistant for OS X, Windows and Linux systems.2 versions - Latest release: over 7 years ago - 1 dependent repositories - 81 downloads last month - 490 stars on GitHub - 1 maintainer
iago 0.2.8
The package contains your python assistant for Speech Recognition and Text to Speech10 versions - Latest release: over 4 years ago - 1 dependent repositories - 402 downloads last month - 2 stars on GitHub - 1 maintainer
auroraapi 0.2.0
Python SDK for Aurora16 versions - Latest release: almost 7 years ago - 2 dependent repositories - 446 downloads last month - 4 stars on GitHub - 1 maintainer
playwright-recaptcha 0.5.1
A library for solving reCAPTCHA v2 and v3 with Playwright22 versions - Latest release: 10 months ago - 1 dependent repositories - 21.9 thousand downloads last month - 270 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
44 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 17.3 thousand downloads last month - 60 stars on GitHub - 1 maintainer
speechmatics-python 3.0.3
Python library and CLI for Speechmatics44 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 17.3 thousand downloads last month - 60 stars on GitHub - 1 maintainer
tatt 0.981
Tatt creates a uniform API for multiple speech-to-text (STT) services.55 versions - Latest release: almost 6 years ago - 1 dependent repositories - 835 downloads last month - 11 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
20 versions - Latest release: about 3 years ago - 2 dependent repositories - 313 downloads last month - 1,728 stars on GitHub - 1 maintainer
kalliope 0.7.2
Kalliope is a modular always-on voice controlled personal assistant designed for home automation.20 versions - Latest release: about 3 years ago - 2 dependent repositories - 313 downloads last month - 1,728 stars on GitHub - 1 maintainer
deepgram-unstable-sdk 3.8.0.dev4
The official Python SDK for the Deepgram automated speech recognition platform.22 versions - Latest release: 5 months ago - 600 downloads last month - 286 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
6 versions - Latest release: over 3 years ago - 2 dependent repositories - 984 downloads last month - 2,408 stars on GitHub - 1 maintainer
stt-tflite 0.10.0a10
A library for doing speech recognition using a Coqui STT model6 versions - Latest release: over 3 years ago - 2 dependent repositories - 984 downloads last month - 2,408 stars on GitHub - 1 maintainer
coqui-stt-training 1.4.0
Training code for Coqui STT33 versions - Latest release: over 2 years ago - 1 dependent repositories - 638 downloads last month - 2,408 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
35 versions - Latest release: over 2 years ago - 20 dependent repositories - 5.69 thousand downloads last month - 2,408 stars on GitHub - 2 maintainers
stt 1.4.0
A library for doing speech recognition using a Coqui STT model35 versions - Latest release: over 2 years ago - 20 dependent repositories - 5.69 thousand downloads last month - 2,408 stars on GitHub - 2 maintainers
iarahealth-stt-training 1.5.2
Training code for Coqui STT2 versions - Latest release: over 1 year ago - 89 downloads last month - 2,408 stars on GitHub - 1 maintainer
stt-gpu 0.10.0a4
A library for doing speech recognition using a Coqui STT model1 version - Latest release: about 4 years ago - 1 dependent repositories - 29 downloads last month - 2,401 stars on GitHub - 1 maintainer
Top 8.4% on pypi.org
16 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 1.26 thousand downloads last month - 619 stars on GitHub - 1 maintainer
pvcheetah 2.1.3
Cheetah Speech-to-Text Engine.16 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 1.26 thousand downloads last month - 619 stars on GitHub - 1 maintainer
pvcheetahdemo 2.1.3
Cheetah speech-to-text engine demos20 versions - Latest release: about 1 month ago - 1 dependent repositories - 660 downloads last month - 619 stars on GitHub - 1 maintainer
watson-streaming 0.0.13
Speech to text transcription in real-time using IBM Watson13 versions - Latest release: almost 5 years ago - 2 dependent repositories - 533 downloads last month - 3 stars on GitHub - 1 maintainer
pvleoparddemo 2.0.5
Leopard speech-to-text engine demos24 versions - Latest release: 2 months ago - 1 dependent repositories - 608 downloads last month - 422 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
25 versions - Latest release: 2 months ago - 1 dependent package - 4 dependent repositories - 2.33 thousand downloads last month - 422 stars on GitHub - 1 maintainer
pvleopard 2.0.5
Leopard Speech-to-Text Engine.25 versions - Latest release: 2 months ago - 1 dependent package - 4 dependent repositories - 2.33 thousand downloads last month - 422 stars on GitHub - 1 maintainer
werpy 3.0.2
A powerful yet lightweight Python package to calculate and analyze the Word Error Rate (WER).16 versions - Latest release: 16 days ago - 1 dependent repositories - 5.74 thousand downloads last month - 12 stars on GitHub - 1 maintainer
usttc 0.0.8
Unified Speech-to-text Client8 versions - Latest release: about 2 years ago - 1 dependent repositories - 181 downloads last month - 4 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
39 versions - Latest release: 12 months ago - 3 dependent repositories - 3.16 thousand downloads last month - 235 stars on GitHub - 1 maintainer
mltu 1.2.5
Machine Learning Training Utilities (MLTU) for TensorFlow and PyTorch39 versions - Latest release: 12 months ago - 3 dependent repositories - 3.16 thousand downloads last month - 235 stars on GitHub - 1 maintainer
anaouder 1.0.3
Breton language speech-to-text tools22 versions - Latest release: about 1 month ago - 1.11 thousand downloads last month - 10 stars on GitHub - 1 maintainer
speech_tool 0.0.8
An easy to use Python package for the Whisper STT model3 versions - Latest release: about 2 months ago - 193 downloads last month - 0 stars on GitHub - 1 maintainer
drdictaphone-neovim-plugin 0.9.1
DrDictaphone plugin for Neovim3 versions - Latest release: over 1 year ago - 75 downloads last month - 6 stars on GitHub - 1 maintainer
tafrigh 1.7.6
تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.33 versions - Latest release: 17 days ago - 1.84 thousand downloads last month - 125 stars on GitHub - 1 maintainer
armspeech 0.1.4 💰
ArmSpeech is an offline Armenian speech recognition library (speech-to-text) and CLI tool based o...3 versions - Latest release: almost 2 years ago - 36 downloads last month - 4 stars on GitHub - 1 maintainer
whisper-s2t 1.3.1
An Optimized Speech-to-Text Pipeline for the Whisper Model.5 versions - Latest release: about 1 year ago - 2 dependent packages - 3.61 thousand downloads last month - 382 stars on GitHub - 1 maintainer
acapela-downloader-py 0.1.7
Acapela pwned but in Python.15 versions - Latest release: almost 2 years ago - 829 downloads last month - 102 stars on GitHub - 1 maintainer
whisper-turbo-mlx 0.0.1
Whisper Turbo in MLX17 versions - Latest release: 6 months ago - 527 downloads last month - 202 stars on GitHub - 1 maintainer
pronunciation-dictionary 0.0.6
Library to save and load pronunciation dictionaries (language-independent).5 versions - Latest release: about 1 year ago - 8 dependent packages - 6 dependent repositories - 249 downloads last month - 3 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
8 versions - Latest release: over 2 years ago - 7 dependent repositories - 1.08 thousand downloads last month - 447 stars on GitHub - 1 maintainer
huggingsound 0.1.6 💰
HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools.8 versions - Latest release: over 2 years ago - 7 dependent repositories - 1.08 thousand downloads last month - 447 stars on GitHub - 1 maintainer
speechbrain-geoph9 0.5.12a0
All-in-one speech toolkit in pure Python and Pytorch1 version - Latest release: over 2 years ago - 1 dependent repositories - 43 downloads last month - 8,811 stars on GitHub - 1 maintainer
whisperer-ml 0.1.7
Go from raw audio to a text-audio dataset with OpenAI's Whisper7 versions - Latest release: about 2 years ago - 1 dependent repositories - 266 downloads last month - 8,811 stars on GitHub - 1 maintainer
deepasr 0.1.2
Keras(Tensorflow) implementations of Automatic Speech Recognition12 versions - Latest release: over 3 years ago - 1 dependent repositories - 249 downloads last month - 23 stars on GitHub - 1 maintainer
speechloop 0.0.3
A "keep it simple" collection of many speech recognition engines... Designed to help answer - wha...3 versions - Latest release: almost 3 years ago - 1 dependent repositories - 149 downloads last month - 19 stars on GitHub - 1 maintainer
realtimestt 0.3.101
A fast Voice Activity Detection and Transcription System44 versions - Latest release: 8 days ago - 1 dependent repositories - 22.1 thousand downloads last month - 1,465 stars on GitHub - 1 maintainer
Top 2.8% on pypi.org
22 versions - Latest release: over 3 years ago - 11 dependent packages - 58 dependent repositories - 4.47 thousand downloads last month - 714 stars on GitHub - 1 maintainer
adapt-parser 1.0.0
A text-to-intent parsing framework.22 versions - Latest release: over 3 years ago - 11 dependent packages - 58 dependent repositories - 4.47 thousand downloads last month - 714 stars on GitHub - 1 maintainer
quail 0.2.2
A python toolbox for analyzing and plotting free recall data8 versions - Latest release: over 1 year ago - 1 dependent repositories - 254 downloads last month - 22 stars on GitHub - 1 maintainer
pellipop 0.9.2
A graphical and command-line tool to extract key frames from videos along with their retranscript...62 versions - Latest release: about 1 month ago - 1 dependent repositories - 1.56 thousand downloads last month - 2 stars on GitHub - 1 maintainer
whispercpp-kit 0.1.5
A toolkit for whisper.cpp with audio processing and model management6 versions - Latest release: 3 months ago - 594 downloads last month - 1 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
52 versions - Latest release: 4 months ago - 1 dependent repositories - 18.9 thousand downloads last month - 903 stars on GitHub - 1 maintainer
whisper-ctranslate2 0.5.2
Whisper command line client that uses CTranslate2 and faster-whisper52 versions - Latest release: 4 months ago - 1 dependent repositories - 18.9 thousand downloads last month - 903 stars on GitHub - 1 maintainer
geniusrise-audio 0.1.12
audio bolts for geniusrise13 versions - Latest release: about 1 year ago - 387 downloads last month - 2 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
4 versions - Latest release: almost 3 years ago - 4 dependent packages - 3 dependent repositories - 9.55 thousand downloads last month - 4,944 stars on GitHub - 1 maintainer
silero 0.4.1 💰
Silero Models: pre-trained enterprise-grade STT / TTS models and benchmarks.4 versions - Latest release: almost 3 years ago - 4 dependent packages - 3 dependent repositories - 9.55 thousand downloads last month - 4,944 stars on GitHub - 1 maintainer
voicesynth 0.2.3 💰
Package for realistic voice synthesis24 versions - Latest release: 6 months ago - 396 downloads last month - 4,944 stars on GitHub - 1 maintainer
speech-dataset-generator 1.0.0
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset type...1 version - Latest release: about 1 year ago - 45 downloads last month - 135 stars on GitHub - 1 maintainer
textops-api-v1 0.1.2
Python client for TextOps transcription API3 versions - Latest release: 9 days ago - 1 maintainer
scraibe-webui 0.2.1
A web interface for the ScAIbe speech-to-text transcription tool5 versions - Latest release: 5 months ago - 180 downloads last month - 37 stars on GitHub - 1 maintainer
speak-now 0.1.3
A locally-hosted, low-latency speech-to-text solution with LLM integration.4 versions - Latest release: about 2 months ago - 234 downloads last month - 6,548 stars on GitHub - 1 maintainer
Related Keywords
speech-recognition
123
python
55
asr
53
whisper
39
deep-learning
34
stt
34
speech
32
transcription
28
text-to-speech
26
automatic-speech-recognition
25
voice-recognition
24
machine-learning
22
openai
20
audio
18
tensorflow
17
speech-synthesis
15
pytorch
15
ai
14
nlp
14
tts
12
deepspeech
12
python3
10
transformers
9
transformer
9
offline
9
audio-processing
8
inference
8
neural-networks
8
speech recognition
8
on-device
8
voice
8
speech-recognizer
7
embedded
7
speech-recognition-api
7
speech-processing
7
text
6
linux
6
realtime
6
word-error-rate
6
kaldi
6
vosk
6
Voice Recognition
6
huggingface
6
ASR
6
Speech Recognition
6
speechrecognition
6
voice-commands
5
wer
5
speaker-diarization
5
language-model
5
whisper-ai
5
speaker-verification
5
quantization
5
automatic speech recognition
5
raspberry-pi
5
natural-language-processing
5
real-time
5
chatgpt
5
artificial-intelligence
5
dictation
5
video
5
voice recognition
4
normalization
4
android
4
deepspeech2
4
asyncio
4
stt-benchmark
4
ios
4
speech to text
4
terminal
4
Speech-to-Text
4
onnx
4
translation
4
windows
4
Automatic Speech Recognition
4
open-source
4
llm
4
speech-analysis
4
openai-whisper
4
sdk
4
deep-neural-networks
4
conformer
4
NLP
4
AI
4
openai-api
4
transcribe
4
ctranslate2
4
youtube
4
google
4
recognition
4
chatbot
3
faster-whisper
3
voice-to-text
3
evaluation-metrics
3
cross-platform
3
ocr
3
search
3
multimodal
3
assistant
3
pyannote
3