An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "speech-to-text" keyword

View the packages on the pypi.org package registry that are tagged with the "speech-to-text" keyword.

myproject101 1.0
Speech recognition module for Python, supporting several engines and APIs, online and offline.
1 version - Latest release: about 4 years ago - 1 dependent repositories - 60 downloads last month - 8,691 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
speechbrain 1.0.3
All-in-one speech toolkit in pure Python and Pytorch
17 versions - Latest release: 12 days ago - 32 dependent packages - 102 dependent repositories - 1.32 million downloads last month - 7,821 stars on GitHub - 2 maintainers
voicegain-speech 1.118.0
Voicegain Speech-to-Text Python SDK
158 versions - Latest release: 3 days ago - 1 dependent repositories - 3.55 thousand downloads last month - 3 stars on GitHub - 1 maintainer
piper-whistle 1.6.253
CLI tool to manage piper voices.
26 versions - Latest release: over 1 year ago - 853 downloads last month - 3 stars on gitlab.com - 1 maintainer
asrecognition 0.0.4 💰
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
4 versions - Latest release: over 3 years ago - 1 dependent repositories - 171 downloads last month - 51 stars on GitHub - 1 maintainer
Top 0.8% on pypi.org
speechrecognition 3.14.2
Library for performing speech recognition, with support for several engines and APIs, online and ...
65 versions - Latest release: 27 days ago - 61 dependent packages - 908 dependent repositories - 1.45 million downloads last month - 8,019 stars on GitHub - 2 maintainers
Top 9.9% on pypi.org
tensorflowasr 2.1.0
Almost State-of-the-art Automatic Speech Recognition using Tensorflow 2
45 versions - Latest release: 10 months ago - 1 dependent repositories - 840 downloads last month - 965 stars on GitHub - 1 maintainer
asrt-sdk 1.2.0
A python sdk for ASRT Speech Recognition Toolkit
5 versions - Latest release: almost 3 years ago - 1 dependent repositories - 187 downloads last month - 51 stars on GitHub - 1 maintainer
webspeechrecognition 0.1.4
A Python library for speech-to-text integration using Selenium WebDriver.
1 version - Latest release: 4 months ago - 102 downloads last month - 1 maintainer
Top 1.6% on pypi.org
faster-whisper 1.1.1
Faster Whisper transcription with CTranslate2
19 versions - Latest release: 4 months ago - 53 dependent packages - 35 dependent repositories - 1.38 million downloads last month - 9,301 stars on GitHub - 2 maintainers
Top 1.3% on pypi.org
deepspeech 0.9.3
A library for running inference on a DeepSpeech model
100 versions - Latest release: over 4 years ago - 6 dependent packages - 240 dependent repositories - 17.2 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer
ef-sherpa-onnx 1.9.21.dev1
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen K...
1 version - Latest release: 12 months ago - 33 downloads last month - 5,645 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
sherpa-onnx 1.11.3
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen K...
132 versions - Latest release: 16 days ago - 1 dependent repositories - 58.1 thousand downloads last month - 5,645 stars on GitHub - 2 maintainers
ppasr 3.0.2
Automatic speech recognition toolkit on PaddlePaddle
43 versions - Latest release: 1 day ago - 1 dependent repositories - 894 downloads last month - 852 stars on GitHub - 1 maintainer
masr 3.0.2
Automatic speech recognition toolkit on Pytorch
36 versions - Latest release: 1 day ago - 1 dependent repositories - 552 downloads last month - 588 stars on GitHub - 1 maintainer
vernacular-ai-speech 0.1.2
Vernacular Speech API python client
3 versions - Latest release: over 4 years ago - 1 dependent repositories - 173 downloads last month - 21 stars on GitHub - 1 maintainer
vaani-speech-to-text 0.1.2
Vaani is an open-source, AI-powered speech-to-text desktop app. Vaani (वाणी) refers to "speech" o...
3 versions - Latest release: 2 days ago - 174 downloads last month - 0 stars on GitHub - 1 maintainer
automatic-speech-recognition 1.0.4
Distill the Automatic Speech Recognition (TensorFlow)
3 versions - Latest release: about 5 years ago - 1 dependent repositories - 533 downloads last month - 224 stars on GitHub - 1 maintainer
voskintentvoiceconverthanzi 0.3.32
Offline open source speech recognition API based on Kaldi and Vosk
1 version - Latest release: over 3 years ago - 1 dependent repositories - 32 downloads last month - 9,253 stars on GitHub - 1 maintainer
topai-faster-whisper 1.0.4
Faster Whisper transcription with CTranslate2
5 versions - Latest release: 6 months ago - 163 downloads last month - 15,431 stars on GitHub - 1 maintainer
unhallucinated-faster-whisper 1.0.3
Faster Whisper transcription with CTranslate2
5 versions - Latest release: 3 months ago - 240 downloads last month - 15,431 stars on GitHub - 1 maintainer
airunner 4.1.3 💰
Run local opensource AI models (Stable Diffusion, LLMs, TTS, STT, chatbots) in a lightweight Pyth...
167 versions - Latest release: 1 day ago - 11.1 thousand downloads last month - 237 stars on GitHub - 1 maintainer
pvoctopus 2.0.2
⚠️ DEPRECATED: This package is no longer maintained.
15 versions - Latest release: 3 days ago - 1 dependent package - 1 dependent repositories - 387 downloads last month - 36 stars on GitHub - 1 maintainer
funcodec 0.2.0
FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec
1 version - Latest release: over 1 year ago - 1 dependent package - 298 downloads last month - 396 stars on GitHub - 1 maintainer
openlrc 1.6.0
Transcribe (whisper) and translate (gpt) voice into LRC file.
31 versions - Latest release: 4 months ago - 1.09 thousand downloads last month - 407 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
jiwer 3.1.0
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
22 versions - Latest release: 3 months ago - 43 dependent packages - 1,125 dependent repositories - 826 thousand downloads last month - 549 stars on GitHub - 1 maintainer
fastrtc 0.0.31
The realtime communication library for Python
74 versions - Latest release: 2 months ago - 18.5 thousand downloads last month - 3,432 stars on GitHub - 1 maintainer
live-translation 0.5.0
A real-time translation tool using Whisper & Opus-MT
7 versions - Latest release: 2 days ago - 673 downloads last month - 3 stars on GitHub - 1 maintainer
whisperx-numpy2-compatibility 0.1.1 💰
A compatibility fix to allow whisperx to work with other packages that require numpy>2. Should no...
2 versions - Latest release: 5 months ago - 200 downloads last month - 14,976 stars on GitHub - 1 maintainer
slikts-whisperx 3.3.1 💰
Time-Accurate Automatic Speech Recognition using Whisper.
1 version - Latest release: 3 months ago - 53 downloads last month - 14,976 stars on GitHub - 1 maintainer
whisperx-karaoke 0.1.1 💰
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
2 versions - Latest release: 12 months ago - 102 downloads last month - 14,976 stars on GitHub - 1 maintainer
whisperx 3.3.2 💰
Time-Accurate Automatic Speech Recognition using Whisper.
10 versions - Latest release: 9 days ago - 3 dependent packages - 118 thousand downloads last month - 14,976 stars on GitHub - 2 maintainers
mediacatch-s2t 2.0.1
Upload a media file and get the transcription link.
17 versions - Latest release: over 1 year ago - 273 downloads last month - 0 stars on GitHub - 1 maintainer
srtvoiceext 0.1.2
A command line interface to combine text information from subtitles with voice data in the video.
6 versions - Latest release: over 7 years ago - 1 dependent repositories - 133 downloads last month - 19 stars on GitHub - 1 maintainer
pvoctopusdemo 2.0.2
⚠️ DEPRECATED: This package is no longer maintained.
15 versions - Latest release: 3 days ago - 1 dependent repositories - 387 downloads last month - 36 stars on GitHub - 1 maintainer
nonocaptcha 2.0.1
An asynchronized Python library to automate solving ReCAPTCHA v2 by audio
84 versions - Latest release: about 6 years ago - 1 dependent repositories - 3.22 thousand downloads last month - 895 stars on GitHub - 1 maintainer
phonexia-transcription-normalization-client 1.0.1
Transcription Normalization Client
1 version - Latest release: 3 days ago - 1 maintainer
transcription-normalization-client 1.0.1
Transcription Normalization Client
1 version - Latest release: 3 days ago - 1 maintainer
Top 4.6% on pypi.org
deepspeech-tflite 0.9.3
A library for running inference on a DeepSpeech model
41 versions - Latest release: over 4 years ago - 2 dependent packages - 4 dependent repositories - 4.82 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
deepspeech-gpu 0.9.3
A library for running inference on a DeepSpeech model
98 versions - Latest release: over 4 years ago - 12 dependent repositories - 7.27 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer
mlx-audio 0.0.4 💰
MLX-Audio is a package for inference of text-to-speech (TTS) and speech-to-speech (STS) models lo...
4 versions - Latest release: 8 days ago - 2.59 thousand downloads last month - 474 stars on GitHub - 1 maintainer
commoncorrections 1.0.12
A small python implementation of common ASR corrections
13 versions - Latest release: almost 3 years ago - 1 dependent repositories - 440 downloads last month - 3 stars on GitHub - 1 maintainer
esperanto 1.2.1
A unified interface for various AI model providers
43 versions - Latest release: 7 days ago - 1.63 thousand downloads last month - 5 stars on GitHub - 1 maintainer
faster-whisper-hotkey 0.1.4
Push-to-talk transcription using faster-whisper
5 versions - Latest release: 4 days ago - 419 downloads last month - 3 stars on GitHub - 1 maintainer
verbatim 1.1.0
high quality multi-lingual speech to text
10 versions - Latest release: 2 months ago - 383 downloads last month - 18 stars on GitHub - 1 maintainer
pafts 1.0.1
Library that preprocesses audio for TTS.
4 versions - Latest release: 5 months ago - 211 downloads last month - 8 stars on GitHub - 1 maintainer
pywer 0.1.1
A simple Python package to calculate word error rate (WER).
2 versions - Latest release: about 4 years ago - 1 dependent repositories - 139 downloads last month - 5 stars on GitHub - 1 maintainer
livekit-plugins-gladia 1.0.12
Agent Framework plugin for services using Gladia's API.
10 versions - Latest release: 4 days ago - 799 downloads last month - 5,538 stars on GitHub - 1 maintainer
mseep-elevenlabs-mcp 0.2.1
ElevenLabs MCP Server
1 version - Latest release: 4 days ago - 1 maintainer
Top 8.9% on pypi.org
goodbyecaptcha 2.4.2
An asynchronized Python library to automate solving ReCAPTCHA v2 by images/audio
8 versions - Latest release: over 4 years ago - 1 dependent repositories - 440 downloads last month - 180 stars on GitHub - 1 maintainer
ms-funcodec 0.2.0
FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec
1 version - Latest release: about 1 month ago - 1.25 thousand downloads last month - 392 stars on GitHub - 1 maintainer
rev-reverb 0.1.0
A simplified Python package to interact with the reverb models
1 version - Latest release: 5 months ago - 67 downloads last month - 389 stars on GitHub - 1 maintainer
melissa 1000.0.34
A lovely virtual assistant for OS X, Windows and Linux systems.
2 versions - Latest release: over 7 years ago - 1 dependent repositories - 81 downloads last month - 490 stars on GitHub - 1 maintainer
iago 0.2.8
The package contains your python assistant for Speech Recognition and Text to Speech
10 versions - Latest release: over 4 years ago - 1 dependent repositories - 402 downloads last month - 2 stars on GitHub - 1 maintainer
auroraapi 0.2.0
Python SDK for Aurora
16 versions - Latest release: almost 7 years ago - 2 dependent repositories - 446 downloads last month - 4 stars on GitHub - 1 maintainer
playwright-recaptcha 0.5.1
A library for solving reCAPTCHA v2 and v3 with Playwright
22 versions - Latest release: 10 months ago - 1 dependent repositories - 21.9 thousand downloads last month - 270 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
speechmatics-python 3.0.3
Python library and CLI for Speechmatics
44 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 17.3 thousand downloads last month - 60 stars on GitHub - 1 maintainer
tatt 0.981
Tatt creates a uniform API for multiple speech-to-text (STT) services.
55 versions - Latest release: almost 6 years ago - 1 dependent repositories - 835 downloads last month - 11 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
kalliope 0.7.2
Kalliope is a modular always-on voice controlled personal assistant designed for home automation.
20 versions - Latest release: about 3 years ago - 2 dependent repositories - 313 downloads last month - 1,728 stars on GitHub - 1 maintainer
deepgram-unstable-sdk 3.8.0.dev4
The official Python SDK for the Deepgram automated speech recognition platform.
22 versions - Latest release: 5 months ago - 600 downloads last month - 286 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
stt-tflite 0.10.0a10
A library for doing speech recognition using a Coqui STT model
6 versions - Latest release: over 3 years ago - 2 dependent repositories - 984 downloads last month - 2,408 stars on GitHub - 1 maintainer
coqui-stt-training 1.4.0
Training code for Coqui STT
33 versions - Latest release: over 2 years ago - 1 dependent repositories - 638 downloads last month - 2,408 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
stt 1.4.0
A library for doing speech recognition using a Coqui STT model
35 versions - Latest release: over 2 years ago - 20 dependent repositories - 5.69 thousand downloads last month - 2,408 stars on GitHub - 2 maintainers
iarahealth-stt-training 1.5.2
Training code for Coqui STT
2 versions - Latest release: over 1 year ago - 89 downloads last month - 2,408 stars on GitHub - 1 maintainer
stt-gpu 0.10.0a4
A library for doing speech recognition using a Coqui STT model
1 version - Latest release: about 4 years ago - 1 dependent repositories - 29 downloads last month - 2,401 stars on GitHub - 1 maintainer
Top 8.4% on pypi.org
pvcheetah 2.1.3
Cheetah Speech-to-Text Engine.
16 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 1.26 thousand downloads last month - 619 stars on GitHub - 1 maintainer
pvcheetahdemo 2.1.3
Cheetah speech-to-text engine demos
20 versions - Latest release: about 1 month ago - 1 dependent repositories - 660 downloads last month - 619 stars on GitHub - 1 maintainer
watson-streaming 0.0.13
Speech to text transcription in real-time using IBM Watson
13 versions - Latest release: almost 5 years ago - 2 dependent repositories - 533 downloads last month - 3 stars on GitHub - 1 maintainer
pvleoparddemo 2.0.5
Leopard speech-to-text engine demos
24 versions - Latest release: 2 months ago - 1 dependent repositories - 608 downloads last month - 422 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
pvleopard 2.0.5
Leopard Speech-to-Text Engine.
25 versions - Latest release: 2 months ago - 1 dependent package - 4 dependent repositories - 2.33 thousand downloads last month - 422 stars on GitHub - 1 maintainer
werpy 3.0.2
A powerful yet lightweight Python package to calculate and analyze the Word Error Rate (WER).
16 versions - Latest release: 16 days ago - 1 dependent repositories - 5.74 thousand downloads last month - 12 stars on GitHub - 1 maintainer
usttc 0.0.8
Unified Speech-to-text Client
8 versions - Latest release: about 2 years ago - 1 dependent repositories - 181 downloads last month - 4 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
mltu 1.2.5
Machine Learning Training Utilities (MLTU) for TensorFlow and PyTorch
39 versions - Latest release: 12 months ago - 3 dependent repositories - 3.16 thousand downloads last month - 235 stars on GitHub - 1 maintainer
anaouder 1.0.3
Breton language speech-to-text tools
22 versions - Latest release: about 1 month ago - 1.11 thousand downloads last month - 10 stars on GitHub - 1 maintainer
speech_tool 0.0.8
An easy to use Python package for the Whisper STT model
3 versions - Latest release: about 2 months ago - 193 downloads last month - 0 stars on GitHub - 1 maintainer
drdictaphone-neovim-plugin 0.9.1
DrDictaphone plugin for Neovim
3 versions - Latest release: over 1 year ago - 75 downloads last month - 6 stars on GitHub - 1 maintainer
tafrigh 1.7.6
Transcribe audio to text and create SRT and VTT files using Whisper models and wit.ai.
33 versions - Latest release: 17 days ago - 1.84 thousand downloads last month - 125 stars on GitHub - 1 maintainer
armspeech 0.1.4 💰
ArmSpeech is an offline Armenian speech recognition library (speech-to-text) and CLI tool based o...
3 versions - Latest release: almost 2 years ago - 36 downloads last month - 4 stars on GitHub - 1 maintainer
whisper-s2t 1.3.1
An Optimized Speech-to-Text Pipeline for the Whisper Model.
5 versions - Latest release: about 1 year ago - 2 dependent packages - 3.61 thousand downloads last month - 382 stars on GitHub - 1 maintainer
acapela-downloader-py 0.1.7
Acapela pwned but in Python.
15 versions - Latest release: almost 2 years ago - 829 downloads last month - 102 stars on GitHub - 1 maintainer
whisper-turbo-mlx 0.0.1
Whisper Turbo in MLX
17 versions - Latest release: 6 months ago - 527 downloads last month - 202 stars on GitHub - 1 maintainer
pronunciation-dictionary 0.0.6
Library to save and load pronunciation dictionaries (language-independent).
5 versions - Latest release: about 1 year ago - 8 dependent packages - 6 dependent repositories - 249 downloads last month - 3 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
huggingsound 0.1.6 💰
HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools.
8 versions - Latest release: over 2 years ago - 7 dependent repositories - 1.08 thousand downloads last month - 447 stars on GitHub - 1 maintainer
speechbrain-geoph9 0.5.12a0
All-in-one speech toolkit in pure Python and Pytorch
1 version - Latest release: over 2 years ago - 1 dependent repositories - 43 downloads last month - 8,811 stars on GitHub - 1 maintainer
whisperer-ml 0.1.7
Go from raw audio to a text-audio dataset with OpenAI's Whisper
7 versions - Latest release: about 2 years ago - 1 dependent repositories - 266 downloads last month - 8,811 stars on GitHub - 1 maintainer
deepasr 0.1.2
Keras(Tensorflow) implementations of Automatic Speech Recognition
12 versions - Latest release: over 3 years ago - 1 dependent repositories - 249 downloads last month - 23 stars on GitHub - 1 maintainer
speechloop 0.0.3
A "keep it simple" collection of many speech recognition engines... Designed to help answer - wha...
3 versions - Latest release: almost 3 years ago - 1 dependent repositories - 149 downloads last month - 19 stars on GitHub - 1 maintainer
realtimestt 0.3.101
A fast Voice Activity Detection and Transcription System
44 versions - Latest release: 8 days ago - 1 dependent repositories - 22.1 thousand downloads last month - 1,465 stars on GitHub - 1 maintainer
Top 2.8% on pypi.org
adapt-parser 1.0.0
A text-to-intent parsing framework.
22 versions - Latest release: over 3 years ago - 11 dependent packages - 58 dependent repositories - 4.47 thousand downloads last month - 714 stars on GitHub - 1 maintainer
quail 0.2.2
A python toolbox for analyzing and plotting free recall data
8 versions - Latest release: over 1 year ago - 1 dependent repositories - 254 downloads last month - 22 stars on GitHub - 1 maintainer
pellipop 0.9.2
A graphical and command-line tool to extract key frames from videos along with their retranscript...
62 versions - Latest release: about 1 month ago - 1 dependent repositories - 1.56 thousand downloads last month - 2 stars on GitHub - 1 maintainer
whispercpp-kit 0.1.5
A toolkit for whisper.cpp with audio processing and model management
6 versions - Latest release: 3 months ago - 594 downloads last month - 1 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
whisper-ctranslate2 0.5.2
Whisper command line client that uses CTranslate2 and faster-whisper
52 versions - Latest release: 4 months ago - 1 dependent repositories - 18.9 thousand downloads last month - 903 stars on GitHub - 1 maintainer
geniusrise-audio 0.1.12
audio bolts for geniusrise
13 versions - Latest release: about 1 year ago - 387 downloads last month - 2 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
silero 0.4.1 💰
Silero Models: pre-trained enterprise-grade STT / TTS models and benchmarks.
4 versions - Latest release: almost 3 years ago - 4 dependent packages - 3 dependent repositories - 9.55 thousand downloads last month - 4,944 stars on GitHub - 1 maintainer
voicesynth 0.2.3 💰
Package for realistic voice synthesis
24 versions - Latest release: 6 months ago - 396 downloads last month - 4,944 stars on GitHub - 1 maintainer
speech-dataset-generator 1.0.0
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset type...
1 version - Latest release: about 1 year ago - 45 downloads last month - 135 stars on GitHub - 1 maintainer
textops-api-v1 0.1.2
Python client for TextOps transcription API
3 versions - Latest release: 9 days ago - 1 maintainer
scraibe-webui 0.2.1
A web interface for the ScAIbe speech-to-text transcription tool
5 versions - Latest release: 5 months ago - 180 downloads last month - 37 stars on GitHub - 1 maintainer
speak-now 0.1.3
A locally-hosted, low-latency speech-to-text solution with LLM integration.
4 versions - Latest release: about 2 months ago - 234 downloads last month - 6,548 stars on GitHub - 1 maintainer