speech-to-text | pypi.org keywords | Ecosyste.ms: Packages

pypi.org "speech-to-text" keyword

View the packages on the pypi.org package registry that are tagged with the "speech-to-text" keyword.

myproject101 1.0

Speech recognition module for Python, supporting several engines and APIs, online and offline.
1 version - Latest release: about 4 years ago - 1 dependent repositories - 60 downloads last month - 8,691 stars on GitHub - 1 maintainer

Top 1.2% on pypi.org

speechbrain 1.0.3

All-in-one speech toolkit in pure Python and Pytorch
17 versions - Latest release: 12 days ago - 32 dependent packages - 102 dependent repositories - 1.32 million downloads last month - 7,821 stars on GitHub - 2 maintainers

voicegain-speech 1.118.0

Voicegain Speech-to-Text Python SDK
158 versions - Latest release: 3 days ago - 1 dependent repositories - 3.55 thousand downloads last month - 3 stars on GitHub - 1 maintainer

piper-whistle 1.6.253

CLI tool to manage piper voices.
26 versions - Latest release: over 1 year ago - 853 downloads last month - 3 stars on gitlab.com - 1 maintainer

asrecognition 0.0.4 💰

ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
4 versions - Latest release: over 3 years ago - 1 dependent repositories - 171 downloads last month - 51 stars on GitHub - 1 maintainer

Top 0.8% on pypi.org

speechrecognition 3.14.2

Library for performing speech recognition, with support for several engines and APIs, online and ...
65 versions - Latest release: 27 days ago - 61 dependent packages - 908 dependent repositories - 1.45 million downloads last month - 8,019 stars on GitHub - 2 maintainers

Top 9.9% on pypi.org

tensorflowasr 2.1.0

Almost State-of-the-art Automatic Speech Recognition using Tensorflow 2
45 versions - Latest release: 10 months ago - 1 dependent repositories - 840 downloads last month - 965 stars on GitHub - 1 maintainer

asrt-sdk 1.2.0

A python sdk for ASRT Speech Recognition Toolkit
5 versions - Latest release: almost 3 years ago - 1 dependent repositories - 187 downloads last month - 51 stars on GitHub - 1 maintainer

webspeechrecognition 0.1.4

A Python library for speech-to-text integration using Selenium WebDriver.
1 version - Latest release: 4 months ago - 102 downloads last month - 1 maintainer

Top 1.6% on pypi.org

faster-whisper 1.1.1

Faster Whisper transcription with CTranslate2
19 versions - Latest release: 4 months ago - 53 dependent packages - 35 dependent repositories - 1.38 million downloads last month - 9,301 stars on GitHub - 2 maintainers

Top 1.3% on pypi.org

deepspeech 0.9.3

A library for running inference on a DeepSpeech model
100 versions - Latest release: over 4 years ago - 6 dependent packages - 240 dependent repositories - 17.2 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer

ef-sherpa-onnx 1.9.21.dev1

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen K...
1 version - Latest release: 12 months ago - 33 downloads last month - 5,645 stars on GitHub - 1 maintainer

Top 7.7% on pypi.org

sherpa-onnx 1.11.3

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen K...
132 versions - Latest release: 16 days ago - 1 dependent repositories - 58.1 thousand downloads last month - 5,645 stars on GitHub - 2 maintainers

ppasr 3.0.2

Automatic speech recognition toolkit on PaddlePaddle
43 versions - Latest release: 1 day ago - 1 dependent repositories - 894 downloads last month - 852 stars on GitHub - 1 maintainer

masr 3.0.2

Automatic speech recognition toolkit on Pytorch
36 versions - Latest release: 1 day ago - 1 dependent repositories - 552 downloads last month - 588 stars on GitHub - 1 maintainer

vernacular-ai-speech 0.1.2

Vernacular Speech API python client
3 versions - Latest release: over 4 years ago - 1 dependent repositories - 173 downloads last month - 21 stars on GitHub - 1 maintainer

vaani-speech-to-text 0.1.2

Vaani is an open-source, AI-powered speech-to-text desktop app. Vaani (वाणी) refers to "speech" o...
3 versions - Latest release: 2 days ago - 174 downloads last month - 0 stars on GitHub - 1 maintainer

automatic-speech-recognition 1.0.4

Distill the Automatic Speech Recognition (TensorFlow)
3 versions - Latest release: about 5 years ago - 1 dependent repositories - 533 downloads last month - 224 stars on GitHub - 1 maintainer

voskintentvoiceconverthanzi 0.3.32

Offline open source speech recognition API based on Kaldi and Vosk
1 version - Latest release: over 3 years ago - 1 dependent repositories - 32 downloads last month - 9,253 stars on GitHub - 1 maintainer

topai-faster-whisper 1.0.4

Faster Whisper transcription with CTranslate2
5 versions - Latest release: 6 months ago - 163 downloads last month - 15,431 stars on GitHub - 1 maintainer

unhallucinated-faster-whisper 1.0.3

Faster Whisper transcription with CTranslate2
5 versions - Latest release: 3 months ago - 240 downloads last month - 15,431 stars on GitHub - 1 maintainer

airunner 4.1.3 💰

Run local opensource AI models (Stable Diffusion, LLMs, TTS, STT, chatbots) in a lightweight Pyth...
167 versions - Latest release: 1 day ago - 11.1 thousand downloads last month - 237 stars on GitHub - 1 maintainer

pvoctopus 2.0.2

⚠️ DEPRECATED: This package is no longer maintained.
15 versions - Latest release: 3 days ago - 1 dependent package - 1 dependent repositories - 387 downloads last month - 36 stars on GitHub - 1 maintainer

funcodec 0.2.0

FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec
1 version - Latest release: over 1 year ago - 1 dependent package - 298 downloads last month - 396 stars on GitHub - 1 maintainer

openlrc 1.6.0

Transcribe (whisper) and translate (gpt) voice into LRC file.
31 versions - Latest release: 4 months ago - 1.09 thousand downloads last month - 407 stars on GitHub - 1 maintainer

Top 1.7% on pypi.org

jiwer 3.1.0

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
22 versions - Latest release: 3 months ago - 43 dependent packages - 1,125 dependent repositories - 826 thousand downloads last month - 549 stars on GitHub - 1 maintainer

fastrtc 0.0.31

The realtime communication library for Python
74 versions - Latest release: 2 months ago - 18.5 thousand downloads last month - 3,432 stars on GitHub - 1 maintainer

live-translation 0.5.0

A real-time translation tool using Whisper & Opus-MT
7 versions - Latest release: 2 days ago - 673 downloads last month - 3 stars on GitHub - 1 maintainer

whisperx-numpy2-compatibility 0.1.1 💰

A compatibility fix to allow whisperx to work with other packages that require numpy>2. Should no...
2 versions - Latest release: 5 months ago - 200 downloads last month - 14,976 stars on GitHub - 1 maintainer

slikts-whisperx 3.3.1 💰

Time-Accurate Automatic Speech Recognition using Whisper.
1 version - Latest release: 3 months ago - 53 downloads last month - 14,976 stars on GitHub - 1 maintainer

whisperx-karaoke 0.1.1 💰

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
2 versions - Latest release: 12 months ago - 102 downloads last month - 14,976 stars on GitHub - 1 maintainer

whisperx 3.3.2 💰

Time-Accurate Automatic Speech Recognition using Whisper.
10 versions - Latest release: 9 days ago - 3 dependent packages - 118 thousand downloads last month - 14,976 stars on GitHub - 2 maintainers

mediacatch-s2t 2.0.1

Upload a media file and get the transcription link.
17 versions - Latest release: over 1 year ago - 273 downloads last month - 0 stars on GitHub - 1 maintainer

srtvoiceext 0.1.2

A command line interface to combine text information from subtitles with voice data in the video.
6 versions - Latest release: over 7 years ago - 1 dependent repositories - 133 downloads last month - 19 stars on GitHub - 1 maintainer

pvoctopusdemo 2.0.2

⚠️ DEPRECATED: This package is no longer maintained.
15 versions - Latest release: 3 days ago - 1 dependent repositories - 387 downloads last month - 36 stars on GitHub - 1 maintainer

nonocaptcha 2.0.1

An asynchronous Python library to automate solving ReCAPTCHA v2 by audio
84 versions - Latest release: about 6 years ago - 1 dependent repositories - 3.22 thousand downloads last month - 895 stars on GitHub - 1 maintainer

phonexia-transcription-normalization-client 1.0.1

Transcription Normalization Client
1 version - Latest release: 3 days ago - 1 maintainer

transcription-normalization-client 1.0.1

Transcription Normalization Client
1 version - Latest release: 3 days ago - 1 maintainer

Top 4.6% on pypi.org

deepspeech-tflite 0.9.3

A library for running inference on a DeepSpeech model
41 versions - Latest release: over 4 years ago - 2 dependent packages - 4 dependent repositories - 4.82 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer

Top 3.3% on pypi.org

deepspeech-gpu 0.9.3

A library for running inference on a DeepSpeech model
98 versions - Latest release: over 4 years ago - 12 dependent repositories - 7.27 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer

mlx-audio 0.0.4 💰

MLX-Audio is a package for inference of text-to-speech (TTS) and speech-to-speech (STS) models lo...
4 versions - Latest release: 8 days ago - 2.59 thousand downloads last month - 474 stars on GitHub - 1 maintainer

commoncorrections 1.0.12

A small python implementation of common ASR corrections
13 versions - Latest release: almost 3 years ago - 1 dependent repositories - 440 downloads last month - 3 stars on GitHub - 1 maintainer

esperanto 1.2.1

A unified interface for various AI model providers
43 versions - Latest release: 7 days ago - 1.63 thousand downloads last month - 5 stars on GitHub - 1 maintainer

faster-whisper-hotkey 0.1.4

Push-to-talk transcription using faster-whisper
5 versions - Latest release: 4 days ago - 419 downloads last month - 3 stars on GitHub - 1 maintainer

verbatim 1.1.0

high quality multi-lingual speech to text
10 versions - Latest release: 2 months ago - 383 downloads last month - 18 stars on GitHub - 1 maintainer

pafts 1.0.1

Library that preprocesses audio for TTS.
4 versions - Latest release: 5 months ago - 211 downloads last month - 8 stars on GitHub - 1 maintainer

pywer 0.1.1

A simple Python package to calculate word error rate (WER).
2 versions - Latest release: about 4 years ago - 1 dependent repositories - 139 downloads last month - 5 stars on GitHub - 1 maintainer

livekit-plugins-gladia 1.0.12

Agent Framework plugin for services using Gladia's API.
10 versions - Latest release: 4 days ago - 799 downloads last month - 5,538 stars on GitHub - 1 maintainer

mseep-elevenlabs-mcp 0.2.1

ElevenLabs MCP Server
1 version - Latest release: 4 days ago - 1 maintainer

Top 8.9% on pypi.org

goodbyecaptcha 2.4.2

An asynchronous Python library to automate solving ReCAPTCHA v2 by images/audio
8 versions - Latest release: over 4 years ago - 1 dependent repositories - 440 downloads last month - 180 stars on GitHub - 1 maintainer

ms-funcodec 0.2.0

FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec
1 version - Latest release: about 1 month ago - 1.25 thousand downloads last month - 392 stars on GitHub - 1 maintainer

rev-reverb 0.1.0

A simplified Python package to interact with the reverb models
1 version - Latest release: 5 months ago - 67 downloads last month - 389 stars on GitHub - 1 maintainer

melissa 1000.0.34

A lovely virtual assistant for OS X, Windows and Linux systems.
2 versions - Latest release: over 7 years ago - 1 dependent repositories - 81 downloads last month - 490 stars on GitHub - 1 maintainer

iago 0.2.8

The package contains your python assistant for Speech Recognition and Text to Speech
10 versions - Latest release: over 4 years ago - 1 dependent repositories - 402 downloads last month - 2 stars on GitHub - 1 maintainer

auroraapi 0.2.0

Python SDK for Aurora
16 versions - Latest release: almost 7 years ago - 2 dependent repositories - 446 downloads last month - 4 stars on GitHub - 1 maintainer

playwright-recaptcha 0.5.1

A library for solving reCAPTCHA v2 and v3 with Playwright
22 versions - Latest release: 10 months ago - 1 dependent repositories - 21.9 thousand downloads last month - 270 stars on GitHub - 1 maintainer

Top 9.1% on pypi.org

speechmatics-python 3.0.3

Python library and CLI for Speechmatics
44 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 17.3 thousand downloads last month - 60 stars on GitHub - 1 maintainer

tatt 0.981

Tatt creates a uniform API for multiple speech-to-text (STT) services.
55 versions - Latest release: almost 6 years ago - 1 dependent repositories - 835 downloads last month - 11 stars on GitHub - 1 maintainer

Top 8.8% on pypi.org

kalliope 0.7.2

Kalliope is a modular always-on voice controlled personal assistant designed for home automation.
20 versions - Latest release: about 3 years ago - 2 dependent repositories - 313 downloads last month - 1,728 stars on GitHub - 1 maintainer

deepgram-unstable-sdk 3.8.0.dev4

The official Python SDK for the Deepgram automated speech recognition platform.
22 versions - Latest release: 5 months ago - 600 downloads last month - 286 stars on GitHub - 1 maintainer

Top 8.7% on pypi.org

stt-tflite 0.10.0a10

A library for doing speech recognition using a Coqui STT model
6 versions - Latest release: over 3 years ago - 2 dependent repositories - 984 downloads last month - 2,408 stars on GitHub - 1 maintainer

coqui-stt-training 1.4.0

Training code for Coqui STT
33 versions - Latest release: over 2 years ago - 1 dependent repositories - 638 downloads last month - 2,408 stars on GitHub - 1 maintainer

Top 4.8% on pypi.org

stt 1.4.0

A library for doing speech recognition using a Coqui STT model
35 versions - Latest release: over 2 years ago - 20 dependent repositories - 5.69 thousand downloads last month - 2,408 stars on GitHub - 2 maintainers

iarahealth-stt-training 1.5.2

Training code for Coqui STT
2 versions - Latest release: over 1 year ago - 89 downloads last month - 2,408 stars on GitHub - 1 maintainer

stt-gpu 0.10.0a4

A library for doing speech recognition using a Coqui STT model
1 version - Latest release: about 4 years ago - 1 dependent repositories - 29 downloads last month - 2,401 stars on GitHub - 1 maintainer

Top 8.4% on pypi.org

pvcheetah 2.1.3

Cheetah Speech-to-Text Engine.
16 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 1.26 thousand downloads last month - 619 stars on GitHub - 1 maintainer

pvcheetahdemo 2.1.3

Cheetah speech-to-text engine demos
20 versions - Latest release: about 1 month ago - 1 dependent repositories - 660 downloads last month - 619 stars on GitHub - 1 maintainer

watson-streaming 0.0.13

Speech to text transcription in real-time using IBM Watson
13 versions - Latest release: almost 5 years ago - 2 dependent repositories - 533 downloads last month - 3 stars on GitHub - 1 maintainer

pvleoparddemo 2.0.5

Leopard speech-to-text engine demos
24 versions - Latest release: 2 months ago - 1 dependent repositories - 608 downloads last month - 422 stars on GitHub - 1 maintainer

Top 6.0% on pypi.org

pvleopard 2.0.5

Leopard Speech-to-Text Engine.
25 versions - Latest release: 2 months ago - 1 dependent package - 4 dependent repositories - 2.33 thousand downloads last month - 422 stars on GitHub - 1 maintainer

werpy 3.0.2

A powerful yet lightweight Python package to calculate and analyze the Word Error Rate (WER).
16 versions - Latest release: 16 days ago - 1 dependent repositories - 5.74 thousand downloads last month - 12 stars on GitHub - 1 maintainer

usttc 0.0.8

Unified Speech-to-text Client
8 versions - Latest release: about 2 years ago - 1 dependent repositories - 181 downloads last month - 4 stars on GitHub - 1 maintainer

Top 6.6% on pypi.org

mltu 1.2.5

Machine Learning Training Utilities (MLTU) for TensorFlow and PyTorch
39 versions - Latest release: 12 months ago - 3 dependent repositories - 3.16 thousand downloads last month - 235 stars on GitHub - 1 maintainer

anaouder 1.0.3

Breton language speech-to-text tools
22 versions - Latest release: about 1 month ago - 1.11 thousand downloads last month - 10 stars on GitHub - 1 maintainer

speech_tool 0.0.8

An easy to use Python package for the Whisper STT model
3 versions - Latest release: about 2 months ago - 193 downloads last month - 0 stars on GitHub - 1 maintainer

drdictaphone-neovim-plugin 0.9.1

DrDictaphone plugin for Neovim
3 versions - Latest release: over 1 year ago - 75 downloads last month - 6 stars on GitHub - 1 maintainer

tafrigh 1.7.6

Transcribe audio to text and create SRT and VTT files using Whisper models and wit.ai technology.
33 versions - Latest release: 17 days ago - 1.84 thousand downloads last month - 125 stars on GitHub - 1 maintainer

armspeech 0.1.4 💰

ArmSpeech is an offline Armenian speech recognition library (speech-to-text) and CLI tool based o...
3 versions - Latest release: almost 2 years ago - 36 downloads last month - 4 stars on GitHub - 1 maintainer

whisper-s2t 1.3.1

An Optimized Speech-to-Text Pipeline for the Whisper Model.
5 versions - Latest release: about 1 year ago - 2 dependent packages - 3.61 thousand downloads last month - 382 stars on GitHub - 1 maintainer

acapela-downloader-py 0.1.7

Acapela pwned but in Python.
15 versions - Latest release: almost 2 years ago - 829 downloads last month - 102 stars on GitHub - 1 maintainer

whisper-turbo-mlx 0.0.1

Whisper Turbo in MLX
17 versions - Latest release: 6 months ago - 527 downloads last month - 202 stars on GitHub - 1 maintainer

pronunciation-dictionary 0.0.6

Library to save and load pronunciation dictionaries (language-independent).
5 versions - Latest release: about 1 year ago - 8 dependent packages - 6 dependent repositories - 249 downloads last month - 3 stars on GitHub - 1 maintainer

Top 6.0% on pypi.org

huggingsound 0.1.6 💰

HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools.
8 versions - Latest release: over 2 years ago - 7 dependent repositories - 1.08 thousand downloads last month - 447 stars on GitHub - 1 maintainer

speechbrain-geoph9 0.5.12a0

All-in-one speech toolkit in pure Python and Pytorch
1 version - Latest release: over 2 years ago - 1 dependent repositories - 43 downloads last month - 8,811 stars on GitHub - 1 maintainer

whisperer-ml 0.1.7

Go from raw audio to a text-audio dataset with OpenAI's Whisper
7 versions - Latest release: about 2 years ago - 1 dependent repositories - 266 downloads last month - 8,811 stars on GitHub - 1 maintainer

deepasr 0.1.2

Keras(Tensorflow) implementations of Automatic Speech Recognition
12 versions - Latest release: over 3 years ago - 1 dependent repositories - 249 downloads last month - 23 stars on GitHub - 1 maintainer

speechloop 0.0.3

A "keep it simple" collection of many speech recognition engines... Designed to help answer - wha...
3 versions - Latest release: almost 3 years ago - 1 dependent repositories - 149 downloads last month - 19 stars on GitHub - 1 maintainer

realtimestt 0.3.101

A fast Voice Activity Detection and Transcription System
44 versions - Latest release: 8 days ago - 1 dependent repositories - 22.1 thousand downloads last month - 1,465 stars on GitHub - 1 maintainer

Top 2.8% on pypi.org

adapt-parser 1.0.0

A text-to-intent parsing framework.
22 versions - Latest release: over 3 years ago - 11 dependent packages - 58 dependent repositories - 4.47 thousand downloads last month - 714 stars on GitHub - 1 maintainer

quail 0.2.2

A python toolbox for analyzing and plotting free recall data
8 versions - Latest release: over 1 year ago - 1 dependent repositories - 254 downloads last month - 22 stars on GitHub - 1 maintainer

pellipop 0.9.2

A graphical and command-line tool to extract key frames from videos along with their retranscript...
62 versions - Latest release: about 1 month ago - 1 dependent repositories - 1.56 thousand downloads last month - 2 stars on GitHub - 1 maintainer

whispercpp-kit 0.1.5

A toolkit for whisper.cpp with audio processing and model management
6 versions - Latest release: 3 months ago - 594 downloads last month - 1 stars on GitHub - 1 maintainer

Top 9.0% on pypi.org

whisper-ctranslate2 0.5.2

Whisper command line client that uses CTranslate2 and faster-whisper
52 versions - Latest release: 4 months ago - 1 dependent repositories - 18.9 thousand downloads last month - 903 stars on GitHub - 1 maintainer

geniusrise-audio 0.1.12

audio bolts for geniusrise
13 versions - Latest release: about 1 year ago - 387 downloads last month - 2 stars on GitHub - 1 maintainer

Top 3.9% on pypi.org

silero 0.4.1 💰

Silero Models: pre-trained enterprise-grade STT / TTS models and benchmarks.
4 versions - Latest release: almost 3 years ago - 4 dependent packages - 3 dependent repositories - 9.55 thousand downloads last month - 4,944 stars on GitHub - 1 maintainer

voicesynth 0.2.3 💰

Package for realistic voice synthesis
24 versions - Latest release: 6 months ago - 396 downloads last month - 4,944 stars on GitHub - 1 maintainer

speech-dataset-generator 1.0.0

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset type...
1 version - Latest release: about 1 year ago - 45 downloads last month - 135 stars on GitHub - 1 maintainer

textops-api-v1 0.1.2

Python client for TextOps transcription API
3 versions - Latest release: 9 days ago - 1 maintainer

scraibe-webui 0.2.1

A web interface for the ScrAIbe speech-to-text transcription tool
5 versions - Latest release: 5 months ago - 180 downloads last month - 37 stars on GitHub - 1 maintainer

speak-now 0.1.3

A locally-hosted, low-latency speech-to-text solution with LLM integration.
4 versions - Latest release: about 2 months ago - 234 downloads last month - 6,548 stars on GitHub - 1 maintainer
