An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "speech-recognition" keyword

View the packages on the pypi.org package registry that are tagged with the "speech-recognition" keyword.

chunkformer 1.2.2
ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription
5 versions - Latest release: about 4 hours ago - 532 downloads last month - 67 stars on GitHub - 1 maintainer
Top 1.3% on pypi.org
deepspeech 0.9.3
A library for running inference on a DeepSpeech model
100 versions - Latest release: almost 5 years ago - 6 dependent packages - 240 dependent repositories - 4.4 thousand downloads last month - 24,174 stars on GitHub - 1 maintainer
transcribe-align-textgrid 0.2.4
Create for-aligned transcription TextGrids from Audio
5 versions - Latest release: 8 months ago - 21 downloads last month - 18 stars on GitHub - 1 maintainer
Top 0.1% on pypi.org
transformers 4.57.1
State-of-the-art Machine Learning for JAX, PyTorch and TensorFlow
200 versions - Latest release: about 1 month ago - 2,589 dependent packages - 31,800 dependent repositories - 107 million downloads last month - 134,132 stars on GitHub - 4 maintainers
fms-acceleration-peft 0.4.2
FMS Acceleration for PeFT
15 versions - Latest release: 5 months ago - 528 downloads last month - 144,208 stars on GitHub - 1 maintainer
Top 2.0% on pypi.org
pvporcupine 3.0.5
Porcupine wake word engine.
37 versions - Latest release: 9 months ago - 14 dependent packages - 52 dependent repositories - 28.7 thousand downloads last month - 4,475 stars on GitHub - 1 maintainer
Top 0.8% on pypi.org
speechrecognition 3.14.3
Library for performing speech recognition, with support for several engines and APIs, online and ...
66 versions - Latest release: 6 months ago - 61 dependent packages - 908 dependent repositories - 2.67 million downloads last month - 8,019 stars on GitHub - 2 maintainers
simplepythonwer 1.0.3
A small basic python implementation of WER (word error rate) and levenshtein
4 versions - Latest release: over 4 years ago - 1 dependent repositories - 24 downloads last month - 0 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
pocketsphinx 5.0.4
Official Python bindings for PocketSphinx
26 versions - Latest release: 10 months ago - 13 dependent packages - 328 dependent repositories - 34.5 thousand downloads last month - 3,773 stars on GitHub - 2 maintainers
Top 0.7% on pypi.org
openvino-dev 2024.6.0
OpenVINO(TM) Development Tools
37 versions - Latest release: 11 months ago - 38 dependent packages - 498 dependent repositories - 268 thousand downloads last month - 6,310 stars on GitHub - 1 maintainer
Top 0.6% on pypi.org
pytorch-pretrained-bert 0.6.2
PyTorch version of Google AI BERT model with script to load Google pre-trained models
10 versions - Latest release: over 6 years ago - 11 dependent packages - 940 dependent repositories - 82.3 thousand downloads last month - 129,185 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
espnet 0.10.6
ESPnet: end-to-end speech processing toolkit
37 versions - Latest release: almost 4 years ago - 6 dependent packages - 217 dependent repositories - 30.6 thousand downloads last month - 7,825 stars on GitHub - 5 maintainers
comparisonframe 0.0.5
A simple tool to compare textual data against validation sets.
6 versions - Latest release: about 1 year ago - 35 downloads last month - 144,208 stars on GitHub - 1 maintainer
asrtoolkit 0.2.4
The GreenKey ASRToolkit provides tools for automatic speech recognition (ASR) file conversion and...
24 versions - Latest release: over 4 years ago - 4 dependent repositories - 323 downloads last month - 30 stars on GitHub - 2 maintainers
fonadalabs 2.0.6
Unified Python SDK for FonadaLabs Text-to-Speech, Automatic Speech Recognition, and Audio Denoisi...
9 versions - Latest release: 1 day ago - 825 downloads last month - 1 maintainer
Top 1.6% on pypi.org
faster-whisper 1.2.1
Faster Whisper transcription with CTranslate2
21 versions - Latest release: 14 days ago - 53 dependent packages - 35 dependent repositories - 1.96 million downloads last month - 9,301 stars on GitHub - 2 maintainers
Top 0.7% on pypi.org
pytorch-transformers 1.2.0
Repository of pre-trained NLP Transformer models: BERT & RoBERTa, GPT & GPT-2, Transformer-XL, XL...
4 versions - Latest release: about 6 years ago - 16 dependent packages - 772 dependent repositories - 56 thousand downloads last month - 129,185 stars on GitHub - 1 maintainer
whisper-cpp-python 0.2.0 💰
A Python wrapper for whisper.cpp
12 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repositories - 872 downloads last month - 39,693 stars on GitHub - 1 maintainer
grouped-query-attention-pytorch 0.3.0
grouped-query-attention-pytorch
4 versions - Latest release: over 1 year ago - 496 downloads last month - 144,208 stars on GitHub - 1 maintainer
fms-acceleration-ilab 0.1.0
FMS Acceleration Plugin for Functionalities Used in Instruct Lab Training
1 version - Latest release: over 1 year ago - 6 downloads last month - 144,208 stars on GitHub - 1 maintainer
nlptutti 0.0.2
Korean STT (Speech-to-Text) error rate calculation package
12 versions - Latest release: about 3 years ago - 1 dependent repositories - 1.11 thousand downloads last month - 63 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
speechbrain 1.0.3
All-in-one speech toolkit in pure Python and Pytorch
17 versions - Latest release: 7 months ago - 32 dependent packages - 102 dependent repositories - 1.09 million downloads last month - 7,821 stars on GitHub - 2 maintainers
audiomate 6.0.0
Audiomate is a library for working with audio datasets.
10 versions - Latest release: over 5 years ago - 3 dependent repositories - 270 downloads last month - 138 stars on GitHub - 1 maintainer
pyautosrt 0.2.9
pyautosrt is a python based desktop app to generate subtitle and translated subtitle file
28 versions - Latest release: over 2 years ago - 71 downloads last month - 184 stars on GitHub - 1 maintainer
openav 1.0.0a21
OpenAV
17 versions - Latest release: 9 months ago - 26 downloads last month - 3 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
stt-tflite 0.10.0a10
A library for doing speech recognition using a Coqui STT model
6 versions - Latest release: over 4 years ago - 2 dependent repositories - 94 downloads last month - 2,511 stars on GitHub - 1 maintainer
axlearn 0.0.1
AXLearn
3 versions - Latest release: about 2 years ago - 99 downloads last month - 144,208 stars on GitHub - 1 maintainer
deep-brief 0.2.0
A video analysis application for presentation feedback
2 versions - Latest release: 2 days ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
rev-ai 2.21.0
Rev AI makes speech applications easy to build!
34 versions - Latest release: 12 months ago - 2 dependent packages - 10 dependent repositories - 47.5 thousand downloads last month - 37 stars on GitHub - 3 maintainers
Top 6.7% on pypi.org
pvcobra 2.1.0
Cobra voice activity detection (VAD) engine
21 versions - Latest release: about 2 months ago - 3 dependent packages - 4 dependent repositories - 598 downloads last month - 232 stars on GitHub - 1 maintainer
allophant 1.0.0
A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.
1 version - Latest release: about 1 year ago - 160 downloads last month - 25 stars on GitHub - 1 maintainer
squeezeformer 1.0.0
An Efficient Transformer for Automatic Speech Recognition
3 versions - Latest release: almost 3 years ago - 1 dependent repositories - 115 downloads last month - 124 stars on GitHub - 1 maintainer
pocketsphinx-fork 1.0.0
Forked version of [pocketsphinx-python](https://github.com/bambocher/pocketsphinx-python) which a...
1 version - Latest release: over 6 years ago - 1 dependent repositories - 6 downloads last month - 373 stars on GitHub - 2 maintainers
openspeech-core 0.4.0
Open-Source Toolkit for End-to-End Automatic Speech Recognition
4 versions - Latest release: over 3 years ago - 1 dependent repositories - 21 downloads last month - 705 stars on GitHub - 1 maintainer
Top 2.8% on pypi.org
adapt-parser 1.0.0
A text-to-intent parsing framework.
22 versions - Latest release: about 4 years ago - 11 dependent packages - 58 dependent repositories - 2.94 thousand downloads last month - 721 stars on GitHub - 1 maintainer
pvcobrademo 2.1.0
Cobra voice activity detection (VAD) engine demos.
25 versions - Latest release: about 2 months ago - 1 dependent repositories - 123 downloads last month - 137 stars on GitHub - 1 maintainer
offline-stenographer 0.1.0
A GUI application for creating transcripts from video files using WhisperX
2 versions - Latest release: about 2 months ago - 22 downloads last month - 0 stars on GitHub - 1 maintainer
whisperer-ml 0.1.7
Go from raw audio to a text-audio dataset with OpenAI's Whisper
7 versions - Latest release: over 2 years ago - 1 dependent repositories - 20 downloads last month - 9,783 stars on GitHub - 1 maintainer
sense-voice-streaming-asr 0.1.1
Real-time streaming automatic speech recognition (ASR) with support for Chinese, English, Cantone...
2 versions - Latest release: about 1 month ago - 1 maintainer
speech-recognition-api 0.1.1
Simple but extensible API for Speech Recognition.
2 versions - Latest release: almost 2 years ago - 95 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
paddlespeech-ctcdecoders 0.2.1
CTC decoders in paddlespeech
8 versions - Latest release: over 3 years ago - 1 dependent package - 1 dependent repositories - 489 downloads last month - 12,332 stars on GitHub - 1 maintainer
cli-whisperer 1.0.0
Voice to Text Tool with Smart File Management and OpenAI Formatting
2 versions - Latest release: 4 months ago - 28 downloads last month - 2 stars on GitHub - 1 maintainer
commoncorrections 1.0.12
A small python implementation of common ASR corrections
13 versions - Latest release: over 3 years ago - 1 dependent repositories - 66 downloads last month - 3 stars on GitHub - 1 maintainer
nanowakeword 1.3.2
A next-generation intelligent framework for efficiently training high-performance, custom wake wo...
9 versions - Latest release: 6 days ago - 587 downloads last month - 6 stars on GitHub - 1 maintainer
funasr-runtime 0.0.1
FunASR: A Fundamental End-to-End Speech Recognition Toolkit
1 version - Latest release: about 2 years ago - 60 downloads last month - 12,582 stars on GitHub - 1 maintainer
sounder 0.2.0
An intent recognition algorithm.
2 versions - Latest release: about 8 years ago - 1 dependent repositories - 19 downloads last month - 125 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
huggingsound 0.1.6 💰
HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools.
8 versions - Latest release: about 3 years ago - 7 dependent repositories - 396 downloads last month - 464 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
allosaurus 1.0.2
a multilingual phone recognizer
30 versions - Latest release: about 4 years ago - 4 dependent packages - 4 dependent repositories - 4.18 thousand downloads last month - 659 stars on GitHub - 1 maintainer
whisperx-karaoke 0.1.1 💰
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
2 versions - Latest release: over 1 year ago - 11 downloads last month - 15,321 stars on GitHub - 1 maintainer
speech-dataset-generator 1.0.0
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset type...
1 version - Latest release: over 1 year ago - 6 downloads last month - 135 stars on GitHub - 1 maintainer
zetascale 2.8.8 💰
LEGO blocks for AI: Build state-of-the-art models faster with modular PyTorch components.
195 versions - Latest release: 6 days ago - 72 dependent packages - 10.7 thousand downloads last month - 230 stars on GitHub - 1 maintainer
localtalk 0.1.0a4
A local/offline-capable voice assistant with speech recognition, LLM processing, and text-to-speech
4 versions - Latest release: 4 months ago - 29 downloads last month - 0 stars on GitHub - 1 maintainer
talky-dictation 0.5.0
System-wide dictation for Linux using OpenAI's Whisper AI
1 version - Latest release: 7 days ago - 1 maintainer
mlx-audio 0.2.6 💰
MLX-Audio is a package for inference of text-to-speech (TTS) and speech-to-speech (STS) models lo...
12 versions - Latest release: 7 days ago - 8.28 thousand downloads last month - 2,693 stars on GitHub - 1 maintainer
indicasr 1.0.0
Speeech Recognition for Indic languages.
1 version - Latest release: over 4 years ago - 1 dependent repositories - 19 downloads last month - 13 stars on GitHub - 1 maintainer
py-kaldi-asr 0.5.2
Simple Python/Cython interface to kaldi-asr nnet3/chain and gmm decoders
12 versions - Latest release: almost 7 years ago - 1 dependent repositories - 123 downloads last month - 170 stars on GitHub - 1 maintainer
scraibe-webui 0.2.1
A web interface for the ScAIbe speech-to-text transcription tool
5 versions - Latest release: 12 months ago - 32 downloads last month - 47 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
funasr-onnx 0.4.1
FunASR: A Fundamental End-to-End Speech Recognition Toolkit
22 versions - Latest release: over 1 year ago - 1 dependent repositories - 2.08 thousand downloads last month - 12,195 stars on GitHub - 1 maintainer
paddlespeech-ldd-ctcdecoders 0.2.0
CTC decoders in paddlespeech
1 version - Latest release: over 3 years ago - 4 downloads last month - 12,271 stars on GitHub - 1 maintainer
stark-place 1.1.0
S.T.A.R.K. Platform Library And Community Extensions
5 versions - Latest release: about 2 years ago - 22 downloads last month - 7 stars on GitHub - 1 maintainer
voicesynth 0.2.3 💰
Package for realistic voice synthesis
24 versions - Latest release: about 1 year ago - 70 downloads last month - 4,944 stars on GitHub - 1 maintainer
at16k 0.1.5
at16k is a Python library to perform automatic speech recognition or speech to text conversion.
5 versions - Latest release: about 5 years ago - 1 dependent repositories - 102 downloads last month - 130 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
kalliope 0.7.2
Kalliope is a modular always-on voice controlled personal assistant designed for home automation.
20 versions - Latest release: over 3 years ago - 2 dependent repositories - 82 downloads last month - 1,735 stars on GitHub - 1 maintainer
audio-classification-models 1.0.9
Tensorflow Audio Classification Models. https://github.com/awsaf49/audio_classification_models
10 versions - Latest release: over 3 years ago - 194 downloads last month - 12 stars on GitHub - 1 maintainer
ur-audio-sub 0.0.6 💰
Generate text captions for audio files & youtube video using OpenAI Whisper. Multiple languages s...
6 versions - Latest release: almost 3 years ago - 1 dependent repositories - 31 downloads last month - 16 stars on GitHub - 1 maintainer
funllm 0.0.1
FunASR: A Fundamental End-to-End Speech Recognition Toolkit
1 version - Latest release: about 2 years ago - 11 downloads last month - 12,582 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
parrots 1.2.0
Parrots, Automatic Speech Recognition(**ASR**), Text-To-Speech(**TTS**) toolkit
17 versions - Latest release: 9 days ago - 3 dependent repositories - 529 downloads last month - 506 stars on GitHub - 1 maintainer
slg-nimrod 0.0.11
minimal deep learning framework
11 versions - Latest release: about 1 year ago - 23 downloads last month - 2 stars on GitHub - 1 maintainer
vocoder-dictation 0.1.1
Dictation for programmers
2 versions - Latest release: over 3 years ago - 12 downloads last month - 3 stars on GitHub - 1 maintainer
speechloop 0.0.3
A "keep it simple" collection of many speech recognition engines... Designed to help answer - wha...
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 12 downloads last month - 19 stars on GitHub - 1 maintainer
myproject101 1.0
Speech recognition module for Python, supporting several engines and APIs, online and offline.
1 version - Latest release: over 4 years ago - 1 dependent repositories - 7 downloads last month - 8,717 stars on GitHub - 1 maintainer
asr-deepspeech 0.3.2
ASRDeepspeech (English / Japanese)
1 version - Latest release: about 3 years ago - 25 downloads last month - 69 stars on GitHub - 1 maintainer
Top 2.4% on pypi.org
paddlespeech-feat 0.1.0
python speech feature extraction in paddlespeech
2 versions - Latest release: almost 4 years ago - 3 dependent packages - 16 dependent repositories - 14 thousand downloads last month - 12,271 stars on GitHub - 1 maintainer
aniemore 1.2.3
Aniemore (Artem Nikita Ilya EMOtion REcognition) is a library for emotion recognition in voice an...
11 versions - Latest release: almost 2 years ago - 210 downloads last month - 75 stars on GitHub - 3 maintainers
autosrt 1.4.9
a utility for automatic speech recognition and subtitle generation
64 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 604 downloads last month - 61 stars on GitHub - 1 maintainer
pvoctopus 2.0.2
⚠️ DEPRECATED: This package is no longer maintained.
15 versions - Latest release: 7 months ago - 1 dependent package - 1 dependent repositories - 68 downloads last month - 37 stars on GitHub - 1 maintainer
pvoctopusdemo 2.0.2
⚠️ DEPRECATED: This package is no longer maintained.
15 versions - Latest release: 7 months ago - 1 dependent repositories - 64 downloads last month - 37 stars on GitHub - 1 maintainer
stark-engine 4.2.1
S.T.A.R.K - Speech and Text Algorithmic Recognition Kit. Modern framework for creating powerfull ...
18 versions - Latest release: 10 days ago - 198 downloads last month - 65 stars on GitHub - 1 maintainer
aavt 0.0.1
A Python package for Video Translate and Some Tools for Video
2 versions - Latest release: over 1 year ago - 46 downloads last month - 2,674 stars on GitHub - 1 maintainer
whisply 0.11.0
"Transcribe, translate, annotate and subtitle audio and video files with OpenAI's Whisper ... fast!"
20 versions - Latest release: 2 months ago - 118 downloads last month - 61 stars on GitHub - 2 maintainers
cued-speech 0.4.2
Cued Speech Processing Tools - Decode and Generate cued speech videos
19 versions - Latest release: 11 days ago - 1.23 thousand downloads last month - 0 stars on GitHub - 1 maintainer
vox-box 0.0.20
Vox box
21 versions - Latest release: 4 months ago - 2.29 thousand downloads last month - 6,709 stars on GitHub - 1 maintainer
Top 4.3% on pypi.org
deepgram-sdk 5.3.0
Official Python SDK for Deepgram's automated speech recognition APIs.
117 versions - Latest release: 11 days ago - 10 dependent packages - 18 dependent repositories - 829 thousand downloads last month - 204 stars on GitHub - 1 maintainer
jabberwocky 2.5.0
Core library powering a GUI providing an audio interface to GPT3.
42 versions - Latest release: almost 3 years ago - 1 dependent repositories - 216 downloads last month - 19 stars on GitHub - 1 maintainer
selenium-captcha-processing 1.0.2
A Python package for detecting and solving various captchas in Selenium-based web automation, sup...
3 versions - Latest release: 5 months ago - 36 downloads last month - 1 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
mltu 1.2.5
Machine Learning Training Utilities (MLTU) for TensorFlow and PyTorch
39 versions - Latest release: over 1 year ago - 3 dependent repositories - 1.13 thousand downloads last month - 249 stars on GitHub - 1 maintainer
a4f 1.0.3
Official Python SDK for A4F - Unified AI Gateway for chat, images, audio, and embeddings
4 versions - Latest release: about 2 months ago - 70 downloads last month - 1 maintainer
spych 0.0.5
Python wrapper for the deepspeech library
5 versions - Latest release: about 3 years ago - 1 dependent repositories - 26 downloads last month - 0 stars on GitHub - 1 maintainer
haloop 0.0.10
speech agent for 100 hours
7 versions - Latest release: over 2 years ago - 25 downloads last month - 14 stars on GitHub - 1 maintainer
speechkit 2.2.3
Python SDK for Yandex Speechkit API.
18 versions - Latest release: 9 months ago - 3 dependent repositories - 418 downloads last month - 53 stars on GitHub - 1 maintainer
yodi-umbaji 1.0.0
This is the official repository for the training of Yodi V1, the frst speech recognition system f...
4 versions - Latest release: over 1 year ago - 10 downloads last month - 1 stars on GitHub - 1 maintainer
pywhisper 1.0.6 💰
openai/whisper speech to text model + extra features
7 versions - Latest release: about 3 years ago - 5 dependent repositories - 124 downloads last month - 90 stars on GitHub - 1 maintainer
Top 6.9% on pypi.org
kaldi-active-grammar 3.2.0 💰
Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
43 versions - Latest release: 12 days ago - 2 dependent packages - 2 dependent repositories - 132 downloads last month - 344 stars on GitHub - 1 maintainer
whisper-run 1.3.0
Whisper with speaker diarization
22 versions - Latest release: about 1 year ago - 39 downloads last month - 8 stars on GitHub - 1 maintainer
geniusrise-audio 0.1.12
audio bolts for geniusrise
13 versions - Latest release: over 1 year ago - 39 downloads last month - 2 stars on GitHub - 1 maintainer
vernacular-ai-speech 0.1.2
Vernacular Speech API python client
3 versions - Latest release: about 5 years ago - 1 dependent repositories - 8 downloads last month - 21 stars on GitHub - 1 maintainer
mycroft-porcupine-plugin 0.3.0
A Porcupine wakeword plugin for mycroft
4 versions - Latest release: almost 4 years ago - 1 dependent repositories - 11 downloads last month - 4,087 stars on GitHub - 1 maintainer
pywer 0.1.1
A simple Python package to calculate word error rate (WER).
2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 97 downloads last month - 5 stars on GitHub - 1 maintainer
kur 0.7.0
Descriptive deep learning
11 versions - Latest release: about 8 years ago - 2 dependent repositories - 135 downloads last month - 823 stars on GitHub - 1 maintainer
whisperx-numpy2-compatibility 0.1.1 💰
A compatibility fix to allow whisperx to work with other packages that require numpy>2. Should no...
2 versions - Latest release: 12 months ago - 136 downloads last month - 15,321 stars on GitHub - 1 maintainer