pypi.org "asr" keyword
View the packages on the pypi.org package registry that are tagged with the "asr" keyword.
torch-cif 0.2.0
A fast parallel implementation of continuous integrate-and-fire (CIF) https://arxiv.org/abs/1905....2 versions - Latest release: about 1 year ago - 118 downloads last month - 33 stars on GitHub - 1 maintainer
whishow 1.3.5
an online player for video stream8 versions - Latest release: about 1 year ago - 145 downloads last month - 5 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
85 versions - Latest release: 19 days ago - 4 dependent packages - 27 dependent repositories - 313 thousand downloads last month - 11,448 stars on GitHub - 3 maintainers
nemo-toolkit 2.2.1
NeMo - a toolkit for Conversational AI85 versions - Latest release: 19 days ago - 4 dependent packages - 27 dependent repositories - 313 thousand downloads last month - 11,448 stars on GitHub - 3 maintainers
Top 1.2% on pypi.org
17 versions - Latest release: 12 days ago - 32 dependent packages - 102 dependent repositories - 1.32 million downloads last month - 7,821 stars on GitHub - 2 maintainers
speechbrain 1.0.3
All-in-one speech toolkit in pure Python and Pytorch17 versions - Latest release: 12 days ago - 32 dependent packages - 102 dependent repositories - 1.32 million downloads last month - 7,821 stars on GitHub - 2 maintainers
Top 1.1% on pypi.org
12 versions - Latest release: over 8 years ago - 27 dependent packages - 819 dependent repositories - 136 thousand downloads last month - 2,194 stars on GitHub - 1 maintainer
webrtcvad 2.0.10
Python interface to the Google WebRTC Voice Activity Detector (VAD)12 versions - Latest release: over 8 years ago - 27 dependent packages - 819 dependent repositories - 136 thousand downloads last month - 2,194 stars on GitHub - 1 maintainer
whisper-normalizer 0.1.1 💰
A python package for whisper normalizer11 versions - Latest release: about 11 hours ago - 3 dependent packages - 1 dependent repositories - 67.7 thousand downloads last month - 44 stars on GitHub - 1 maintainer
kolm 1.1.4
Korean LM toolkit for building ASR system2 versions - Latest release: about 8 years ago - 2 dependent repositories - 57 downloads last month - 60 stars on GitHub - 1 maintainer
asrecognition 0.0.4 💰
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.4 versions - Latest release: over 3 years ago - 1 dependent repositories - 171 downloads last month - 51 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
26 versions - Latest release: 3 months ago - 13 dependent packages - 328 dependent repositories - 31 thousand downloads last month - 3,773 stars on GitHub - 2 maintainers
pocketsphinx 5.0.4
Official Python bindings for PocketSphinx26 versions - Latest release: 3 months ago - 13 dependent packages - 328 dependent repositories - 31 thousand downloads last month - 3,773 stars on GitHub - 2 maintainers
Top 2.4% on pypi.org
2 versions - Latest release: over 3 years ago - 3 dependent packages - 16 dependent repositories - 2.58 thousand downloads last month - 11,788 stars on GitHub - 1 maintainer
paddlespeech-feat 0.1.0
python speech feature extraction in paddlespeech2 versions - Latest release: over 3 years ago - 3 dependent packages - 16 dependent repositories - 2.58 thousand downloads last month - 11,788 stars on GitHub - 1 maintainer
Top 2.1% on pypi.org
9 versions - Latest release: over 2 years ago - 5 dependent packages - 50 dependent repositories - 3.3 thousand downloads last month - 11,788 stars on GitHub - 1 maintainer
paddleaudio 1.1.0
Speech audio tools based on Paddlepaddle9 versions - Latest release: over 2 years ago - 5 dependent packages - 50 dependent repositories - 3.3 thousand downloads last month - 11,788 stars on GitHub - 1 maintainer
Top 2.8% on pypi.org
35 versions - Latest release: 10 months ago - 1 dependent package - 69 dependent repositories - 6.91 thousand downloads last month - 11,788 stars on GitHub - 1 maintainer
paddlespeech 1.4.2
Speech tools and models based on Paddlepaddle35 versions - Latest release: 10 months ago - 1 dependent package - 69 dependent repositories - 6.91 thousand downloads last month - 11,788 stars on GitHub - 1 maintainer
paddlespeech-ldd-ctcdecoders 0.2.0
CTC decoders in paddlespeech1 version - Latest release: almost 3 years ago - 26 downloads last month - 11,788 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
8 versions - Latest release: almost 3 years ago - 1 dependent package - 1 dependent repositories - 1.14 thousand downloads last month - 11,788 stars on GitHub - 1 maintainer
paddlespeech-ctcdecoders 0.2.1
CTC decoders in paddlespeech8 versions - Latest release: almost 3 years ago - 1 dependent package - 1 dependent repositories - 1.14 thousand downloads last month - 11,788 stars on GitHub - 1 maintainer
ef-sherpa-onnx 1.9.21.dev1
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen K...1 version - Latest release: 12 months ago - 33 downloads last month - 5,645 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
132 versions - Latest release: 16 days ago - 1 dependent repositories - 58.1 thousand downloads last month - 5,645 stars on GitHub - 2 maintainers
sherpa-onnx 1.11.3
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen K...132 versions - Latest release: 16 days ago - 1 dependent repositories - 58.1 thousand downloads last month - 5,645 stars on GitHub - 2 maintainers
ppasr 3.0.2
Automatic speech recognition toolkit on PaddlePaddle43 versions - Latest release: 1 day ago - 1 dependent repositories - 894 downloads last month - 852 stars on GitHub - 1 maintainer
masr 3.0.2
Automatic speech recognition toolkit on Pytorch36 versions - Latest release: 1 day ago - 1 dependent repositories - 552 downloads last month - 588 stars on GitHub - 1 maintainer
vernacular-ai-speech 0.1.2
Vernacular Speech API python client3 versions - Latest release: over 4 years ago - 1 dependent repositories - 173 downloads last month - 21 stars on GitHub - 1 maintainer
video-analyzer 0.1.1
A tool for analyzing videos using Vision models2 versions - Latest release: about 1 month ago - 291 downloads last month - 628 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
28 versions - Latest release: 25 days ago - 49 dependent packages - 205 dependent repositories - 1.5 million downloads last month - 2,960 stars on GitHub - 1 maintainer
youtube-transcript-api 1.0.3 💰
This is an python API which allows you to get the transcripts/subtitles for a given YouTube video...28 versions - Latest release: 25 days ago - 49 dependent packages - 205 dependent repositories - 1.5 million downloads last month - 2,960 stars on GitHub - 1 maintainer
voskintentvoiceconverthanzi 0.3.32
Offline open source speech recognition API based on Kaldi and Vosk1 version - Latest release: over 3 years ago - 1 dependent repositories - 32 downloads last month - 9,253 stars on GitHub - 1 maintainer
tf-seq2seq-losses 0.3.0
Tensorflow implementations for (CTC) loss functions that are fast and support second-order deriva...6 versions - Latest release: 10 months ago - 1 dependent repositories - 115 downloads last month - 1 maintainer
Top 7.9% on pypi.org
8 versions - Latest release: 8 months ago - 4 dependent packages - 32 dependent repositories - 526 thousand downloads last month - 17 stars on GitHub - 1 maintainer
webrtcvad-wheels 2.0.14 💰
Python interface to the Google WebRTC Voice Activity Detector (VAD) [released with binary wheels!]8 versions - Latest release: 8 months ago - 4 dependent packages - 32 dependent repositories - 526 thousand downloads last month - 17 stars on GitHub - 1 maintainer
asr_tools 0.1
Automatic speech recognition tools.1 version - Latest release: almost 9 years ago - 2 dependent repositories - 53 downloads last month - 5 stars on GitHub - 1 maintainer
transfusion-asr 0.1.0
TransFusion: Transcribing Speech with Multinomial Diffusion1 version - Latest release: over 1 year ago - 1 dependent package - 2.56 thousand downloads last month - 66 stars on GitHub - 1 maintainer
whisperx-numpy2-compatibility 0.1.1 💰
A compatibility fix to allow whisperx to work with other packages that require numpy>2. Should no...2 versions - Latest release: 5 months ago - 200 downloads last month - 14,976 stars on GitHub - 1 maintainer
slikts-whisperx 3.3.1 💰
Time-Accurate Automatic Speech Recognition using Whisper.1 version - Latest release: 3 months ago - 53 downloads last month - 14,976 stars on GitHub - 1 maintainer
whisperx-karaoke 0.1.1 💰
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)2 versions - Latest release: 12 months ago - 102 downloads last month - 14,976 stars on GitHub - 1 maintainer
whisperx 3.3.2 💰
Time-Accurate Automatic Speech Recognition using Whisper.10 versions - Latest release: 9 days ago - 3 dependent packages - 118 thousand downloads last month - 14,976 stars on GitHub - 2 maintainers
soundswallower 0.6.5 💰
An even smaller speech recognizer18 versions - Latest release: 5 months ago - 1 dependent package - 1 dependent repositories - 2.3 thousand downloads last month - 32 stars on GitHub - 1 maintainer
voh 0.0.0
Voice of Heart1 version - Latest release: 5 months ago - 51 downloads last month - 1 maintainer
libreasr 0.0.1 💰
:speech_balloon: An On-Premises, Streaming Speech Recognition System1 version - Latest release: about 4 years ago - 1 dependent repositories - 65 downloads last month - 683 stars on GitHub - 1 maintainer
wrapperz 0.1.3
Simple wrappers for various AI APIs including LLMs, ASR, and TTS4 versions - Latest release: about 1 month ago - 529 downloads last month - 0 stars on GitHub - 1 maintainer
nemo-tts 0.9.0
Collection of Neural Modules for Speech Synthesis1 version - Latest release: over 5 years ago - 1 dependent repositories - 71 downloads last month - 13,640 stars on GitHub - 1 maintainer
nemo-nlp 0.9.0
Collection of Neural Modules for Natural Language Processing4 versions - Latest release: over 5 years ago - 1 dependent repositories - 182 downloads last month - 13,640 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
4 versions - Latest release: over 5 years ago - 2 dependent repositories - 544 downloads last month - 13,640 stars on GitHub - 1 maintainer
nemo-asr 0.9.0
Collection of Neural Modules for Speech Recognition4 versions - Latest release: over 5 years ago - 2 dependent repositories - 544 downloads last month - 13,640 stars on GitHub - 1 maintainer
aiplat-acu-asr 2.0.7
asr grpc client4 versions - Latest release: over 4 years ago - 1 dependent repositories - 148 downloads last month - 78 stars on GitHub - 1 maintainer
baidu-acu-asr 2.0.4
asr grpc client17 versions - Latest release: over 1 year ago - 1 dependent repositories - 461 downloads last month - 78 stars on GitHub - 1 maintainer
meeteval 0.4.1
MeetEval - A meeting transcription evaluation toolkit7 versions - Latest release: 18 days ago - 1.4 thousand downloads last month - 92 stars on GitHub - 1 maintainer
f5-tts-mlx-quantized 0.1.1
F5-TTS - MLX2 versions - Latest release: 4 months ago - 111 downloads last month - 11 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
19 versions - Latest release: over 6 years ago - 4 dependent repositories - 1.37 thousand downloads last month - 102 stars on GitHub - 1 maintainer
asr_evaluation 2.0.4
Evaluating ASR (automatic speech recognition) hypotheses, i.e. computing word error rate.19 versions - Latest release: over 6 years ago - 4 dependent repositories - 1.37 thousand downloads last month - 102 stars on GitHub - 1 maintainer
Top 8.2% on pypi.org
39 versions - Latest release: 4 days ago - 1 dependent repositories - 12.3 thousand downloads last month - 955 stars on GitHub - 2 maintainers
sherpa-ncnn 2.1.11
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn wi...39 versions - Latest release: 4 days ago - 1 dependent repositories - 12.3 thousand downloads last month - 955 stars on GitHub - 2 maintainers
rev-reverb 0.1.0
A simplified python packge to interact with the reverb models1 version - Latest release: 5 months ago - 67 downloads last month - 389 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
52 versions - Latest release: 4 months ago - 28 dependent packages - 362 dependent repositories - 118 thousand downloads last month - 713 stars on GitHub - 2 maintainers
cn2an 0.5.23
Convert Chinese numerals and Arabic numerals.52 versions - Latest release: 4 months ago - 28 dependent packages - 362 dependent repositories - 118 thousand downloads last month - 713 stars on GitHub - 2 maintainers
whisper-realtime-transcriber 1.0.0
Let whisper models transcribe in realtime.6 versions - Latest release: 7 months ago - 225 downloads last month - 7 stars on GitHub - 1 maintainer
simple-asr 0.0.2
Wrapper module around wav2vec2 designed for ease of use2 versions - Latest release: 3 months ago - 101 downloads last month - 1 stars on GitHub - 1 maintainer
kaldi-adapt-lm 0.1.3
Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model.1 version - Latest release: over 6 years ago - 1 dependent repositories - 46 downloads last month - 33 stars on GitHub - 1 maintainer
py-nltools 0.5.0
A collection of basic python modules for spoken natural language processing22 versions - Latest release: almost 6 years ago - 2 dependent repositories - 691 downloads last month - 56 stars on GitHub - 1 maintainer
alex_asr 1.0.4
Incremental speech recognition decoder for Kaldi NNET2 and GMM models.5 versions - Latest release: over 9 years ago - 2 dependent repositories - 114 downloads last month - 49 stars on GitHub - 1 maintainer
voicetools 0.0.1
All-in-one voice tools library1 version - Latest release: over 8 years ago - 1 dependent repositories - 61 downloads last month - 158 stars on GitHub - 1 maintainer
k2-sherpa 0.9.1
Speech-to-text server framework with next-gen Kaldi13 versions - Latest release: over 2 years ago - 1 dependent repositories - 191 downloads last month - 657 stars on GitHub - 1 maintainer
tatt 0.981
Tatt creates a uniform API for multiple speech-to-text (STT) services.55 versions - Latest release: almost 6 years ago - 1 dependent repositories - 835 downloads last month - 11 stars on GitHub - 1 maintainer
deepgram-unstable-sdk 3.8.0.dev4
The official Python SDK for the Deepgram automated speech recognition platform.22 versions - Latest release: 5 months ago - 600 downloads last month - 286 stars on GitHub - 1 maintainer
Top 4.3% on pypi.org
101 versions - Latest release: about 2 months ago - 10 dependent packages - 18 dependent repositories - 451 thousand downloads last month - 204 stars on GitHub - 1 maintainer
deepgram-sdk 3.10.1
The official Python SDK for the Deepgram automated speech recognition platform.101 versions - Latest release: about 2 months ago - 10 dependent packages - 18 dependent repositories - 451 thousand downloads last month - 204 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
6 versions - Latest release: over 3 years ago - 2 dependent repositories - 984 downloads last month - 2,408 stars on GitHub - 1 maintainer
stt-tflite 0.10.0a10
A library for doing speech recognition using a Coqui STT model6 versions - Latest release: over 3 years ago - 2 dependent repositories - 984 downloads last month - 2,408 stars on GitHub - 1 maintainer
coqui-stt-training 1.4.0
Training code for Coqui STT33 versions - Latest release: over 2 years ago - 1 dependent repositories - 638 downloads last month - 2,408 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
35 versions - Latest release: over 2 years ago - 20 dependent repositories - 5.69 thousand downloads last month - 2,408 stars on GitHub - 2 maintainers
stt 1.4.0
A library for doing speech recognition using a Coqui STT model35 versions - Latest release: over 2 years ago - 20 dependent repositories - 5.69 thousand downloads last month - 2,408 stars on GitHub - 2 maintainers
iarahealth-stt-training 1.5.2
Training code for Coqui STT2 versions - Latest release: over 1 year ago - 89 downloads last month - 2,408 stars on GitHub - 1 maintainer
stt-gpu 0.10.0a4
A library for doing speech recognition using a Coqui STT model1 version - Latest release: about 4 years ago - 1 dependent repositories - 29 downloads last month - 2,401 stars on GitHub - 1 maintainer
Top 8.4% on pypi.org
16 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 1.26 thousand downloads last month - 619 stars on GitHub - 1 maintainer
pvcheetah 2.1.3
Cheetah Speech-to-Text Engine.16 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 1.26 thousand downloads last month - 619 stars on GitHub - 1 maintainer
pvcheetahdemo 2.1.3
Cheetah speech-to-text engine demos20 versions - Latest release: about 1 month ago - 1 dependent repositories - 660 downloads last month - 619 stars on GitHub - 1 maintainer
watson-streaming 0.0.13
Speech to text transcription in real-time using IBM Watson13 versions - Latest release: almost 5 years ago - 2 dependent repositories - 533 downloads last month - 3 stars on GitHub - 1 maintainer
pvleoparddemo 2.0.5
Leopard speech-to-text engine demos24 versions - Latest release: 2 months ago - 1 dependent repositories - 608 downloads last month - 422 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
25 versions - Latest release: 2 months ago - 1 dependent package - 4 dependent repositories - 2.33 thousand downloads last month - 422 stars on GitHub - 1 maintainer
pvleopard 2.0.5
Leopard Speech-to-Text Engine.25 versions - Latest release: 2 months ago - 1 dependent package - 4 dependent repositories - 2.33 thousand downloads last month - 422 stars on GitHub - 1 maintainer
werpy 3.0.2
A powerful yet lightweight Python package to calculate and analyze the Word Error Rate (WER).16 versions - Latest release: 16 days ago - 1 dependent repositories - 5.74 thousand downloads last month - 12 stars on GitHub - 1 maintainer
pythaiasr 1.3.0
Python Thai ASR10 versions - Latest release: about 2 years ago - 1 dependent repositories - 495 downloads last month - 58 stars on GitHub - 1 maintainer
retico-wav2vecasr 0.1.6
The Huggingface wav2vec ASR incremental modules for the retico framework7 versions - Latest release: over 2 years ago - 152 downloads last month - 0 stars on GitHub - 1 maintainer
usttc 0.0.8
Unified Speech-to-text Client8 versions - Latest release: about 2 years ago - 1 dependent repositories - 181 downloads last month - 4 stars on GitHub - 1 maintainer
Top 7.9% on pypi.org
14 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 54.5 thousand downloads last month - 274 stars on GitHub - 1 maintainer
nemo-text-processing 1.1.0
NeMo text processing for ASR and TTS14 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 54.5 thousand downloads last month - 274 stars on GitHub - 1 maintainer
tafrigh 1.7.6
تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.33 versions - Latest release: 17 days ago - 1.84 thousand downloads last month - 125 stars on GitHub - 1 maintainer
rapid-paraformer 2.0.5
Tool of speech recognition.2 versions - Latest release: 11 months ago - 83 downloads last month - 542 stars on GitHub - 1 maintainer
whisper-s2t 1.3.1
An Optimized Speech-to-Text Pipeline for the Whisper Model.5 versions - Latest release: about 1 year ago - 2 dependent packages - 3.61 thousand downloads last month - 382 stars on GitHub - 1 maintainer
malayalam-asr-benchmarking 0.0.4
A study to benchmark whisper based ASRs in Malayalam4 versions - Latest release: about 1 year ago - 132 downloads last month - 8 stars on GitHub - 1 maintainer
Top 9.2% on pypi.org
2 versions - Latest release: over 4 years ago - 1 dependent package - 5 dependent repositories - 6.63 thousand downloads last month - 33 stars on gitlab.com - 4 maintainers
asr 0.4.1
ASE recipes for calculating material properties2 versions - Latest release: over 4 years ago - 1 dependent package - 5 dependent repositories - 6.63 thousand downloads last month - 33 stars on gitlab.com - 4 maintainers
whisper-turbo-mlx 0.0.1
Whisper Turbo in MLX17 versions - Latest release: 6 months ago - 527 downloads last month - 202 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
8 versions - Latest release: over 2 years ago - 7 dependent repositories - 1.08 thousand downloads last month - 447 stars on GitHub - 1 maintainer
huggingsound 0.1.6 💰
HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools.8 versions - Latest release: over 2 years ago - 7 dependent repositories - 1.08 thousand downloads last month - 447 stars on GitHub - 1 maintainer
speechbrain-geoph9 0.5.12a0
All-in-one speech toolkit in pure Python and Pytorch1 version - Latest release: over 2 years ago - 1 dependent repositories - 43 downloads last month - 8,811 stars on GitHub - 1 maintainer
whisperer-ml 0.1.7
Go from raw audio to a text-audio dataset with OpenAI's Whisper7 versions - Latest release: about 2 years ago - 1 dependent repositories - 266 downloads last month - 8,811 stars on GitHub - 1 maintainer
deepasr 0.1.2
Keras(Tensorflow) implementations of Automatic Speech Recognition12 versions - Latest release: over 3 years ago - 1 dependent repositories - 249 downloads last month - 23 stars on GitHub - 1 maintainer
audiotoken 0.3.1
A package for creating audio tokens7 versions - Latest release: 8 months ago - 297 downloads last month - 49 stars on GitHub - 2 maintainers
delta-nlp 0.3.2
DELTA is a deep learning based natural language and speech processing platform.3 versions - Latest release: about 5 years ago - 1 dependent repositories - 110 downloads last month - 1,588 stars on GitHub - 1 maintainer
speechloop 0.0.3
A "keep it simple" collection of many speech recognition engines... Designed to help answer - wha...3 versions - Latest release: almost 3 years ago - 1 dependent repositories - 149 downloads last month - 19 stars on GitHub - 1 maintainer
aana 0.2.4
Multimodal SDK8 versions - Latest release: 2 months ago - 532 downloads last month - 21 stars on GitHub - 1 maintainer
whispercpp-kit 0.1.5
A toolkit for whisper.cpp with audio processing and model management6 versions - Latest release: 3 months ago - 594 downloads last month - 1 stars on GitHub - 1 maintainer
yeaudio 0.0.7
Audio ToolKit for Python7 versions - Latest release: 5 months ago - 534 downloads last month - 4 stars on GitHub - 1 maintainer
mlx-e2-tts 0.0.6
E2-TTS - MLX5 versions - Latest release: 6 months ago - 214 downloads last month - 20 stars on GitHub - 1 maintainer
vocos-mlx 0.0.7
Vocos - MLX7 versions - Latest release: 6 months ago - 1.78 thousand downloads last month - 17 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
4 versions - Latest release: almost 3 years ago - 4 dependent packages - 3 dependent repositories - 9.55 thousand downloads last month - 4,944 stars on GitHub - 1 maintainer
silero 0.4.1 💰
Silero Models: pre-trained enterprise-grade STT / TTS models and benchmarks.4 versions - Latest release: almost 3 years ago - 4 dependent packages - 3 dependent repositories - 9.55 thousand downloads last month - 4,944 stars on GitHub - 1 maintainer
voicesynth 0.2.3 💰
Package for realistic voice synthesis24 versions - Latest release: 6 months ago - 396 downloads last month - 4,944 stars on GitHub - 1 maintainer
webrtcvad123 2.0.11.dev0
Python interface to the Google WebRTC Voice Activity Detector (VAD)1 version - Latest release: over 1 year ago - 40 downloads last month - 1,869 stars on GitHub - 1 maintainer
paraboth 0.1.3
A Python package implementing Paraboth with some improvements: https://aclanthology.org/2023.swis...2 versions - Latest release: 4 months ago - 90 downloads last month - 0 stars on GitHub - 1 maintainer
tsnorm 1.1.2
A library to put stress marks in Russian text3 versions - Latest release: 4 months ago - 83 downloads last month - 6 stars on GitHub - 1 maintainer
torch-edit-distance 0.4.0
PyTorch edit-distance functions3 versions - Latest release: about 2 years ago - 2 dependent repositories - 37.5 thousand downloads last month - 94 stars on GitHub - 1 maintainer
whisply 0.10.3
Transcribe, translate, annotate and subtitle audio and video files with OpenAI's Whisper ... fast!17 versions - Latest release: 22 days ago - 1.05 thousand downloads last month - 38 stars on GitHub - 2 maintainers
audio2text 1.0.2
This package is for extract text from audio/video file2 versions - Latest release: over 4 years ago - 76 downloads last month - 1 stars on GitHub - 1 maintainer
Top 4.1% on pypi.org
15 versions - Latest release: 5 months ago - 8 dependent packages - 4 dependent repositories - 34.7 thousand downloads last month - 2,333 stars on GitHub - 3 maintainers
whisper-timestamped 1.15.8
Multi-lingual Automatic Speech Recognition (ASR) based on Whisper models, with accurate word time...15 versions - Latest release: 5 months ago - 8 dependent packages - 4 dependent repositories - 34.7 thousand downloads last month - 2,333 stars on GitHub - 3 maintainers
nexaai 0.1.0
Nexa AI SDK31 versions - Latest release: over 1 year ago - 1.27 thousand downloads last month - 4,476 stars on GitHub - 2 maintainers
nexaai-gpu 0.0.1.dev0
Nexa AI SDK1 version - Latest release: 8 months ago - 32 downloads last month - 27 stars on GitHub - 1 maintainer
nexai 0.0.0.dev0
Nexa AI SDK1 version - Latest release: 8 months ago - 27 downloads last month - 27 stars on GitHub - 1 maintainer
Related Keywords
speech-recognition
73
speech-to-text
53
speech
39
tts
33
whisper
29
automatic-speech-recognition
24
stt
24
python
24
pytorch
22
voice-recognition
21
deep-learning
19
audio
17
transformers
16
text-to-speech
13
speech-synthesis
13
nlp
12
llm
12
speech-translation
11
conformer
10
speech recognition
10
speechrecognition
9
transcription
9
language
8
vad
8
language-model
7
recognition
7
artificial intelligence
7
kaldi
7
tensorflow
7
speaker-recognition
7
wav2vec2
7
transformer
7
speech-recognizer
7
speech-recognition-api
6
ai
6
sdk
6
speaker-diarization
6
speaker-verification
6
deep learning
6
streaming-asr
6
audio-generation
6
vocoder
5
streaming-tts
5
speech-alignment
5
sound-classification
5
self-supervised-learning
5
punctuation-restoration
5
openai
5
kws
5
code-switch
5
voice-cloning
5
automatic speech recognition
5
onnx
5
deeplearning
5
multimodal
5
mlx
5
vlm
5
audio-processing
5
speech-processing
5
ASR
5
on-device-ai
4
edge-computing
4
sdk-python
4
stable-diffusion
4
on-device-ml
4
framework
4
Automatic Speech Recognition
4
Voice Recognition
4
Speech Recognition
4
Speech-to-Text
4
hacktoberfest
4
processing
4
natural
4
android
4
cpp
4
ios
4
raspberry-pi
4
speech_recognition
4
deepspeech
4
voice-activity-detection
4
cli
4
subtitles
4
youtube
4
huggingface
4
speaker-diariazation
4
streaming
4
voiceactivitydetection
4
webrtc
4
generative-ai
4
large-language-models
4
machine-translation
4
neural-networks
4
speech-separation
3
speech-toolkit
3
wer
3
spoken-language-understanding
3
baidu
3
wrapper
3
mycroft
3
ovos
3