An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "speech-processing" keyword

View the packages on the pypi.org package registry that are tagged with the "speech-processing" keyword.

whisperer-ml 0.1.7
Go from raw audio to a text-audio dataset with OpenAI's Whisper
7 versions - Latest release: over 2 years ago - 1 dependent repository - 28 downloads last month - 9,783 stars on GitHub - 1 maintainer
vak 1.0.4
A neural network framework for researchers studying acoustic communication
47 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repository - 694 downloads last month - 67 stars on GitHub - 1 maintainer
Top 2.9% on pypi.org
torchscale 0.3.0
Transformers at any scale
5 versions - Latest release: almost 2 years ago - 8 dependent packages - 15 dependent repositories - 125 thousand downloads last month - 3,117 stars on GitHub - 1 maintainer
torchscale-gml 0.2.3
Transformers at any scale
4 versions - Latest release: 12 months ago - 1 dependent package - 501 downloads last month - 3,117 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
speechbrain 1.0.3
All-in-one speech toolkit in pure Python and Pytorch
17 versions - Latest release: 6 months ago - 32 dependent packages - 102 dependent repositories - 1.61 million downloads last month - 7,821 stars on GitHub - 2 maintainers
Top 6.2% on pypi.org
spafe 0.3.3 💰
Simplified Python Audio-Features Extraction.
8 versions - Latest release: over 1 year ago - 1 dependent package - 9 dependent repositories - 12.5 thousand downloads last month - 477 stars on GitHub - 1 maintainer
pyannote-audio 4.0.0 💰
State-of-the-art speaker diarization toolkit
16 versions - Latest release: 3 days ago - 840 thousand downloads last month - 8,374 stars on GitHub - 1 maintainer
stark-engine 4.1.0
S.T.A.R.K - Speech and Text Algorithmic Recognition Kit. Modern framework for creating powerful ...
16 versions - Latest release: 10 days ago - 635 downloads last month - 65 stars on GitHub - 1 maintainer
bournemouth-forced-aligner 0.1.6
Bournemouth Forced Aligner - Phoneme-level timestamp extraction
7 versions - Latest release: 2 days ago - 570 downloads last month - 73 stars on GitHub - 1 maintainer
scoreq 1.0.1
A Python package for advanced speech quality assessment using the SCOREQ model
3 versions - Latest release: 3 months ago - 174 downloads last month - 88 stars on GitHub - 1 maintainer
pysptk-speechify 0.2.0.1 💰
A python wrapper for Speech Signal Processing Toolkit (SPTK)
1 version - Latest release: almost 3 years ago - 7 downloads last month - 446 stars on GitHub - 1 maintainer
Top 2.5% on pypi.org
pysptk 1.0.1 💰
A python wrapper for Speech Signal Processing Toolkit (SPTK)
27 versions - Latest release: over 1 year ago - 6 dependent packages - 118 dependent repositories - 21.2 thousand downloads last month - 446 stars on GitHub - 1 maintainer
polyglotdb 1.3.4
PolyglotDB is a package for phonetic corpus storage and analysis
41 versions - Latest release: 3 months ago - 2 dependent repositories - 269 downloads last month - 47 stars on GitHub - 2 maintainers
speechbrain-geoph9 0.5.12a0
All-in-one speech toolkit in pure Python and Pytorch
1 version - Latest release: about 3 years ago - 1 dependent repository - 215 downloads last month - 9,783 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
pb-bss-eval 0.0.2
EM algorithms for integrated spatial and spectral models.
2 versions - Latest release: over 5 years ago - 1 dependent package - 6 dependent repositories - 27.1 thousand downloads last month - 278 stars on GitHub - 1 maintainer
gryannote 0.3.3
Provide Gradio custom components to make the diarization-based audio annotation process easier
12 versions - Latest release: 11 months ago - 572 downloads last month - 65 stars on GitHub - 1 maintainer
Top 6.8% on pypi.org
fastwer 0.1.3
A PyPI package for fast word/character error rate (WER/CER) calculation
2 versions - Latest release: over 5 years ago - 2 dependent packages - 31 dependent repositories - 1.83 thousand downloads last month - 72 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
nnmnkwii 0.1.3 💰
Library to build speech synthesis systems for fast prototyping
27 versions - Latest release: over 1 year ago - 1 dependent package - 49 dependent repositories - 6.99 thousand downloads last month - 398 stars on GitHub - 1 maintainer
indic-num2words 1.3.2
Package to convert numbers to words, with support for multiple Indian languages.
8 versions - Latest release: 4 months ago - 1 dependent package - 1 dependent repository - 5.1 thousand downloads last month - 36 stars on GitHub - 1 maintainer
myspokenlanguagedetection 5
Spoken language identification with CNN and RNN - Improved Version: accuracy up
3 versions - Latest release: over 6 years ago - 1 dependent repository - 21 downloads last month - 3 stars on GitHub - 1 maintainer
torchsubband 0.0.9 💰
This package is written for subband operations.
9 versions - Latest release: about 3 years ago - 1 dependent repository - 50 downloads last month - 92 stars on GitHub - 1 maintainer
ten-vad 1.0.6
Voice Activity Detection (VAD) : low-latency, high-performance and lightweight
13 versions - Latest release: about 1 month ago - 1.39 thousand downloads last month - 1,415 stars on GitHub - 1 maintainer
wavencoder 0.1.3
WavEncoder - PyTorch backed audio encoder
4 versions - Latest release: over 4 years ago - 6 dependent repositories - 86 downloads last month - 91 stars on GitHub - 1 maintainer
phonet 0.3.7 💰
Compute phonological posteriors from speech signals using a deep learning scheme
3 versions - Latest release: over 3 years ago - 5 dependent repositories - 385 downloads last month - 44 stars on GitHub - 1 maintainer
yet-another-retnet 0.5.1
yet-another-retnet
13 versions - Latest release: almost 2 years ago - 86 downloads last month - 3,074 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
voicefixer 0.1.3 💰
This package is written for the restoration of degraded speech
22 versions - Latest release: almost 2 years ago - 6 dependent repositories - 1.93 thousand downloads last month - 1,209 stars on GitHub - 1 maintainer
nsnet2-denoiser 0.2.3
NSNet2 Deep Noise Suppression (DNS) package
3 versions - Latest release: about 3 years ago - 1 dependent package - 103 downloads last month - 36 stars on GitHub - 3 maintainers
silero-vad 6.0.0
Voice Activity Detector (VAD) by Silero
19 versions - Latest release: about 1 month ago - 296 thousand downloads last month - 4,266 stars on GitHub - 2 maintainers
torchmm 0.0.2.alpha
PyTorch DataLoader and Abstraction for multi-modal data.
2 versions - Latest release: about 5 years ago - 1 dependent repository - 18 downloads last month - 0 stars on GitHub - 1 maintainer
pytorch-speech-features 0.0.3
PyTorch Speech Feature extraction
3 versions - Latest release: over 2 years ago - 13 downloads last month - 1 star on GitHub - 1 maintainer
dhvagna-npi 0.1.3
Advanced voice transcription tool with multi-language support
2 versions - Latest release: 5 months ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
simpleder 0.0.3
A lightweight library to compute Diarization Error Rate (DER).
3 versions - Latest release: almost 5 years ago - 3 dependent repositories - 403 downloads last month - 60 stars on GitHub - 1 maintainer
vistec-ser 0.4.6a3
Speech Emotion Recognition models and training using PyTorch
31 versions - Latest release: almost 4 years ago - 1 dependent repository - 31 downloads last month - 2 stars on GitHub - 1 maintainer
blabla 0.2.2
Novoic linguistics feature extraction package.
4 versions - Latest release: about 5 years ago - 21 downloads last month - 32 stars on GitHub - 1 maintainer
silero-vad-lite 0.2.1 💰
Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package depende...
3 versions - Latest release: about 1 year ago - 830 downloads last month - 15 stars on GitHub - 1 maintainer
odin-ai 1.2.5
Deep learning for research and production
6 versions - Latest release: over 5 years ago - 1 dependent repository - 12 downloads last month - 22 stars on GitHub - 1 maintainer
signal-transformation 2.5.0
The package allows performing a transformation of a signal using TensorFlow, Pytorch or LibROSA
61 versions - Latest release: over 2 years ago - 2 dependent repositories - 87 downloads last month - 1 star on GitHub - 1 maintainer
audioinfo 0.2.1
Count audio files in a directory.
9 versions - Latest release: over 2 years ago - 93 downloads last month - 14 stars on GitHub - 1 maintainer
everyvoice 0.3.0
Text-to-Speech Synthesis for the Speech Generation for Indigenous Language Education Small Teams ...
9 versions - Latest release: 4 months ago - 131 downloads last month - 38 stars on GitHub - 3 maintainers
ttslearn 0.2.2 💰
ttslearn: Text-to-speech with Python
4 versions - Latest release: over 3 years ago - 2 dependent repositories - 180 downloads last month - 261 stars on GitHub - 1 maintainer
phonemeser 1.0.3
Predict speech emotions from wav files.
2 versions - Latest release: over 3 years ago - 1 dependent repository - 12 downloads last month - 4 stars on GitHub - 1 maintainer
slg-nimrod 0.0.11
minimal deep learning framework
11 versions - Latest release: about 1 year ago - 67 downloads last month - 2 stars on GitHub - 1 maintainer
swift-f0 0.1.2
Fast and accurate fundamental frequency (F0) detector using convolutional neural networks
3 versions - Latest release: 2 months ago - 86 downloads last month - 38 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
surfboard 0.2.0
Novoic's audio feature extraction library https://novoic.com
3 versions - Latest release: about 5 years ago - 4 dependent repositories - 93 downloads last month - 437 stars on GitHub - 1 maintainer
mexca 1.0.4
Emotion expression capture from multiple modalities.
11 versions - Latest release: over 1 year ago - 60 downloads last month - 36 stars on GitHub - 1 maintainer
pyssp 0.1.9
python speech signal processing library for education
25 versions - Latest release: over 8 years ago - 10 dependent repositories - 28 downloads last month - 18 stars on GitHub - 1 maintainer
npvcc2016 3.0.0
npvcc2016: Python loader of npVCC2016 speech corpus
17 versions - Latest release: almost 5 years ago - 1 dependent repository - 14 downloads last month - 0 stars on GitHub - 1 maintainer
resemble-enhance 0.0.1
Speech denoising and enhancement with deep learning
5 versions - Latest release: almost 2 years ago - 6.76 thousand downloads last month - 1,941 stars on GitHub - 1 maintainer
deepvoice3_pytorch 0.1.0 💰
PyTorch implementation of convolutional networks-based text-to-speech synthesis models.
6 versions - Latest release: almost 7 years ago - 2 dependent repositories - 163 downloads last month - 1,980 stars on GitHub - 1 maintainer
vbdiar 0.0.1
VB Diarization with Eigenvoice and HMM Priors
1 version - Latest release: over 6 years ago - 1 dependent repository - 5 downloads last month - 15 stars on GitHub - 1 maintainer
sfeatpy 0.0.1
Functions to extract signal speech parameters
1 version - Latest release: over 4 years ago - 1 dependent repository - 6 downloads last month - 2 stars on GitHub - 1 maintainer
myvocrec 1
Capturing Microphone Data into an Audio File
1 version - Latest release: over 6 years ago - 1 dependent repository - 14 downloads last month - 3 stars on GitHub - 1 maintainer
senko 0.1.0rc1 💰
Very fast speaker diarization
1 version - Latest release: about 1 month ago - 93 stars on GitHub
silero-vad-fork 0.1.0
A packaged version of the Silero VAD model
1 version - Latest release: about 2 years ago - 67 downloads last month - 4,206 stars on GitHub - 1 maintainer
Top 4.1% on pypi.org
whisper-timestamped 1.15.9
Multi-lingual Automatic Speech Recognition (ASR) based on Whisper models, with accurate word time...
16 versions - Latest release: 23 days ago - 8 dependent packages - 4 dependent repositories - 84.1 thousand downloads last month - 2,581 stars on GitHub - 3 maintainers
voicelab 2.0.0
Automated Reproducible Voice Analysis
7 versions - Latest release: over 2 years ago - 1 dependent repository - 57 downloads last month - 150 stars on GitHub - 1 maintainer
retentive-network 0.1.0
Unofficial codebase for the "Retentive Network: A Successor to Transformer for Large Language Mod...
2 versions - Latest release: about 2 years ago - 15 downloads last month - 3,074 stars on GitHub - 1 maintainer
Related Keywords
python (21), speech (20), machine-learning (17), pytorch (14), speech-recognition (12), audio (10), deep-learning (10), audio-processing (9), speaker-diarization (9), voice-recognition (9), natural-language-processing (9), speech-synthesis (8), speech-to-text (7), speech-enhancement (7), tts (6), computer-vision (6), voice-activity-detection (6), python3 (6), multimodal (5), speaker-verification (5), speaker-recognition (5), signal-processing (5), language-model (5), asr (5), pretrained-language-model (4), transformer (4), translation (4), voice (4), speech-analysis (4), vad (4), voice-commands (4), transformers (4), speechrecognition (4), text-to-speech (4), digital-signal-processing (3), feature-extraction (3), tensorflow (3), dsp (3), voice-control (3), huggingface (3), speech-separation (3), speech-toolkit (3), onnx (3), spoken-language-understanding (3), scale (2), semi-supervised-learning (2), python-wrapper (2), voice-detection (2), SPTK (2), silero-vad (2), healthcare (2), optimisation (2), neural-network (2), language-detection (2), Natural Language Processing and Understanding (2), speech signal processing (2), nlp (2), emotion-recognition (2), TTS (2), phonemes (2), linguistics (2), any (2), audio-analysis (2), at (2), phonetics (2), corpus (2), Transformers (2), denoise (2), pyannote (2), attention-mechanism (2), mfcc (2), parkinsons-disease (2), onnxruntime (2), deep-neural-networks (2), text-processing (2), alzheimers-disease (2), sptk (2), onnx-runtime (2), multimodal-deep-learning (1), image-processing (1), graph-algorithms (1), generative-model (1), bayesian-methods (1), probabilistic-graphical-models (1), probabilistic-programming (1), variational-autoencoder (1), variational-autoencoders (1), disentangled-representations (1), disentanglement-learning (1), factor-vae (1), recognition (1), transcription (1), api (1), dhvagna-npi (1), github (1), medium-article (1), diarization (1), metrics (1), speech-emotion-recognition (1), language (1)