pypi.org "speech-processing" keyword
View the packages on the pypi.org package registry that are tagged with the "speech-processing" keyword.
whisperer-ml 0.1.7
Go from raw audio to a text-audio dataset with OpenAI's Whisper7 versions - Latest release: over 2 years ago - 1 dependent repositories - 28 downloads last month - 9,783 stars on GitHub - 1 maintainer
vak 1.0.4
A neural network framework for researchers studying acoustic communication47 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 694 downloads last month - 67 stars on GitHub - 1 maintainer
Top 2.9% on pypi.org
5 versions - Latest release: almost 2 years ago - 8 dependent packages - 15 dependent repositories - 125 thousand downloads last month - 3,117 stars on GitHub - 1 maintainer
torchscale 0.3.0
Transformers at any scale5 versions - Latest release: almost 2 years ago - 8 dependent packages - 15 dependent repositories - 125 thousand downloads last month - 3,117 stars on GitHub - 1 maintainer
torchscale-gml 0.2.3
Transformers at any scale4 versions - Latest release: 12 months ago - 1 dependent package - 501 downloads last month - 3,117 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
17 versions - Latest release: 6 months ago - 32 dependent packages - 102 dependent repositories - 1.61 million downloads last month - 7,821 stars on GitHub - 2 maintainers
speechbrain 1.0.3
All-in-one speech toolkit in pure Python and Pytorch17 versions - Latest release: 6 months ago - 32 dependent packages - 102 dependent repositories - 1.61 million downloads last month - 7,821 stars on GitHub - 2 maintainers
Top 6.2% on pypi.org
8 versions - Latest release: over 1 year ago - 1 dependent package - 9 dependent repositories - 12.5 thousand downloads last month - 477 stars on GitHub - 1 maintainer
spafe 0.3.3 💰
Simplified Python Audio-Features Extraction.8 versions - Latest release: over 1 year ago - 1 dependent package - 9 dependent repositories - 12.5 thousand downloads last month - 477 stars on GitHub - 1 maintainer
pyannote-audio 4.0.0 💰
State-of-the-art speaker diarization toolkit16 versions - Latest release: 3 days ago - 840 thousand downloads last month - 8,374 stars on GitHub - 1 maintainer
stark-engine 4.1.0
S.T.A.R.K - Speech and Text Algorithmic Recognition Kit. Modern framework for creating powerfull ...16 versions - Latest release: 10 days ago - 635 downloads last month - 65 stars on GitHub - 1 maintainer
bournemouth-forced-aligner 0.1.6
Bournemouth Forced Aligner - Phoneme-level timestamp extraction7 versions - Latest release: 2 days ago - 570 downloads last month - 73 stars on GitHub - 1 maintainer
scoreq 1.0.1
A Python package for advanced speech quality assessment using the SCOREQ model3 versions - Latest release: 3 months ago - 174 downloads last month - 88 stars on GitHub - 1 maintainer
pysptk-speechify 0.2.0.1 💰
A python wrapper for Speech Signal Processing Toolkit (SPTK)1 version - Latest release: almost 3 years ago - 7 downloads last month - 446 stars on GitHub - 1 maintainer
Top 2.5% on pypi.org
27 versions - Latest release: over 1 year ago - 6 dependent packages - 118 dependent repositories - 21.2 thousand downloads last month - 446 stars on GitHub - 1 maintainer
pysptk 1.0.1 💰
A python wrapper for Speech Signal Processing Toolkit (SPTK)27 versions - Latest release: over 1 year ago - 6 dependent packages - 118 dependent repositories - 21.2 thousand downloads last month - 446 stars on GitHub - 1 maintainer
polyglotdb 1.3.4
PolyglotDB is a package for phonetic corpus storage and analysis41 versions - Latest release: 3 months ago - 2 dependent repositories - 269 downloads last month - 47 stars on GitHub - 2 maintainers
speechbrain-geoph9 0.5.12a0
All-in-one speech toolkit in pure Python and Pytorch1 version - Latest release: about 3 years ago - 1 dependent repositories - 215 downloads last month - 9,783 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
2 versions - Latest release: over 5 years ago - 1 dependent package - 6 dependent repositories - 27.1 thousand downloads last month - 278 stars on GitHub - 1 maintainer
pb-bss-eval 0.0.2
EM algorithms for integrated spatial and spectral models.2 versions - Latest release: over 5 years ago - 1 dependent package - 6 dependent repositories - 27.1 thousand downloads last month - 278 stars on GitHub - 1 maintainer
gryannote 0.3.3
Provide Gradio custom components to make the diarization-based audio annotation process easier12 versions - Latest release: 11 months ago - 572 downloads last month - 65 stars on GitHub - 1 maintainer
Top 6.8% on pypi.org
2 versions - Latest release: over 5 years ago - 2 dependent packages - 31 dependent repositories - 1.83 thousand downloads last month - 72 stars on GitHub - 1 maintainer
fastwer 0.1.3
A PyPI package for fast word/character error rate (WER/CER) calculation2 versions - Latest release: over 5 years ago - 2 dependent packages - 31 dependent repositories - 1.83 thousand downloads last month - 72 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
27 versions - Latest release: over 1 year ago - 1 dependent package - 49 dependent repositories - 6.99 thousand downloads last month - 398 stars on GitHub - 1 maintainer
nnmnkwii 0.1.3 💰
Library to build speech synthesis systems for fast prototyping27 versions - Latest release: over 1 year ago - 1 dependent package - 49 dependent repositories - 6.99 thousand downloads last month - 398 stars on GitHub - 1 maintainer
indic-num2words 1.3.2
Package to convert numbers to words with support of multiple indian languages.8 versions - Latest release: 4 months ago - 1 dependent package - 1 dependent repositories - 5.1 thousand downloads last month - 36 stars on GitHub - 1 maintainer
myspokenlanguagedetection 5
Spoken language identification with CNN and RNN - Improved Version: accuracy up3 versions - Latest release: over 6 years ago - 1 dependent repositories - 21 downloads last month - 3 stars on GitHub - 1 maintainer
torchsubband 0.0.9 💰
This package is written for subband operations.9 versions - Latest release: about 3 years ago - 1 dependent repositories - 50 downloads last month - 92 stars on GitHub - 1 maintainer
ten-vad 1.0.6
Voice Activity Detection (VAD) : low-latency, high-performance and lightweight13 versions - Latest release: about 1 month ago - 1.39 thousand downloads last month - 1,415 stars on GitHub - 1 maintainer
wavencoder 0.1.3
WavEncoder - PyTorch backed audio encoder4 versions - Latest release: over 4 years ago - 6 dependent repositories - 86 downloads last month - 91 stars on GitHub - 1 maintainer
phonet 0.3.7 💰
Compute phonological posteriors from speech signals using a deep learning scheme3 versions - Latest release: over 3 years ago - 5 dependent repositories - 385 downloads last month - 44 stars on GitHub - 1 maintainer
yet-another-retnet 0.5.1
yet-another-retnet13 versions - Latest release: almost 2 years ago - 86 downloads last month - 3,074 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
22 versions - Latest release: almost 2 years ago - 6 dependent repositories - 1.93 thousand downloads last month - 1,209 stars on GitHub - 1 maintainer
voicefixer 0.1.3 💰
This package is written for the restoration of degraded speech22 versions - Latest release: almost 2 years ago - 6 dependent repositories - 1.93 thousand downloads last month - 1,209 stars on GitHub - 1 maintainer
nsnet2-denoiser 0.2.3
NSNet2 Deep Noise Suppression (DNS) package3 versions - Latest release: about 3 years ago - 1 dependent package - 103 downloads last month - 36 stars on GitHub - 3 maintainers
silero-vad 6.0.0
Voice Activity Detector (VAD) by Silero19 versions - Latest release: about 1 month ago - 296 thousand downloads last month - 4,266 stars on GitHub - 2 maintainers
torchmm 0.0.2.alpha
PyTorch DataLoader and Abstraction for multi-modal data.2 versions - Latest release: about 5 years ago - 1 dependent repositories - 18 downloads last month - 0 stars on GitHub - 1 maintainer
pytorch-speech-features 0.0.3
PyTorch Speech Feature extraction3 versions - Latest release: over 2 years ago - 13 downloads last month - 1 stars on GitHub - 1 maintainer
dhvagna-npi 0.1.3
Advanced voice transcription tool with multi-language support2 versions - Latest release: 5 months ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
simpleder 0.0.3
A lightweight library to compute Diarization Error Rate (DER).3 versions - Latest release: almost 5 years ago - 3 dependent repositories - 403 downloads last month - 60 stars on GitHub - 1 maintainer
vistec-ser 0.4.6a3
Speech Emotion Recognition models and training using PyTorch31 versions - Latest release: almost 4 years ago - 1 dependent repositories - 31 downloads last month - 2 stars on GitHub - 1 maintainer
blabla 0.2.2
Novoic linguistics feature extraction package.4 versions - Latest release: about 5 years ago - 21 downloads last month - 32 stars on GitHub - 1 maintainer
silero-vad-lite 0.2.1 💰
Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package depende...3 versions - Latest release: about 1 year ago - 830 downloads last month - 15 stars on GitHub - 1 maintainer
odin-ai 1.2.5
Deep learning for research and production6 versions - Latest release: over 5 years ago - 1 dependent repositories - 12 downloads last month - 22 stars on GitHub - 1 maintainer
signal-transformation 2.5.0
The package allows performing a transformation of a signal using TensorFlow, Pytorch or LibROSA61 versions - Latest release: over 2 years ago - 2 dependent repositories - 87 downloads last month - 1 stars on GitHub - 1 maintainer
audioinfo 0.2.1
Count audio files in a directory.9 versions - Latest release: over 2 years ago - 93 downloads last month - 14 stars on GitHub - 1 maintainer
everyvoice 0.3.0
Text-to-Speech Synthesis for the Speech Generation for Indigenous Language Education Small Teams ...9 versions - Latest release: 4 months ago - 131 downloads last month - 38 stars on GitHub - 3 maintainers
ttslearn 0.2.2 💰
ttslearn: Text-to-speech with Python4 versions - Latest release: over 3 years ago - 2 dependent repositories - 180 downloads last month - 261 stars on GitHub - 1 maintainer
phonemeser 1.0.3
Predict speech emotions from wav files.2 versions - Latest release: over 3 years ago - 1 dependent repositories - 12 downloads last month - 4 stars on GitHub - 1 maintainer
slg-nimrod 0.0.11
minimal deep learning framework11 versions - Latest release: about 1 year ago - 67 downloads last month - 2 stars on GitHub - 1 maintainer
swift-f0 0.1.2
Fast and accurate fundamental frequency (F0) detector using convolutional neural networks3 versions - Latest release: 2 months ago - 86 downloads last month - 38 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
3 versions - Latest release: about 5 years ago - 4 dependent repositories - 93 downloads last month - 437 stars on GitHub - 1 maintainer
surfboard 0.2.0
Novoic's audio feature extraction library https://novoic.com3 versions - Latest release: about 5 years ago - 4 dependent repositories - 93 downloads last month - 437 stars on GitHub - 1 maintainer
mexca 1.0.4
Emotion expression capture from multiple modalities.11 versions - Latest release: over 1 year ago - 60 downloads last month - 36 stars on GitHub - 1 maintainer
pyssp 0.1.9
python speech signal processing library for education25 versions - Latest release: over 8 years ago - 10 dependent repositories - 28 downloads last month - 18 stars on GitHub - 1 maintainer
npvcc2016 3.0.0
npvcc2016: Python loader of npVCC2016 speech corpus17 versions - Latest release: almost 5 years ago - 1 dependent repositories - 14 downloads last month - 0 stars on GitHub - 1 maintainer
resemble-enhance 0.0.1
Speech denoising and enhancement with deep learning5 versions - Latest release: almost 2 years ago - 6.76 thousand downloads last month - 1,941 stars on GitHub - 1 maintainer
deepvoice3_pytorch 0.1.0 💰
PyTorch implementation of convolutional networks-based text-to-speech synthesis models.6 versions - Latest release: almost 7 years ago - 2 dependent repositories - 163 downloads last month - 1,980 stars on GitHub - 1 maintainer
vbdiar 0.0.1
VB Diarization with Eigenvoice and HMM Priors1 version - Latest release: over 6 years ago - 1 dependent repositories - 5 downloads last month - 15 stars on GitHub - 1 maintainer
sfeatpy 0.0.1
Functions to extract signal speech parameters1 version - Latest release: over 4 years ago - 1 dependent repositories - 6 downloads last month - 2 stars on GitHub - 1 maintainer
myvocrec 1
Capturing Microphone Data into an Audio File1 version - Latest release: over 6 years ago - 1 dependent repositories - 14 downloads last month - 3 stars on GitHub - 1 maintainer
senko 0.1.0rc1 💰
Very fast speaker diarization1 version - Latest release: about 1 month ago - 93 stars on GitHub
silero-vad-fork 0.1.0
A packaged version of the Silero VAD model1 version - Latest release: about 2 years ago - 67 downloads last month - 4,206 stars on GitHub - 1 maintainer
Top 4.1% on pypi.org
16 versions - Latest release: 23 days ago - 8 dependent packages - 4 dependent repositories - 84.1 thousand downloads last month - 2,581 stars on GitHub - 3 maintainers
whisper-timestamped 1.15.9
Multi-lingual Automatic Speech Recognition (ASR) based on Whisper models, with accurate word time...16 versions - Latest release: 23 days ago - 8 dependent packages - 4 dependent repositories - 84.1 thousand downloads last month - 2,581 stars on GitHub - 3 maintainers
voicelab 2.0.0
Automated Reproducible Voice Analysis7 versions - Latest release: over 2 years ago - 1 dependent repositories - 57 downloads last month - 150 stars on GitHub - 1 maintainer
retentive-network 0.1.0
Unofficial codebase for the "Retentive Network: A Successor to Transformer for Large Language Mod...2 versions - Latest release: about 2 years ago - 15 downloads last month - 3,074 stars on GitHub - 1 maintainer
Related Keywords
python
21
speech
20
machine-learning
17
pytorch
14
speech-recognition
12
audio
10
deep-learning
10
audio-processing
9
speaker-diarization
9
voice-recognition
9
natural-language-processing
9
speech-synthesis
8
speech-to-text
7
speech-enhancement
7
tts
6
computer-vision
6
voice-activity-detection
6
python3
6
multimodal
5
speaker-verification
5
speaker-recognition
5
signal-processing
5
language-model
5
asr
5
pretrained-language-model
4
transformer
4
translation
4
voice
4
speech-analysis
4
vad
4
voice-commands
4
transformers
4
speechrecognition
4
text-to-speech
4
digital-signal-processing
3
feature-extraction
3
tensorflow
3
dsp
3
voice-control
3
huggingface
3
speech-separation
3
speech-toolkit
3
onnx
3
spoken-language-understanding
3
scale
2
semi-supervised-learning
2
python-wrapper
2
voice-detection
2
SPTK
2
silero-vad
2
healthcare
2
optimisation
2
neural-network
2
language-detection
2
Natural Language Processing and Understanding
2
speech signal processing
2
nlp
2
emotion-recognition
2
TTS
2
phonemes
2
linguistics
2
any
2
audio-analysis
2
at
2
phonetics
2
corpus
2
Transformers
2
denoise
2
pyannote
2
attention-mechanism
2
mfcc
2
parkinsons-disease
2
onnxruntime
2
deep-neural-networks
2
text-processing
2
alzheimers-disease
2
sptk
2
onnx-runtime
2
multimodal-deep-learning
1
image-processing
1
graph-algorithms
1
generative-model
1
bayesian-methods
1
probabilistic-graphical-models
1
probabilistic-programming
1
variational-autoencoder
1
variational-autoencoders
1
disentangled-representations
1
disentanglement-learning
1
factor-vae
1
recognition
1
transcription
1
api
1
dhvagna-npi
1
github
1
medium-article
1
diarization
1
metrics
1
speech-emotion-recognition
1
language
1