Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

conda-forge.org "natural-language-processing" keyword

r-udpipe 0.8.10
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based o...
4 versions - Latest release: over 1 year ago - 202 stars on GitHub
python-flair 0.11.3
A very simple framework for state-of-the-art Natural Language Processing (NLP)
10 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 12,581 stars on GitHub
small-text 1.1.1
Active Learning for Text Classification in Python
4 versions - Latest release: over 1 year ago - 426 stars on GitHub
thinc 8.1.5
A refreshing functional take on deep learning, compatible with your favorite libraries
45 versions - Latest release: over 1 year ago - 2 dependent packages - 63 dependent repositories - 2,684 stars on GitHub
d2l 0.17.6
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 400 u...
2 versions - Latest release: over 1 year ago - 16,875 stars on GitHub
nlpaug 1.1.11 💰
This python library helps you with augmenting NLP for your machine learning projects. `Augmenter...
7 versions - Latest release: almost 2 years ago - 3,846 stars on GitHub
flaml 1.0.14
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
31 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 2,337 stars on GitHub
textacy 0.11.0
textacy is a Python library for performing higher-level natural language processing (NLP) tasks, ...
11 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 2,042 stars on GitHub
pattern 3.6.0
Web mining module for Python, with tools for scraping, natural language processing, machine learn...
1 version - Latest release: over 4 years ago - 1 dependent package - 1 dependent repositories - 8,429 stars on GitHub
Top 4.1% on conda-forge.org
datasets 2.7.0
Datasets is a lightweight library providing one-line dataloaders for many public datasets and one...
34 versions - Latest release: over 1 year ago - 13 dependent packages - 29 dependent repositories - 15,569 stars on GitHub
spacy-transformers 1.1.8
This package provides spaCy components and architectures to use transformer models via Hugging Fa...
7 versions - Latest release: over 1 year ago - 8 dependent packages - 2 dependent repositories - 1,219 stars on GitHub
usaddress 0.5.10
:us: a python library for parsing unstructured United States address strings into address components
2 versions - Latest release: over 1 year ago - 1,410 stars on GitHub
nsbm 0.5.1
Package to run n-partite Stocastich Block Modeling.
7 versions - Latest release: about 2 years ago - 1 stars on GitHub
konoha 5.3.0 💰
Konoha is a Python library for providing easy-to-use integrated interface of various Japanese tok...
10 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 190 stars on GitHub
Top 6.6% on conda-forge.org
tokenizers 0.13.1
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
16 versions - Latest release: over 1 year ago - 6 dependent packages - 35 dependent repositories - 6,601 stars on GitHub
Top 1.8% on conda-forge.org
nltk 3.6.7
NLTK Source
15 versions - Latest release: over 2 years ago - 43 dependent packages - 717 dependent repositories - 11,675 stars on GitHub
knockknock 0.1.7
🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code
5 versions - Latest release: over 4 years ago - 1 dependent repositories - 2,570 stars on GitHub
Top 1.6% on conda-forge.org
transformers 4.24.0
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
68 versions - Latest release: over 1 year ago - 24 dependent packages - 101 dependent repositories - 86,720 stars on GitHub
seqeval 1.2.2 💰
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
5 versions - Latest release: over 3 years ago - 5 dependent repositories - 906 stars on GitHub
allennlp 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
16 versions - Latest release: almost 2 years ago - 6 dependent packages - 11,429 stars on GitHub
allennlp-checklist 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
4 versions - Latest release: almost 2 years ago - 1 dependent package - 11,429 stars on GitHub
allennlp-all 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
4 versions - Latest release: almost 2 years ago - 1 dependent package - 11,429 stars on GitHub
Top 5.2% on conda-forge.org
sentencepiece 0.1.96
SentencePiece is an unsupervised text tokenizer and detokenizer mainly for Neural Network-based t...
4 versions - Latest release: about 2 years ago - 14 dependent packages - 25 dependent repositories - 6,853 stars on GitHub
abydos 0.5.0
Abydos includes phonetic algorithms, such as Soundex, (Double) Metaphone, & NYSIIS, string distan...
7 versions - Latest release: over 4 years ago - 154 stars on GitHub
bert_score 0.3.9
BERT score for text generation
5 versions - Latest release: about 3 years ago - 1 dependent package - 1,064 stars on GitHub
r-tidytext 0.3.4
Text mining using tidy tools :sparkles::page_facing_up::sparkles:
14 versions - Latest release: over 1 year ago - 5 dependent packages - 1 dependent repositories - 1,114 stars on GitHub
r-doc2vec 0.2.0
Distributed Representations of Sentences and Documents
1 version - Latest release: over 2 years ago - 35 stars on GitHub
hunspell 1.7.0
Hunspell is the spell checker of LibreOffice, OpenOffice.org, Mozilla Firefox 3 & Thunderbird, Go...
3 versions - Latest release: over 5 years ago - 6 dependent packages - 1 dependent repositories - 1,730 stars on GitHub
r-word2vec 0.3.4
Distributed Representations of Words using word2vec
1 version - Latest release: over 2 years ago - 55 stars on GitHub
lingua-language-detector 1.1.3
The most accurate natural language detection library for Python, suitable for long and short text...
3 versions - Latest release: over 1 year ago - 452 stars on GitHub
chemdataextractor 1.3.0
Automatically extract chemical information from scientific documents
1 version - Latest release: almost 6 years ago - 1 dependent package - 1 dependent repositories - 239 stars on GitHub
doccano 1.6.2 💰
Open source annotation tool for machine learning practitioners.
7 versions - Latest release: about 2 years ago - 7,490 stars on GitHub
ecco 0.1.2
Ecco is a python library for explaining Natural Language Processing models using interactive visu...
3 versions - Latest release: over 2 years ago - 1,629 stars on GitHub
scattertext 0.1.9
A tool for finding distinguishing terms in small-to-medium-sized corpora, and presenting them in ...
17 versions - Latest release: over 1 year ago - 1 dependent repositories - 2,057 stars on GitHub
lit-nlp 13.0.0
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior ...
6 versions - Latest release: over 2 years ago - 1 dependent repositories - 3,085 stars on GitHub
textattack 0.3.0
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model trainin...
4 versions - Latest release: almost 3 years ago - 2,273 stars on GitHub
bert-tensorflow 1.0.4
TensorFlow code and pre-trained models for BERT
3 versions - Latest release: over 3 years ago - 33,506 stars on GitHub
spacy-model-en_core_web_lg 3.4.0
Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.
11 versions - Latest release: almost 2 years ago - 5 dependent repositories - 1,269 stars on GitHub
spacy-model-en_core_web_md 3.4.0
Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.
11 versions - Latest release: almost 2 years ago - 3 dependent repositories - 1,269 stars on GitHub
spacy-model-en_core_web_trf 3.4.0
Components: transformer, tagger, parser, ner, attribute_ruler, lemmatizer.
3 versions - Latest release: almost 2 years ago - 1 dependent repositories - 1,269 stars on GitHub
spacy-model-en_vectors_web_lg 2.1.0
💫 Models for the spaCy Natural Language Processing (NLP) library
2 versions - Latest release: about 5 years ago - 1,269 stars on GitHub
spacy-loggers 1.0.3
Enables alternatives to spaCy's built-in console logger
4 versions - Latest release: over 1 year ago - 2 dependent packages - 12 dependent repositories - 10 stars on GitHub
sense2vec 2.0.0
🦆 Contextually-keyed word vectors
3 versions - Latest release: almost 3 years ago - 1,480 stars on GitHub
spacy-lookups-data 1.0.0
This package contains additional data files to be used with spaCy v2.2+. When it's installed in t...
4 versions - Latest release: about 3 years ago - 1 dependent repositories - 79 stars on GitHub
skweak 0.3.3
skweak: A software toolkit for weak supervision applied to NLP tasks
4 versions - Latest release: over 1 year ago - 870 stars on GitHub
spacy-model-en_core_web_sm 3.4.0
Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.
11 versions - Latest release: almost 2 years ago - 15 dependent repositories - 1,269 stars on GitHub
quantulum3 0.7.11 💰
Python library for information extraction of quantities, measurements and their units from unstru...
1 version - Latest release: over 1 year ago - 105 stars on GitHub
geotext 0.4.0
Geotext extracts country and city mentions from text
2 versions - Latest release: over 5 years ago - 119 stars on GitHub
Top 6.9% on conda-forge.org
textblob 0.15.3
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extrac...
7 versions - Latest release: about 5 years ago - 4 dependent packages - 24 dependent repositories - 8,491 stars on GitHub
stanza 1.4.2
Official Stanford NLP Python Library for Many Human Languages
7 versions - Latest release: over 1 year ago - 1 dependent package - 6 dependent repositories - 6,537 stars on GitHub
conllu 4.5.2
CoNLL-U Parser parses a CoNLL-U formatted string into a nested python dictionary
38 versions - Latest release: over 1 year ago - 6 dependent packages - 290 stars on GitHub
Top 1.6% on conda-forge.org
spacy 3.4.3
spaCy is a library for advanced natural language processing in Python and Cython.
68 versions - Latest release: over 1 year ago - 92 dependent packages - 174 dependent repositories - 25,557 stars on GitHub
youtokentome 1.0.6
Unsupervised text tokenizer focused on computational efficiency
4 versions - Latest release: over 2 years ago - 859 stars on GitHub
marqo 0.5.6
Tensor search for humans.
13 versions - Latest release: over 1 year ago - 2,455 stars on GitHub
Top 9.6% on conda-forge.org
huggingface_hub 0.11.0
The `huggingface_hub` is a client library to interact with the Hugging Face Hub. The Hugging Face...
33 versions - Latest release: over 1 year ago - 16 dependent packages - 31 dependent repositories - 755 stars on GitHub
lightning-bolts 0.6.0
Toolbox of models, callbacks, and datasets for AI/ML researchers.
4 versions - Latest release: over 1 year ago - 3 dependent repositories - 1,508 stars on GitHub
nerval 1.0.9
Python framework to evaluate Named Entity Recognition (NER) models. Creates entity-level confusio...
1 version - Latest release: about 2 years ago - 4 stars on GitHub
rubrix 0.18.0
Rubrix is a **production-ready Python framework for exploring, annotating, and managing data** in...
21 versions - Latest release: over 1 year ago - 1 dependent package - 1,710 stars on GitHub
Top 2.7% on conda-forge.org
gensim 4.2.0 💰
Gensim is a Python library for topic modelling, document indexing and similarity retrieval with l...
18 versions - Latest release: almost 2 years ago - 17 dependent packages - 105 dependent repositories - 14,085 stars on GitHub
pykakasi 2.2.1 💰
`pykakasi` is a Python Natural Language Processing (NLP) library to transliterate _hiragana_, _ka...
7 versions - Latest release: over 2 years ago - 371 stars on GitHub
hazm 0.7.0
Python library for digesting Persian text.
1 version - Latest release: almost 5 years ago - 835 stars on GitHub
rubrix-server 0.18.0
Rubrix is a **production-ready Python framework for exploring, annotating, and managing data** in...
15 versions - Latest release: over 1 year ago - 1,710 stars on GitHub
libpostal 1.1.0
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP...
4 versions - Latest release: about 4 years ago - 1 dependent package - 3,669 stars on GitHub
stripnet 0.0.7
Leverage the power of NLP Topic Modeling, Semantic Similarity and Network analysis to study the t...
2 versions - Latest release: about 2 years ago - 82 stars on GitHub
auto-labeling-pipeline 0.1.21
doccano auto labeling pipeline helps doccano to annotate a document automatically.
1 version - Latest release: almost 3 years ago - 33 stars on GitHub
lexicalrichness 0.3.0
:smile_cat: :speech_balloon: A module to compute textual lexical richness (aka lexical diversity).
4 versions - Latest release: over 1 year ago - 1 dependent repositories - 50 stars on GitHub
bpemb 0.3.3
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
3 versions - Latest release: almost 3 years ago - 1 dependent package - 1 dependent repositories - 1,112 stars on GitHub
textract 1.6.5
extract text from any document. no muss. no fuss.
5 versions - Latest release: about 2 years ago - 1 dependent repositories - 3,425 stars on GitHub
transformers-interpret 0.8.1
Transformers Interpret is a model explainability tool designed to work exclusively with the :hugs...
1 version - Latest release: over 1 year ago - 981 stars on GitHub
Related Keywords
nlp 45 machine-learning 41 python 34 deep-learning 17 spacy 15 pytorch 14 data-science 13 hacktoberfest 9 artificial-intelligence 7 models 6 spacy-models 5 named-entity-recognition 5 word2vec 5 text-mining 5 statistical-models 5 machine-learning-models 5 tensorflow 5 ai 4 transformers 4 text-classification 4 topic-modeling 4 annotation-tool 4 computer-vision 4 text-annotation 4 weak-supervision 3 natural-language-understanding 3 language-model 3 bert 3 information-retrieval 3 jax 3 embeddings 3 active-learning 3 word-embeddings 3 sequence-labeling 3 nlp-library 3 data-mining 3 r-package 3 visualization 3 neural-network 3 datasets 2 dataops 2 gensim 2 network-analysis 2 developer-tools 2 google 2 word-segmentation 2 sequence-labeling-evaluation 2 address 2 address-parser 2 python-library 2 pretrained-models 2 japanese 2 model-hub 2 language-models 2 tokenization 2 gpt 2 nltk 2 neural-networks 2 mxnet 2 adversarial-attacks 2 data-labeling 2 information-extraction 2 r 2 lemmatization 2 hyperparameter-optimization 2 conll 2 weakly-supervised-learning 2 text-labeling 2 mlops 2 knowledge-graph 2 human-in-the-loop 2 python-3 1 logging 1 corenlp 1 semiotic-squares 1 scatter-plot 1 japanese-language 1 exploratory-data-analysis 1 eda 1 universal-dependencies 1 d3 1 dependency-parser 1 conll-u 1 fasttext 1 security 1 data-augmentation 1 gensim-word2vec 1 sense2vec 1 adversarial-machine-learning 1 adversarial-examples 1 distant-supervision 1 nlp-machine-learning 1 text-visualization 1 training-data 1 text-as-data 1 stylometry 1 quantities 1 stylometric 1 units-of-measure 1 sentiment 1