Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

conda-forge.org "nlp" keyword

python-flair 0.11.3
A very simple framework for state-of-the-art Natural Language Processing (NLP)
10 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 12,581 stars on GitHub
small-text 1.1.1
Active Learning for Text Classification in Python
4 versions - Latest release: over 1 year ago - 426 stars on GitHub
thinc 8.1.5
A refreshing functional take on deep learning, compatible with your favorite libraries
45 versions - Latest release: over 1 year ago - 2 dependent packages - 63 dependent repositories - 2,684 stars on GitHub
nlpaug 1.1.11 πŸ’°
This python library helps you with augmenting NLP for your machine learning projects. `Augmenter...
7 versions - Latest release: almost 2 years ago - 3,846 stars on GitHub
postal 1.1.9
The official Python bindings to libpostal, a fast statistical parser/normalizer for street addres...
3 versions - Latest release: about 2 years ago - 664 stars on GitHub
textacy 0.11.0
textacy is a Python library for performing higher-level natural language processing (NLP) tasks, ...
11 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 2,042 stars on GitHub
Top 4.1% on conda-forge.org
datasets 2.7.0
Datasets is a lightweight library providing one-line dataloaders for many public datasets and one...
34 versions - Latest release: over 1 year ago - 13 dependent packages - 29 dependent repositories - 15,569 stars on GitHub
spacy-transformers 1.1.8
This package provides spaCy components and architectures to use transformer models via Hugging Fa...
7 versions - Latest release: over 1 year ago - 8 dependent packages - 2 dependent repositories - 1,219 stars on GitHub
razdel 0.5.0
Rule-based token, sentence segmentation for Russian language
1 version - Latest release: over 2 years ago - 2 dependent packages - 213 stars on GitHub
slovnet 0.5.0
Deep Learning based NLP modeling for Russian language
1 version - Latest release: over 2 years ago - 1 dependent package - 178 stars on GitHub
yargy 0.15.0
Rule-based facts extraction for Russian language
1 version - Latest release: over 2 years ago - 1 dependent package - 290 stars on GitHub
navec 0.10.0
Compact high quality word embeddings for Russian language
1 version - Latest release: over 2 years ago - 2 dependent packages - 132 stars on GitHub
jamotools 0.1.10
A library for Korean Jamo split and vectorize.
1 version - Latest release: over 4 years ago - 22 stars on GitHub
usaddress 0.5.10
:us: a python library for parsing unstructured United States address strings into address components
2 versions - Latest release: over 1 year ago - 1,410 stars on GitHub
farm 0.8.0
:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the ind...
2 versions - Latest release: almost 3 years ago - 1,645 stars on GitHub
konoha 5.3.0 πŸ’°
Konoha is a Python library for providing easy-to-use integrated interface of various Japanese tok...
10 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 190 stars on GitHub
Top 6.6% on conda-forge.org
tokenizers 0.13.1
πŸ’₯ Fast State-of-the-Art Tokenizers optimized for Research and Production
16 versions - Latest release: over 1 year ago - 6 dependent packages - 35 dependent repositories - 6,601 stars on GitHub
tika 1.23.1 πŸ’°
Tika-Python is a Python binding to the Apache Tikaβ„’ REST services allowing Tika to be called nati...
11 versions - Latest release: about 4 years ago - 2 dependent repositories - 1,253 stars on GitHub
Top 1.8% on conda-forge.org
nltk 3.6.7
NLTK Source
15 versions - Latest release: over 2 years ago - 43 dependent packages - 717 dependent repositories - 11,675 stars on GitHub
knockknock 0.1.7
πŸšͺ✊Knock Knock: Get notified when your training ends with only two additional lines of code
5 versions - Latest release: over 4 years ago - 1 dependent repositories - 2,570 stars on GitHub
nncf 2.3.0
Neural Network Compression Framework for enhanced OpenVINOβ„’ inference
6 versions - Latest release: almost 2 years ago - 590 stars on GitHub
Top 1.6% on conda-forge.org
transformers 4.24.0
πŸ€— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
68 versions - Latest release: over 1 year ago - 24 dependent packages - 101 dependent repositories - 86,720 stars on GitHub
allennlp 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
16 versions - Latest release: almost 2 years ago - 6 dependent packages - 11,429 stars on GitHub
allennlp-checklist 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
4 versions - Latest release: almost 2 years ago - 1 dependent package - 11,429 stars on GitHub
allennlp-all 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
4 versions - Latest release: almost 2 years ago - 1 dependent package - 11,429 stars on GitHub
flashtext 2.7
Extract Keywords from sentence or Replace keywords in sentences.
1 version - Latest release: almost 4 years ago - 2 dependent packages - 2 dependent repositories - 5,373 stars on GitHub
abydos 0.5.0
Abydos includes phonetic algorithms, such as Soundex, (Double) Metaphone, & NYSIIS, string distan...
7 versions - Latest release: over 4 years ago - 154 stars on GitHub
pylexique 1.5.0 πŸ’°
Pylexique is a Python wrapper around Lexique383. It allows to extract lexical information from mo...
16 versions - Latest release: over 2 years ago - 3 stars on GitHub
textnets 0.8.5
textnets represents collections of texts as networks of documents and words. This provides novel ...
15 versions - Latest release: over 1 year ago - 251 stars on GitHub
sumy 0.11.0 πŸ’°
Module for automatic summarization of text documents and HTML pages.
4 versions - Latest release: over 1 year ago - 1 dependent repositories - 3,090 stars on GitHub
neuralcoref 4.0
✨Fast Coreference Resolution in spaCy with Neural Networks
1 version - Latest release: about 4 years ago - 1 dependent repositories - 2,659 stars on GitHub
lingua-language-detector 1.1.3
The most accurate natural language detection library for Python, suitable for long and short text...
3 versions - Latest release: over 1 year ago - 452 stars on GitHub
numerizer 0.2.1
A Python module to convert natural language numerics into ints and floats.
1 version - Latest release: over 1 year ago - 201 stars on GitHub
chemdataextractor 1.3.0
Automatically extract chemical information from scientific documents
1 version - Latest release: almost 6 years ago - 1 dependent package - 1 dependent repositories - 239 stars on GitHub
pyphonetics 0.5.3
Pyphonetics is a Python 3 library for phonetic algorithms. Right now, the following algorithms ar...
1 version - Latest release: over 1 year ago - 93 stars on GitHub
freediscovery 1.3.1
FreeDiscovery is built on top of existing machine learning libraries (scikit-learn) and exposes a...
5 versions - Latest release: almost 6 years ago - 65 stars on GitHub
ecco 0.1.2
Ecco is a python library for explaining Natural Language Processing models using interactive visu...
3 versions - Latest release: over 2 years ago - 1,629 stars on GitHub
scattertext 0.1.9
A tool for finding distinguishing terms in small-to-medium-sized corpora, and presenting them in ...
17 versions - Latest release: over 1 year ago - 1 dependent repositories - 2,057 stars on GitHub
textattack 0.3.0
TextAttack πŸ™ is a Python framework for adversarial attacks, data augmentation, and model trainin...
4 versions - Latest release: almost 3 years ago - 2,273 stars on GitHub
syntok 1.4.2
Syntok is the successor of an earlier, very similar tool, segtok, but has evolved significantly i...
5 versions - Latest release: about 2 years ago - 168 stars on GitHub
bert-tensorflow 1.0.4
TensorFlow code and pre-trained models for BERT
3 versions - Latest release: over 3 years ago - 33,506 stars on GitHub
cld2-cffi 0.1.4
CFFI bindings around Google Chromium's embedded compact language detection library (CLD2)
1 version - Latest release: over 1 year ago - 3 dependent packages - 33 stars on GitHub
spacy-model-en_core_web_lg 3.4.0
Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.
11 versions - Latest release: almost 2 years ago - 5 dependent repositories - 1,269 stars on GitHub
spacy-model-en_core_web_md 3.4.0
Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.
11 versions - Latest release: almost 2 years ago - 3 dependent repositories - 1,269 stars on GitHub
spacy-model-en_core_web_trf 3.4.0
Components: transformer, tagger, parser, ner, attribute_ruler, lemmatizer.
3 versions - Latest release: almost 2 years ago - 1 dependent repositories - 1,269 stars on GitHub
spacy-model-en_vectors_web_lg 2.1.0
πŸ’« Models for the spaCy Natural Language Processing (NLP) library
2 versions - Latest release: about 5 years ago - 1,269 stars on GitHub
spacy-loggers 1.0.3
Enables alternatives to spaCy's built-in console logger
4 versions - Latest release: over 1 year ago - 2 dependent packages - 12 dependent repositories - 10 stars on GitHub
sense2vec 2.0.0
πŸ¦† Contextually-keyed word vectors
3 versions - Latest release: almost 3 years ago - 1,480 stars on GitHub
spacy-lookups-data 1.0.0
This package contains additional data files to be used with spaCy v2.2+. When it's installed in t...
4 versions - Latest release: about 3 years ago - 1 dependent repositories - 79 stars on GitHub
pyhacrf-datamade 0.2.6
:triangular_ruler: Hidden alignment conditional random field for classifying string pairs.
2 versions - Latest release: over 2 years ago - 1 dependent package - 24 stars on GitHub
spacy-model-en_core_web_sm 3.4.0
Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.
11 versions - Latest release: almost 2 years ago - 15 dependent repositories - 1,269 stars on GitHub
Top 6.9% on conda-forge.org
textblob 0.15.3
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extrac...
7 versions - Latest release: about 5 years ago - 4 dependent packages - 24 dependent repositories - 8,491 stars on GitHub
stanza 1.4.2
Official Stanford NLP Python Library for Many Human Languages
7 versions - Latest release: over 1 year ago - 1 dependent package - 6 dependent repositories - 6,537 stars on GitHub
tango-fairscale 0.10.1
AI2 Tango is a platform that allows you to build machine learning experiments out of steps that c...
7 versions - Latest release: over 1 year ago - 1 dependent package - 315 stars on GitHub
tango-torch 0.10.1
AI2 Tango is a platform that allows you to build machine learning experiments out of steps that c...
9 versions - Latest release: over 1 year ago - 1 dependent package - 315 stars on GitHub
tango-pytorch_lightning 0.10.1
AI2 Tango is a platform that allows you to build machine learning experiments out of steps that c...
9 versions - Latest release: over 1 year ago - 1 dependent package - 315 stars on GitHub
tango-transformers 0.10.1
AI2 Tango is a platform that allows you to build machine learning experiments out of steps that c...
8 versions - Latest release: over 1 year ago - 1 dependent package - 315 stars on GitHub
tango-beaker 0.10.1
AI2 Tango is a platform that allows you to build machine learning experiments out of steps that c...
3 versions - Latest release: over 1 year ago - 1 dependent package - 315 stars on GitHub
tango-datasets 0.10.1
AI2 Tango is a platform that allows you to build machine learning experiments out of steps that c...
9 versions - Latest release: over 1 year ago - 1 dependent package - 315 stars on GitHub
tango-wandb 0.10.1
AI2 Tango is a platform that allows you to build machine learning experiments out of steps that c...
9 versions - Latest release: over 1 year ago - 1 dependent package - 315 stars on GitHub
allennlp-semparse 0.0.4
A framework for building semantic parsers (including neural module networks) with AllenNLP, built...
3 versions - Latest release: about 2 years ago - 105 stars on GitHub
allennlp-models 2.9.3
Officially supported AllenNLP models
11 versions - Latest release: about 2 years ago - 1 dependent package - 471 stars on GitHub
mlconjug 3.4.0
A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian...
9 versions - Latest release: almost 5 years ago - 64 stars on GitHub
natasha 1.4.0
Solves basic Russian NLP tasks, API for lower level Natasha projects
1 version - Latest release: over 2 years ago - 1,030 stars on GitHub
Top 1.6% on conda-forge.org
spacy 3.4.3
spaCy is a library for advanced natural language processing in Python and Cython.
68 versions - Latest release: over 1 year ago - 92 dependent packages - 174 dependent repositories - 25,557 stars on GitHub
ipymarkup 0.9.0
NER, syntax markup visualizations
1 version - Latest release: over 2 years ago - 1 dependent package - 118 stars on GitHub
youtokentome 1.0.6
Unsupervised text tokenizer focused on computational efficiency
4 versions - Latest release: over 2 years ago - 859 stars on GitHub
language_tool_python 2.5.1
a free python grammar checker πŸ“βœ…
3 versions - Latest release: over 3 years ago - 1 dependent package - 323 stars on GitHub
cleantext 1.1.4
An open-source package for python to clean raw text data
1 version - Latest release: almost 2 years ago - 46 stars on GitHub
nerval 1.0.9
Python framework to evaluate Named Entity Recognition (NER) models. Creates entity-level confusio...
1 version - Latest release: about 2 years ago - 4 stars on GitHub
rubrix 0.18.0
Rubrix is a **production-ready Python framework for exploring, annotating, and managing data** in...
21 versions - Latest release: over 1 year ago - 1 dependent package - 1,710 stars on GitHub
lemminflect 0.2.2
A python module for English lemmatization and inflection.
2 versions - Latest release: over 2 years ago - 1 dependent package - 3 dependent repositories - 202 stars on GitHub
word2number 1.1
This is a Python module to convert number words (eg. twenty one) to numeric digits (21). It works...
1 version - Latest release: almost 5 years ago - 2 dependent packages - 1 dependent repositories - 145 stars on GitHub
autocorrect 2.6.1
Spelling corrector in python
1 version - Latest release: over 1 year ago - 352 stars on GitHub
tango-all 0.10.1
AI2 Tango is a platform that allows you to build machine learning experiments out of steps that c...
9 versions - Latest release: over 1 year ago - 315 stars on GitHub
sacremoses 0.0.53
Python port of Moses tokenizer, truecaser and normalizer
11 versions - Latest release: almost 2 years ago - 2 dependent packages - 33 dependent repositories - 446 stars on GitHub
Top 2.7% on conda-forge.org
gensim 4.2.0 πŸ’°
Gensim is a Python library for topic modelling, document indexing and similarity retrieval with l...
18 versions - Latest release: almost 2 years ago - 17 dependent packages - 105 dependent repositories - 14,085 stars on GitHub
bertopic 0.12.0
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
4 versions - Latest release: over 1 year ago - 1 dependent package - 3,956 stars on GitHub
rubrix-server 0.18.0
Rubrix is a **production-ready Python framework for exploring, annotating, and managing data** in...
15 versions - Latest release: over 1 year ago - 1,710 stars on GitHub
tango 0.10.1
AI2 Tango is a platform that allows you to build machine learning experiments out of steps that c...
9 versions - Latest release: over 1 year ago - 8 dependent packages - 2 dependent repositories - 315 stars on GitHub
libpostal 1.1.0
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP...
4 versions - Latest release: about 4 years ago - 1 dependent package - 3,669 stars on GitHub
stripnet 0.0.7
Leverage the power of NLP Topic Modeling, Semantic Similarity and Network analysis to study the t...
2 versions - Latest release: about 2 years ago - 82 stars on GitHub
unifunc 1.4.5
UniFunc is a text mining tool that processes and analysis text similarity between a pair of prote...
7 versions - Latest release: about 2 years ago - 4 stars on GitHub
multi_rake 0.0.2
Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python
2 versions - Latest release: over 2 years ago - 240 stars on GitHub
lexicalrichness 0.3.0
:smile_cat: :speech_balloon: A module to compute textual lexical richness (aka lexical diversity).
4 versions - Latest release: over 1 year ago - 1 dependent repositories - 50 stars on GitHub
bpemb 0.3.3
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
3 versions - Latest release: almost 3 years ago - 1 dependent package - 1 dependent repositories - 1,112 stars on GitHub
transformers-interpret 0.8.1
Transformers Interpret is a model explainability tool designed to work exclusively with the :hugs...
1 version - Latest release: over 1 year ago - 981 stars on GitHub
datefinder 0.7.3 πŸ’°
Find dates inside text using Python and get back datetime objects
2 versions - Latest release: over 1 year ago - 1 dependent package - 2 dependent repositories - 578 stars on GitHub
jamo 0.4.1
Python-jamo is a Python Hangul syllable decomposition and synthesis library for working with Hang...
1 version - Latest release: over 1 year ago - 1 dependent package - 85 stars on GitHub
unidic-lite 1.0.8 πŸ’°
A small version of UniDic for easy pip installs.
1 version - Latest release: about 3 years ago - 1 dependent package - 24 stars on GitHub
eli5 0.13.0
ELI5 is a Python package which helps to debug machine learning classifiers and explain their pred...
11 versions - Latest release: almost 2 years ago - 1 dependent package - 14 dependent repositories - 2,652 stars on GitHub
r-udpipe 0.8.10
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based o...
4 versions - Latest release: over 1 year ago - 202 stars on GitHub
Related Keywords
python 52 natural-language-processing 45 machine-learning 44 pytorch 25 spacy 17 deep-learning 15 ai 12 data-science 11 python3 9 bert 7 text-mining 6 russian 6 artificial-intelligence 6 models 5 spacy-models 5 statistical-models 5 hacktoberfest 5 ner 5 word2vec 5 nlp-library 5 visualization 5 text-classification 5 tensorflow 5 machine-learning-models 5 transformers 5 topic-modeling 4 tokenizer 4 named-entity-recognition 4 active-learning 3 language-model 3 natural-language-understanding 3 embeddings 3 neural-networks 3 spacy-extension 3 nlp-machine-learning 3 information-retrieval 3 information-extraction 3 tokenization 3 morphology 3 word-embeddings 3 lemmatization 3 computer-vision 3 neural-network 3 address-parser 3 address 3 language-models 3 network-analysis 2 text-as-data 2 japanese 2 dependency-parser 2 data-mining 2 pretrained-models 2 computational-social-science 2 python-library 2 text-annotation 2 text-labeling 2 weak-supervision 2 weakly-supervised-learning 2 mlops 2 knowledge-graph 2 human-in-the-loop 2 developer-tools 2 dataops 2 annotation-tool 2 keyword-extraction 2 multilingual 2 nltk 2 phonetic-algorithms 2 soundex 2 text-extraction 2 spellchecker 2 hangul 2 quantization 2 levenshtein-distance 2 spacy-pipeline 2 syntax 2 google 2 sentence-segmentation 2 sequence-labeling 2 allennlp 2 korean 2 transfer-learning 2 jax 2 gensim 2 international 2 sentence-boundary-detection 2 conditional-random-fields 2 language-detection 2 adversarial-attacks 2 classification-report 1 conjugation 1 confusion-matrix 1 python-2 1 pattern 1 f1-score 1 precison-recall 1 string-distance 1 sequence-labeling-evaluation 1 edit-distance 1 sense2vec 1