An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "natural-language-processing" keyword

View the packages on the pypi.org package registry that are tagged with the "natural-language-processing" keyword.
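Each entry in this listing ends with a summary line of dash-separated counts (versions, latest release, dependents, downloads, stars, maintainers). As a rough sketch of how such a line could be turned into structured data, the snippet below parses one into a dictionary. The function name `parse_stats` and the output key names are assumptions made for illustration; they are not part of the service's API.

```python
import re

# Multipliers for the abbreviated counts used in the listing
# (for example "28.9 thousand" or "33.2 million").
_SCALE = {"thousand": 1_000, "million": 1_000_000}

# Label prefixes seen in the listing mapped to output keys.
# The key names are illustrative assumptions, not an official schema.
_LABELS = {
    "version": "versions",
    "dependent package": "dependent_packages",
    "dependent repositor": "dependent_repositories",
    "downloads last month": "downloads_last_month",
    "star": "github_stars",
    "maintainer": "maintainers",
}

# One "count [scale] label" field, e.g. "1,440 dependent packages".
_FIELD = re.compile(r"^([\d.,]+)(?: (thousand|million))? (.+)$")


def parse_stats(line):
    """Parse one summary line from the listing into a dict (hypothetical helper)."""
    stats = {}
    for part in line.split(" - "):
        part = part.strip()
        if part.startswith("Latest release:"):
            # Keep the relative date as text; the listing gives no absolute date.
            stats["latest_release"] = part.split(":", 1)[1].strip()
            continue
        match = _FIELD.match(part)
        if match is None:
            continue  # skip anything that does not look like a count
        number, scale, label = match.groups()
        value = round(float(number.replace(",", "")) * _SCALE.get(scale, 1))
        for prefix, key in _LABELS.items():
            if label.startswith(prefix):
                stats[key] = value
                break
    return stats


if __name__ == "__main__":
    example = (
        "32 versions - Latest release: about 2 years ago - 18 dependent packages"
        " - 436 dependent repositories - 28.9 thousand downloads last month"
        " - 2,214 stars on GitHub - 1 maintainer"
    )
    print(parse_stats(example))
```

Abbreviated counts such as "28.9 thousand" are rounded figures, so the parsed values are approximations of the real download numbers.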

textwiser 2.0.2
TextWiser: Text Featurization Library
9 versions - Latest release: 5 months ago - 1 dependent package - 1 dependent repository - 543 downloads last month - 49 stars on GitHub - 3 maintainers
indoxminer 0.1.5
Indox Data Extraction
19 versions - Latest release: 2 months ago - 785 downloads last month - 20 stars on GitHub - 2 maintainers
Top 1.0% on pypi.org
textacy 0.13.0
NLP, before and after spaCy
32 versions - Latest release: about 2 years ago - 18 dependent packages - 436 dependent repositories - 28.9 thousand downloads last month - 2,214 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
lightning-bolts 0.7.0
Lightning Bolts is a community contribution for ML researchers.
11 versions - Latest release: almost 2 years ago - 21 dependent packages - 299 dependent repositories - 40.6 thousand downloads last month - 1,688 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
clean-text 0.6.0
Functions to preprocess and normalize text.
7 versions - Latest release: about 3 years ago - 15 dependent packages - 97 dependent repositories - 71.8 thousand downloads last month - 975 stars on GitHub - 1 maintainer
tei2neo 0.6.1 💰
TEI (Text Encoding Initiative) parser to extract information and store it in Neo4j database
11 versions - Latest release: 6 months ago - 1 dependent repository - 300 downloads last month - 1,612 stars on GitHub - 1 maintainer
Top 3.7% on pypi.org
lit-nlp 1.3.1
🔥LIT: The Learning Interpretability Tool
20 versions - Latest release: 4 months ago - 2 dependent packages - 6 dependent repositories - 7.7 thousand downloads last month - 3,538 stars on GitHub - 6 maintainers
verbecc 1.9.7
Verbs Completely Conjugated: machine learning conjugator for Catalan, French, Italian, Portuguese...
56 versions - Latest release: over 1 year ago - 6 dependent repositories - 1.48 thousand downloads last month - 72 stars on GitHub - 1 maintainer
noisemix 0.1.1
NoiseMix is a library for data generation for text datasets.
2 versions - Latest release: almost 7 years ago - 1 dependent repository - 120 downloads last month - 40 stars on GitHub - 1 maintainer
Top 0.7% on pypi.org
allennlp 2.10.1
An open-source NLP research library, built on PyTorch.
265 versions - Latest release: over 2 years ago - 23 dependent packages - 1,128 dependent repositories - 66.8 thousand downloads last month - 11,697 stars on GitHub - 3 maintainers
Top 0.2% on pypi.org
nltk 3.9.1
Natural Language Toolkit
63 versions - Latest release: 8 months ago - 1,440 dependent packages - 57,572 dependent repositories - 33.2 million downloads last month - 13,994 stars on GitHub - 5 maintainers
Top 1.2% on pypi.org
spacy-lookups-data 1.0.5
Additional lookup tables and data resources for spaCy
18 versions - Latest release: over 1 year ago - 7 dependent packages - 118 dependent repositories - 89 thousand downloads last month - 98 stars on GitHub - 3 maintainers
Top 0.6% on pypi.org
pytorch-pretrained-bert 0.6.2
PyTorch version of Google AI BERT model with script to load Google pre-trained models
10 versions - Latest release: almost 6 years ago - 11 dependent packages - 940 dependent repositories - 74.9 thousand downloads last month - 129,185 stars on GitHub - 1 maintainer
Top 0.6% on pypi.org
rasa 3.6.21
Open source machine learning framework to automate text- and voice-based conversations: NLU, dial...
374 versions - Latest release: 3 months ago - 7 dependent packages - 584 dependent repositories - 230 thousand downloads last month - 19,969 stars on GitHub - 3 maintainers
relevanceai-dev 0.17.0
Home of the AI workforce - Multi-agent system, AI agents & tools
1,005 versions - Latest release: over 3 years ago - 1 dependent repository - 15.4 thousand downloads last month - 227 stars on GitHub - 1 maintainer
blendsql 0.0.141
Query language for blending SQL logic and LLM reasoning across multi-modal data. [Findings of ACL...
26 versions - Latest release: 11 months ago - 1.44 thousand downloads last month - 95 stars on GitHub - 1 maintainer
unstructured-cpu 0.15.1
A library that prepares raw documents for downstream ML tasks.
13 versions - Latest release: 8 months ago - 368 downloads last month - 10,877 stars on GitHub - 1 maintainer
Top 1.1% on pypi.org
spacy-transformers 1.3.8
spaCy pipelines for pre-trained BERT and other transformers
78 versions - Latest release: 2 months ago - 32 dependent packages - 192 dependent repositories - 225 thousand downloads last month - 1,377 stars on GitHub - 3 maintainers
Top 4.0% on pypi.org
autogluon.timeseries 1.1.1
Fast and Accurate ML in 3 Lines of Code
986 versions - Latest release: 10 months ago - 4 dependent repositories - 152 thousand downloads last month - 8,655 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
autogluon.tabular 1.1.1
Fast and Accurate ML in 3 Lines of Code
1,609 versions - Latest release: 10 months ago - 7 dependent packages - 44 dependent repositories - 214 thousand downloads last month - 7,185 stars on GitHub - 3 maintainers
Top 1.8% on pypi.org
autogluon.multimodal 1.1.1
Fast and Accurate ML in 3 Lines of Code
993 versions - Latest release: 10 months ago - 3 dependent packages - 15 dependent repositories - 163 thousand downloads last month - 6,566 stars on GitHub - 1 maintainer
Top 1.3% on pypi.org
autogluon.common 1.1.1
Fast and Accurate ML in 3 Lines of Code
1,200 versions - Latest release: 10 months ago - 11 dependent packages - 23 dependent repositories - 195 thousand downloads last month - 6,566 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
multimodal-transformers 0.4.0
Multimodal Extension Library for PyTorch HuggingFace Transformers
7 versions - Latest release: 7 months ago - 1 dependent repository - 600 downloads last month - 555 stars on GitHub - 2 maintainers
Top 1.9% on pypi.org
es-core-news-sm 3.1.0
Spanish pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attr...
2 versions - Latest release: over 3 years ago - 2 dependent packages - 43 dependent repositories - 1.4 thousand downloads last month - 98 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
bert-tensorflow 1.0.4
BERT
3 versions - Latest release: over 4 years ago - 2 dependent packages - 239 dependent repositories - 3.81 thousand downloads last month - 37,772 stars on GitHub - 1 maintainer
text2class 0.0.4
Multi-class text categorization using state-of-the-art pre-trained contextualized language models...
4 versions - Latest release: about 5 years ago - 1 dependent repository - 233 downloads last month - 21 stars on GitHub - 1 maintainer
pytextract 2.0.1
extract text from any document. no muss. no fuss.
1 version - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 85 downloads last month - 4,072 stars on GitHub - 1 maintainer
textract-edited-dependencies 0.0.2
extract text from any document. no muss. no fuss.
2 versions - Latest release: over 1 year ago - 98 downloads last month - 4,072 stars on GitHub - 1 maintainer
textract3 1.6.4.post1
extract text from any document. no muss. no fuss. (A fork with python3 support only)
1 version - Latest release: over 3 years ago - 1 dependent repository - 61 downloads last month - 3,754 stars on GitHub - 1 maintainer
discoursegraphs 0.4.14
graph-based processing of multi-level annotated corpora
18 versions - Latest release: about 4 years ago - 5 dependent repositories - 575 downloads last month - 49 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
rasa-nlu 0.15.1
Rasa NLU a natural language parser for bots
63 versions - Latest release: almost 6 years ago - 2 dependent packages - 223 dependent repositories - 6.43 thousand downloads last month - 17,893 stars on GitHub - 2 maintainers
Top 0.7% on pypi.org
flair 0.15.1 💰
A very simple framework for state-of-the-art NLP
34 versions - Latest release: 2 months ago - 39 dependent packages - 497 dependent repositories - 115 thousand downloads last month - 13,329 stars on GitHub - 1 maintainer
Top 0.1% on pypi.org
transformers 4.51.3
State-of-the-art Machine Learning for JAX, PyTorch and TensorFlow
179 versions - Latest release: 5 days ago - 2,589 dependent packages - 31,800 dependent repositories - 61.9 million downloads last month - 134,132 stars on GitHub - 4 maintainers
Top 2.3% on pypi.org
ludwig 0.10.4
Declarative machine learning: End-to-end machine learning pipelines using data-driven configurati...
56 versions - Latest release: 9 months ago - 1 dependent package - 119 dependent repositories - 2.7 thousand downloads last month - 10,876 stars on GitHub - 5 maintainers
augly-jp 2021.9.30
Data Augmentation for Japanese Text
6 versions - Latest release: over 3 years ago - 1 dependent repository - 277 downloads last month - 7 stars on GitHub - 1 maintainer
Top 8.2% on pypi.org
spacy-llm 0.7.3 💰
Integrating LLMs into structured NLP pipelines
23 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repository - 15.7 thousand downloads last month - 1,059 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
spark-nlp 5.5.3
John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML...
151 versions - Latest release: 3 months ago - 35 dependent packages - 35 dependent repositories - 4.22 million downloads last month - 3,717 stars on GitHub - 3 maintainers
picollm 1.3.0
picoLLM Inference Engine
9 versions - Latest release: about 1 month ago - 500 downloads last month - 154 stars on GitHub - 1 maintainer
picollmdemo 1.3.0
picoLLM Inference Engine demos
9 versions - Latest release: about 1 month ago - 400 downloads last month - 154 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
konoha 5.5.6 💰
Add your description here
28 versions - Latest release: 11 months ago - 3 dependent packages - 134 dependent repositories - 101 thousand downloads last month - 241 stars on GitHub - 1 maintainer
Top 0.2% on pypi.org
gensim 4.3.3 💰
Python framework for fast Vector Space Modelling
91 versions - Latest release: 9 months ago - 426 dependent packages - 13,895 dependent repositories - 4.89 million downloads last month - 15,255 stars on GitHub - 2 maintainers
easyeditor 0.0.1.dev0
easyeditor - Editing Large Language Models
1 version - Latest release: over 1 year ago - 91 downloads last month - 1,610 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
nlpaug 1.1.11 💰
Natural language processing augmentation library for deep neural networks
37 versions - Latest release: almost 3 years ago - 28 dependent packages - 141 dependent repositories - 172 thousand downloads last month - 4,399 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
bert-score 0.3.13
PyTorch implementation of BERT score
21 versions - Latest release: about 2 years ago - 51 dependent packages - 220 dependent repositories - 603 thousand downloads last month - 1,445 stars on GitHub - 2 maintainers
reghub-pack 0.1.6
6 versions - Latest release: about 1 year ago - 106 downloads last month - 0 stars on GitHub - 3 maintainers
megabots 0.0.11
🤖 Megabots provides State-of-the-art, production ready bots made mega-easy, so you don't have to ...
5 versions - Latest release: almost 2 years ago - 1 dependent repository - 274 downloads last month - 341 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
gluonnlp 0.10.0
MXNet Gluon NLP Toolkit
25 versions - Latest release: over 4 years ago - 9 dependent packages - 224 dependent repositories - 57.5 thousand downloads last month - 2,553 stars on GitHub - 4 maintainers
camel-ai 0.2.45
Communicative Agents for AI Society Study
84 versions - Latest release: 5 days ago - 32.3 thousand downloads last month - 5,151 stars on GitHub - 2 maintainers
openfactcheck 0.3.9
An Open-source Factuality Evaluation Demo for LLMs
28 versions - Latest release: 6 months ago - 1.48 thousand downloads last month - 27 stars on GitHub - 1 maintainer
isaacus 0.5.0
The official Python library for the isaacus API
13 versions - Latest release: about 13 hours ago - 934 downloads last month - 2 stars on GitHub - 1 maintainer
Top 3.2% on pypi.org
thinc-gpu-ops 0.0.4
CUDA kernels for Thinc
4 versions - Latest release: over 6 years ago - 2 dependent packages - 10 dependent repositories - 586 downloads last month - 2,840 stars on GitHub - 3 maintainers
Top 1.1% on pypi.org
huggingface-hub 0.30.2
Client library to download and publish models, datasets and other repos on the huggingface.co hub
167 versions - Latest release: 11 days ago - 665 dependent packages - 15,445 dependent repositories - 80.2 million downloads last month - 1,641 stars on GitHub - 4 maintainers
aquila-resolve 0.1.4
Augmented Neural English G2p converter with Inflectional Orthography.
4 versions - Latest release: over 2 years ago - 108 downloads last month - 7 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
seqeval 1.2.2 💰
Testing framework for sequence labeling
24 versions - Latest release: over 4 years ago - 108 dependent packages - 2,539 dependent repositories - 425 thousand downloads last month - 1,092 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
flaml 2.3.4
A fast library for automated machine learning and tuning
99 versions - Latest release: 2 months ago - 14 dependent packages - 388 dependent repositories - 434 thousand downloads last month - 3,842 stars on GitHub - 1 maintainer
minicons 0.3.29
A package of useful functions to analyze transformer based language models.
98 versions - Latest release: about 14 hours ago - 1 dependent package - 1 dependent repository - 4.68 thousand downloads last month - 136 stars on GitHub - 1 maintainer
bidaf-keras 1.0.0
Implementation of Bidirectional Attention Flow for Machine Comprehension in Keras 2
1 version - Latest release: almost 6 years ago - 3 dependent repositories - 81 downloads last month - 64 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
pythainlp 5.1.1
Thai Natural Language Processing library
113 versions - Latest release: 19 days ago - 37 dependent packages - 183 dependent repositories - 357 thousand downloads last month - 1,026 stars on GitHub - 2 maintainers
udon2 0.1.0
Prepare your UD trees to be served!
6 versions - Latest release: about 3 years ago - 3 dependent repositories - 896 downloads last month - 9 stars on GitHub - 1 maintainer
rasa-pro 3.12.6
State-of-the-art open-core Conversational AI framework for Enterprises that natively leverages ge...
95 versions - Latest release: 4 days ago - 12.3 thousand downloads last month - 19,964 stars on GitHub - 2 maintainers
Top 5.0% on pypi.org
libqutrub 1.2.4 💰
libqutrub Arabic verb conjugation library
5 versions - Latest release: almost 5 years ago - 2 dependent packages - 8 dependent repositories - 1.47 thousand downloads last month - 81 stars on GitHub - 1 maintainer
kiri 0.5.1
Kiri
30 versions - Latest release: about 4 years ago - 1 dependent repository - 1.13 thousand downloads last month - 243 stars on GitHub - 1 maintainer
Top 0.5% on pypi.org
textblob 0.19.0
Simple, Pythonic text processing. Sentiment analysis, part-of-speech tagging, noun phrase parsing...
60 versions - Latest release: 3 months ago - 93 dependent packages - 6,514 dependent repositories - 1.3 million downloads last month - 8,914 stars on GitHub - 1 maintainer
Top 0.7% on pypi.org
openvino-dev 2024.6.0
OpenVINO(TM) Development Tools
37 versions - Latest release: 4 months ago - 38 dependent packages - 498 dependent repositories - 147 thousand downloads last month - 6,310 stars on GitHub - 1 maintainer
Top 0.9% on pypi.org
thinc 9.1.1
A refreshing functional take on deep learning, compatible with your favorite libraries
243 versions - Latest release: 7 months ago - 40 dependent packages - 8,632 dependent repositories - 15.9 million downloads last month - 2,821 stars on GitHub - 3 maintainers
keras-hub 0.20.0
Industry-strength Natural Language Processing extensions for Keras.
37 versions - Latest release: 16 days ago - 72.1 thousand downloads last month - 886 stars on GitHub - 2 maintainers
kaamiki 0.0.0
Kaamiki is a simple machine learning framework for obvious tasks.
1 version - Latest release: over 4 years ago - 1 dependent repository - 56 downloads last month - 1 star on GitHub - 1 maintainer
pyqna 0.0.4
A simple python package for question answering
3 versions - Latest release: over 3 years ago - 128 downloads last month - 9 stars on GitHub - 1 maintainer
rutermextract 0.3
Term extraction for Russian language
3 versions - Latest release: over 7 years ago - 8 dependent repositories - 117 downloads last month - 88 stars on GitHub - 1 maintainer
Top 0.9% on pypi.org
textract 1.6.5
extract text from any document. no muss. no fuss.
18 versions - Latest release: about 3 years ago - 23 dependent packages - 739 dependent repositories - 207 thousand downloads last month - 3,754 stars on GitHub - 1 maintainer
rara-subject-indexer 3.0.0
Automatically detect subject indices.
11 versions - Latest release: about 18 hours ago - 473 downloads last month - 1,948 stars on GitHub - 1 maintainer
senta 2.0.0
A sentiment classification tool made by Baidu NLP.
5 versions - Latest release: almost 5 years ago - 1 dependent repository - 269 downloads last month - 1,958 stars on GitHub - 2 maintainers
Top 1.7% on pypi.org
trafilatura 2.0.0 💰
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction...
50 versions - Latest release: 5 months ago - 71 dependent packages - 63 dependent repositories - 944 thousand downloads last month - 4,118 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
nlp 0.4.0
HuggingFace/NLP is an open library of NLP datasets.
8 versions - Latest release: over 4 years ago - 4 dependent packages - 104 dependent repositories - 6.53 thousand downloads last month - 19,969 stars on GitHub - 2 maintainers
deltakg 0.0.6
A Library for Dynamically Editing PLMs-Based Knowledge Graph Embeddings.
6 versions - Latest release: almost 2 years ago - 209 downloads last month - 716 stars on GitHub - 1 maintainer
kgeditor 1.0.0
A library that provides a tool to edit model easily.
4 versions - Latest release: about 2 years ago - 60 downloads last month - 715 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
cherche 2.2.1
Neural Search
23 versions - Latest release: 11 months ago - 3 dependent repositories - 682 downloads last month - 296 stars on GitHub - 1 maintainer
Top 8.5% on pypi.org
matchzoo 2.2.0
Facilitating the design, comparison and sharing of deep text matching models.
4 versions - Latest release: over 5 years ago - 3 dependent repositories - 156 downloads last month - 3,819 stars on GitHub - 1 maintainer
Top 2.0% on pypi.org
clarifai 11.2.3
Clarifai Python SDK
195 versions - Latest release: 9 days ago - 29 dependent packages - 801 dependent repositories - 64.4 thousand downloads last month - 24 stars on GitHub - 1 maintainer
Top 0.7% on pypi.org
pytorch-transformers 1.2.0
Repository of pre-trained NLP Transformer models: BERT & RoBERTa, GPT & GPT-2, Transformer-XL, XL...
4 versions - Latest release: over 5 years ago - 16 dependent packages - 772 dependent repositories - 27.6 thousand downloads last month - 129,185 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
catalyst 22.2.1 💰
Catalyst. Accelerated deep learning R&D with PyTorch.
104 versions - Latest release: about 3 years ago - 13 dependent packages - 179 dependent repositories - 23.4 thousand downloads last month - 3,235 stars on GitHub - 1 maintainer
edu-convokit 0.4.0
Edu-ConvoKit: An Open-Source Framework for Education Conversation Data
2 versions - Latest release: over 1 year ago - 212 downloads last month - 91 stars on GitHub - 1 maintainer
rakun2 0.30
RaKUn 2.0; Better faster stronger lighter
12 versions - Latest release: about 22 hours ago - 1 dependent repository - 919 downloads last month - 66 stars on GitHub - 1 maintainer
transformers-domain-adaptation 0.3.1
Adapt Transformer-based language models to new text domains
6 versions - Latest release: about 4 years ago - 1 dependent repository - 269 downloads last month - 87 stars on GitHub - 1 maintainer
Top 0.1% on pypi.org
spacy 3.8.5 💰
Industrial-strength Natural Language Processing (NLP) in Python
216 versions - Latest release: 18 days ago - 873 dependent packages - 15,793 dependent repositories - 17.2 million downloads last month - 29,548 stars on GitHub - 3 maintainers
zensols.mimic 1.8.0
MIMIC III Corpus Parsing
18 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repository - 465 downloads last month - 1 star on GitHub - 1 maintainer
defsent 0.1.0
DefSent: Sentence Embeddings using Definition Sentences
1 version - Latest release: over 3 years ago - 1 dependent package - 1 dependent repository - 64 downloads last month - 20 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
argilla 2.8.0
The Argilla python server SDK
67 versions - Latest release: about 1 month ago - 10 dependent packages - 634 dependent repositories - 517 thousand downloads last month - 3,919 stars on GitHub - 1 maintainer
Top 9.8% on pypi.org
synergy-dataset 1.0.3 💰
Python package for the SYNERGY dataset
12 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repository - 6.44 thousand downloads last month - 76 stars on GitHub - 1 maintainer
Top 0.2% on pypi.org
datasets 3.5.0
HuggingFace community-driven open-source library of datasets
97 versions - Latest release: 23 days ago - 931 dependent packages - 14,962 dependent repositories - 26.5 million downloads last month - 19,203 stars on GitHub - 4 maintainers
forte.health 0.1.0
NLP pipeline framework for biomedical and clinical domains
1 version - Latest release: almost 3 years ago - 54 downloads last month - 10 stars on GitHub - 3 maintainers
pospairwordembeddings 0.0.4
POSPair Word Embeddings- Python framework for fast Vector Space Modelling
4 versions - Latest release: almost 6 years ago - 1 dependent repository - 192 downloads last month - 5 stars on GitHub - 1 maintainer
chariot 0.5.6
Deliver the ready-to-train data to your NLP model.
19 versions - Latest release: over 5 years ago - 1 dependent repository - 628 downloads last month - 121 stars on GitHub - 1 maintainer
camel-oasis 0.0.1
Open Agents Social Interaction Simulations on a Large Scale
1 version - Latest release: 1 day ago - 1,310 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
johnsnowlabs 5.5.5
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
171 versions - Latest release: about 1 month ago - 1 dependent package - 3 dependent repositories - 15.1 thousand downloads last month - 65 stars on GitHub - 2 maintainers
bent 0.0.80
BENT: Biomedical Entity Annotator
60 versions - Latest release: 7 months ago - 1 dependent repository - 1.53 thousand downloads last month - 9 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
stanza 1.10.1
A Python NLP Library for Many Human Languages, by the Stanford NLP Group
26 versions - Latest release: 4 months ago - 52 dependent packages - 450 dependent repositories - 352 thousand downloads last month - 7,278 stars on GitHub - 8 maintainers
charylu-tokenizer 0.0.6
Library of tokenizers created by Luis Chary
3 versions - Latest release: 8 months ago - 152 downloads last month - 9,580 stars on GitHub - 1 maintainer
tokenizers-gt 0.15.2.post0
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
3 versions - Latest release: about 1 year ago - 1.97 thousand downloads last month - 9,580 stars on GitHub - 1 maintainer
divyanx-tokenizers 0.20.0.dev0
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
1 version - Latest release: 8 months ago - 29 downloads last month - 9,580 stars on GitHub - 1 maintainer
Related Keywords
nlp 648 machine-learning 428 python 415 deep-learning 244 pytorch 188 data-science 113 NLP 111 spacy 99 bert 94 named-entity-recognition 92 artificial-intelligence 91 ai 85 transformers 82 transformer 75 computer-vision 69 text-classification 67 tensorflow 65 natural language processing 65 language-model 64 nlp-library 62 natural-language-understanding 61 llm 56 learning 52 hacktoberfest 52 ner 46 neural-network 42 text-processing 41 text 41 python3 39 language 38 sentiment-analysis 38 speech-recognition 38 transfer-learning 36 deep 36 large-language-models 34 information-extraction 34 information-retrieval 33 scikit-learn 32 automl 32 text-mining 31 linguistics 31 tokenizer 31 dataset 31 language-models 31 text-analysis 30 datasets 30 hyperparameter-optimization 29 structured-data 29 ml 29 pretrained-models 29 tokenization 29 object-detection 28 huggingface 28 question-answering 28 tabular-data 27 machine 26 computational-linguistics 26 gluon 25 machine learning 25 ensemble-learning 25 time-series 25 automated-machine-learning 25 seq2seq 25 forecasting 24 BERT 24 corpus 24 autogluon 24 chatbot 23 openai 23 model-hub 22 speech 22 jax 22 neural-networks 22 embeddings 22 pytorch-transformers 21 nlp-machine-learning 21 nlu 20 natural-language 20 topic-modeling 20 pos-tagging 20 deep learning 20 gpt 19 preprocessing 19 flax 18 natural-language-generation 18 word-embeddings 18 processing 18 api 17 word-segmentation 17 parser 17 natural 17 spacy-extension 17 gpt-4 17 relation-extraction 17 visualization 16 bot 16 Natural Language Processing 16 annotation-tool 16 machine-translation 16 keyword-extraction 16
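The related-keywords line above is a flat run of "keyword count" pairs, and some keywords contain spaces (for example "natural language processing 65"). A minimal sketch for splitting it into pairs follows; the helper name `parse_keyword_counts` is hypothetical, and the approach assumes that no keyword contains a standalone number as one of its words.

```python
import re

# A keyword (shortest possible run of text) followed by a space and an
# integer count that ends at the next space or at the end of the string.
_PAIR = re.compile(r"(\S.*?) (\d+)(?= |$)")


def parse_keyword_counts(text):
    """Split a flat 'keyword count keyword count ...' string into pairs.

    Hypothetical helper: assumes no keyword has a bare number as a word.
    """
    return [(name, int(count)) for name, count in _PAIR.findall(text)]


if __name__ == "__main__":
    sample = "nlp 648 machine-learning 428 natural language processing 65 gpt-4 17"
    for name, count in parse_keyword_counts(sample):
        print(f"{count:>5}  {name}")
```

Keywords that differ only in case or punctuation (such as "nlp" and "NLP") stay separate in this sketch, matching how the listing reports them.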