Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "data-extraction" keyword
taupe 1.2.0
Taupe: a tool to extract URLs from your personal Twitter archive4 versions - Latest release: over 1 year ago - 32 downloads last month - 27 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
32 versions - Latest release: over 1 year ago - 1 dependent repositories - 277 downloads last month - 1,447 stars on GitHub - 2 maintainers
pyoptimus 0.1.0
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion.32 versions - Latest release: over 1 year ago - 1 dependent repositories - 277 downloads last month - 1,447 stars on GitHub - 2 maintainers
scrappeycom 0.3.8
An API wrapper for Scrappey.com written in Python (cloudflare bypass & solver)11 versions - Latest release: 12 months ago - 46 downloads last month - 10 stars on GitHub - 1 maintainer
superpipe-py 0.1.8
build unstructured to structured data transformation pipelines7 versions - Latest release: about 2 months ago - 117 downloads last month - 99 stars on GitHub - 1 maintainer
labelkit 0.1.0
build unstructured to structured data transformation pipelines5 versions - Latest release: 4 months ago - 87 downloads last month - 98 stars on GitHub - 1 maintainer
xtweet 1.0.2
Es una biblioteca que te permite interactuar de manera eficiente con la API de Twitter.3 versions - Latest release: 10 months ago - 3 downloads last month - 2 stars on GitHub - 1 maintainer
yirabot 1.0.9
YiraBot: Simplifying Web Scraping for All. A user-friendly tool for developers and enthusiasts, o...20 versions - Latest release: 3 months ago - 217 downloads last month - 13 stars on GitHub - 1 maintainer
wiktionary-de-parser 0.11.5
Extracts data from German Wiktionary dump files.33 versions - Latest release: 4 months ago - 2 dependent repositories - 2.14 thousand downloads last month - 24 stars on GitHub - 1 maintainer
filmweb 0.9
Export movie ratings from filmweb.pl8 versions - Latest release: 6 months ago - 1 dependent repositories - 41 downloads last month - 12 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
9 versions - Latest release: 10 months ago - 1 dependent package - 4 dependent repositories - 865 downloads last month - 91 stars on GitHub - 1 maintainer
cyac 1.9
High performance Trie and Ahocorasick automata (AC automata) for python9 versions - Latest release: 10 months ago - 1 dependent package - 4 dependent repositories - 865 downloads last month - 91 stars on GitHub - 1 maintainer
tap-planetscaleapi 0.1.1 💰
`tap-planetscaleapi` is a Singer tap for PlanetScaleAPI, built with the Meltano Singer SDK.4 versions - Latest release: 4 months ago - 37 downloads last month - 0 stars on GitHub - 1 maintainer
vnstock3 0.3.0.1 💰
A comprehensive and transparent solution for Vietnamese stock market analysis.4 versions - Latest release: about 1 month ago - 795 downloads last month - 390 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
18 versions - Latest release: over 6 years ago - 19 dependent packages - 208 dependent repositories - 1.63 million downloads last month - 5,547 stars on GitHub - 1 maintainer
flashtext 2.7
Extract/Replaces keywords in sentences.18 versions - Latest release: over 6 years ago - 19 dependent packages - 208 dependent repositories - 1.63 million downloads last month - 5,547 stars on GitHub - 1 maintainer
data-extractor 0.10.2
Combine XPath, CSS Selectors and JSONPath for Web data extracting.41 versions - Latest release: almost 3 years ago - 1 dependent repositories - 154 downloads last month - 27 stars on GitHub - 1 maintainer
jsonpath-extractor 0.9.0
A selector expression for extracting data from JSON.16 versions - Latest release: 11 months ago - 1 dependent repositories - 8.87 thousand downloads last month - 37 stars on GitHub - 1 maintainer
newshound 0.0.1 💰
A future news extractor package for Python 31 version - Latest release: over 2 years ago - 1 dependent repositories - 25 downloads last month - 29 stars on GitHub - 1 maintainer
boututils 0.2.1
Python utilities for BOUT++12 versions - Latest release: 8 months ago - 3 dependent packages - 4 dependent repositories - 1.55 thousand downloads last month - 1 stars on GitHub - 4 maintainers
ricloud 3.2.0
Python client for Reincubate's ricloud API.43 versions - Latest release: over 4 years ago - 2 dependent repositories - 123 downloads last month - 91 stars on GitHub - 2 maintainers
inparse 0.1.1
Collaborative AI for Web Scraping, Data Extraction and Crawling,Knowledge Graph2 versions - Latest release: over 5 years ago - 1 dependent repositories - 14 downloads last month - 15 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
34 versions - Latest release: 4 months ago - 6 dependent repositories - 715 downloads last month - 117 stars on GitHub - 1 maintainer
sayn 0.6.13
Data-modelling and processing framework for automating Python and SQL tasks34 versions - Latest release: 4 months ago - 6 dependent repositories - 715 downloads last month - 117 stars on GitHub - 1 maintainer
Top 3.7% on pypi.org
48 versions - Latest release: about 1 year ago - 3 dependent packages - 19 dependent repositories - 42.8 thousand downloads last month - 409 stars on GitHub - 1 maintainer
amazoncaptcha 0.5.11
"Pure Python, lightweight, Pillow-based solver for the Amazon text captcha."48 versions - Latest release: about 1 year ago - 3 dependent packages - 19 dependent repositories - 42.8 thousand downloads last month - 409 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
17 versions - Latest release: 7 months ago - 4 dependent repositories - 912 downloads last month - 51 stars on GitHub - 1 maintainer
hext 1.0.8
A module and command-line utility to extract structured data from HTML17 versions - Latest release: 7 months ago - 4 dependent repositories - 912 downloads last month - 51 stars on GitHub - 1 maintainer
arachnio 0.0.0
Client library for interacting with Arachnio API1 version - Latest release: over 1 year ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
plotdigitizer 0.2.3
Extract raw data from plots images11 versions - Latest release: over 1 year ago - 2 dependent repositories - 244 downloads last month - 106 stars on GitHub - 1 maintainer
pa-scraper 0.2.4
Python wrapper for Prompt API's Scraper API9 versions - Latest release: over 3 years ago - 1 dependent repositories - 66 downloads last month - 5 stars on GitHub - 1 maintainer
zapr-athena-client 0.1
It is a python library to run the presto query on the AWS Athena.1 version - Latest release: about 3 years ago - 1 dependent repositories - 17 downloads last month - 1 stars on GitHub - 1 maintainer
journalpdfscraper 0.2.1
A project to check if articles are free or paid3 versions - Latest release: about 3 years ago - 1 dependent repositories - 35 downloads last month - 1 stars on GitHub - 1 maintainer
complex-parser 0.0.2
A versatile Python package for data extraction from JSON-like structures with user-defined format...3 versions - Latest release: 3 months ago - 19 downloads last month - 4 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
41 versions - Latest release: 8 months ago - 4 dependent packages - 3 dependent repositories - 21.5 thousand downloads last month - 258 stars on GitHub - 1 maintainer
vnstock 0.2.8 💰
Vietnam Stock Market Data41 versions - Latest release: 8 months ago - 4 dependent packages - 3 dependent repositories - 21.5 thousand downloads last month - 258 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
4 versions - Latest release: almost 9 years ago - 5 dependent repositories - 23 downloads last month - 500 stars on GitHub - 1 maintainer
libextract 0.0.12
A HT/XML web scraping tool4 versions - Latest release: almost 9 years ago - 5 dependent repositories - 23 downloads last month - 500 stars on GitHub - 1 maintainer
bbva2pandas 1.1.3
Parse BBVA monthly reports directly to a Dataframe6 versions - Latest release: 3 months ago - 1 dependent repositories - 164 downloads last month - 5 stars on GitHub - 1 maintainer
tcx-extract 0.1.2
A speed-optimized tcx data extractor.4 versions - Latest release: 3 months ago - 28 downloads last month - 0 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
83 versions - Latest release: almost 4 years ago - 8 dependent repositories - 10.7 thousand downloads last month - 1,441 stars on GitHub - 2 maintainers
optimuspyspark 2.2.32
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion wi...83 versions - Latest release: almost 4 years ago - 8 dependent repositories - 10.7 thousand downloads last month - 1,441 stars on GitHub - 2 maintainers
hivehoney 1.0.4
Client-less data retrieval from Hive.5 versions - Latest release: over 5 years ago - 1 dependent repositories - 18 downloads last month - 3 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
12 versions - Latest release: 8 months ago - 1 dependent package - 5 dependent repositories - 1.03 thousand downloads last month - 0 stars on GitHub - 4 maintainers
boutdata 0.2.1
Python package for collecting BOUT++ data12 versions - Latest release: 8 months ago - 1 dependent package - 5 dependent repositories - 1.03 thousand downloads last month - 0 stars on GitHub - 4 maintainers
Related Keywords
python
9
data-analysis
5
web-scraping
5
python3
5
data-science
4
classification
3
bigdata
3
machine-learning
3
nlp
3
data
3
data-engineering
3
webscraping
2
bout++
2
html
2
parser
2
bout
2
keyword-extraction
2
plasma
2
text-extraction
2
article-extractor
2
sql
2
data-labeling
2
llm
2
llm-evaluation
2
llm-optimization
2
structured-data
2
search-in-text
2
json
2
jsonpath
2
scraping
2
data-extractor
2
stock-screener
2
stock-market
2
elt
2
data-mining
2
extract
2
hive
2
twitter
2
scraper
2
datacleaner
2
data-wrangling
2
data-cleansing
2
data-profiling
2
big-data-cleaning
2
cudf
2
dask
2
dask-cudf
2
data-cleaner
2
physics
2
data-visualization
2
captcha-solver
2
captcha
2
spark
2
pyspark
2
data-transformation
2
data-preparation
2
data-exploration
2
data-cleaning
2
css-selector-parser
1
api-marketplace
1
css-selector
1
image-scraper
1
download
1
scraper-api
1
web-scraper
1
api-wrapper
1
athena
1
parse
1
machine
1
data-modeling
1
etl
1
amazon
1
amazon-captcha
1
amazon-scraper
1
amazoncaptcha
1
pillow
1
training-data
1
html-extraction
1
cpp
1
dsl
1
node
1
php
1
ruby
1
arachnio
1
arachn.io
1
news-scraping
1
web-scraping-python
1
digitization
1
image-processing
1
promptapi
1
scrape
1
learning
1
AI
1
artificial
1
intelligence
1
ML
1
bbva
1
pdf
1
bank
1
regex
1