Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "data-extraction" keyword
Top 9.7% on pypi.org
4 versions - Latest release: almost 9 years ago - 5 dependent repositories - 23 downloads last month - 500 stars on GitHub - 1 maintainer
libextract 0.0.12
A HT/XML web scraping tool4 versions - Latest release: almost 9 years ago - 5 dependent repositories - 23 downloads last month - 500 stars on GitHub - 1 maintainer
superpipe-py 0.1.8
build unstructured to structured data transformation pipelines7 versions - Latest release: 22 days ago - 238 downloads last month - 98 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
18 versions - Latest release: about 6 years ago - 19 dependent packages - 208 dependent repositories - 1.71 million downloads last month - 5,539 stars on GitHub - 1 maintainer
flashtext 2.7
Extract/Replaces keywords in sentences.18 versions - Latest release: about 6 years ago - 19 dependent packages - 208 dependent repositories - 1.71 million downloads last month - 5,539 stars on GitHub - 1 maintainer
bbva2pandas 1.1.3
Parse BBVA monthly reports directly to a Dataframe6 versions - Latest release: 2 months ago - 1 dependent repositories - 164 downloads last month - 5 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
41 versions - Latest release: 7 months ago - 4 dependent packages - 3 dependent repositories - 20.7 thousand downloads last month - 258 stars on GitHub - 1 maintainer
vnstock 0.2.8 💰
Vietnam Stock Market Data41 versions - Latest release: 7 months ago - 4 dependent packages - 3 dependent repositories - 20.7 thousand downloads last month - 258 stars on GitHub - 1 maintainer
scrappeycom 0.3.8
An API wrapper for Scrappey.com written in Python (cloudflare bypass & solver)11 versions - Latest release: 11 months ago - 75 downloads last month - 10 stars on GitHub - 1 maintainer
Top 3.7% on pypi.org
48 versions - Latest release: about 1 year ago - 3 dependent packages - 19 dependent repositories - 48.6 thousand downloads last month - 409 stars on GitHub - 1 maintainer
amazoncaptcha 0.5.11
"Pure Python, lightweight, Pillow-based solver for the Amazon text captcha."48 versions - Latest release: about 1 year ago - 3 dependent packages - 19 dependent repositories - 48.6 thousand downloads last month - 409 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
32 versions - Latest release: over 1 year ago - 1 dependent repositories - 349 downloads last month - 1,441 stars on GitHub - 2 maintainers
pyoptimus 0.1.0
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion.32 versions - Latest release: over 1 year ago - 1 dependent repositories - 349 downloads last month - 1,441 stars on GitHub - 2 maintainers
vnstock3 0.3.0.1 💰
A comprehensive and transparent solution for Vietnamese stock market analysis.2 versions - Latest release: 5 days ago - 298 downloads last month - 379 stars on GitHub - 1 maintainer
complex-parser 0.0.2
A versatile Python package for data extraction from JSON-like structures with user-defined format...3 versions - Latest release: 2 months ago - 55 downloads last month - 3 stars on GitHub - 1 maintainer
tcx-extract 0.1.2
A speed-optimized tcx data extractor.4 versions - Latest release: 2 months ago - 28 downloads last month - 0 stars on GitHub - 1 maintainer
tap-planetscaleapi 0.1.1 💰
`tap-planetscaleapi` is a Singer tap for PlanetScaleAPI, built with the Meltano Singer SDK.3 versions - Latest release: 3 months ago - 22 downloads last month - 0 stars on GitHub - 1 maintainer
yirabot 1.0.9
YiraBot: Simplifying Web Scraping for All. A user-friendly tool for developers and enthusiasts, o...20 versions - Latest release: 2 months ago - 172 downloads last month - 11 stars on GitHub - 1 maintainer
xtweet 1.0.2
Es una biblioteca que te permite interactuar de manera eficiente con la API de Twitter.3 versions - Latest release: 10 months ago - 21 downloads last month - 2 stars on GitHub - 1 maintainer
arachnio 0.0.0
Client library for interacting with Arachnio API1 version - Latest release: about 1 year ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
zapr-athena-client 0.1
It is a python library to run the presto query on the AWS Athena.1 version - Latest release: about 3 years ago - 1 dependent repositories - 5 downloads last month - 1 stars on GitHub - 1 maintainer
wiktionary-de-parser 0.11.5
Extracts data from German Wiktionary dump files.33 versions - Latest release: 3 months ago - 2 dependent repositories - 2.14 thousand downloads last month - 24 stars on GitHub - 1 maintainer
ricloud 3.2.0
Python client for Reincubate's ricloud API.43 versions - Latest release: about 4 years ago - 2 dependent repositories - 115 downloads last month - 90 stars on GitHub - 2 maintainers
plotdigitizer 0.2.3
Extract raw data from plots images11 versions - Latest release: over 1 year ago - 2 dependent repositories - 334 downloads last month - 100 stars on GitHub - 1 maintainer
pa-scraper 0.2.4
Python wrapper for Prompt API's Scraper API9 versions - Latest release: over 3 years ago - 1 dependent repositories - 43 downloads last month - 5 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
83 versions - Latest release: almost 4 years ago - 8 dependent repositories - 10.7 thousand downloads last month - 1,441 stars on GitHub - 2 maintainers
optimuspyspark 2.2.32
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion wi...83 versions - Latest release: almost 4 years ago - 8 dependent repositories - 10.7 thousand downloads last month - 1,441 stars on GitHub - 2 maintainers
newshound 0.0.1 💰
A future news extractor package for Python 31 version - Latest release: over 2 years ago - 1 dependent repositories - 37 downloads last month - 29 stars on GitHub - 1 maintainer
jsonpath-extractor 0.9.0
A selector expression for extracting data from JSON.16 versions - Latest release: 10 months ago - 1 dependent repositories - 7.21 thousand downloads last month - 36 stars on GitHub - 1 maintainer
inparse 0.1.1
Collaborative AI for Web Scraping, Data Extraction and Crawling,Knowledge Graph2 versions - Latest release: over 5 years ago - 1 dependent repositories - 25 downloads last month - 15 stars on GitHub - 1 maintainer
hivehoney 1.0.4
Client-less data retrieval from Hive.5 versions - Latest release: over 5 years ago - 1 dependent repositories - 18 downloads last month - 3 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
16 versions - Latest release: 6 months ago - 4 dependent repositories - 676 downloads last month - 51 stars on GitHub - 1 maintainer
hext 1.0.8
A module and command-line utility to extract structured data from HTML16 versions - Latest release: 6 months ago - 4 dependent repositories - 676 downloads last month - 51 stars on GitHub - 1 maintainer
filmweb 0.9
Export movie ratings from filmweb.pl8 versions - Latest release: 5 months ago - 1 dependent repositories - 50 downloads last month - 12 stars on GitHub - 1 maintainer
data-extractor 0.10.2
Combine XPath, CSS Selectors and JSONPath for Web data extracting.41 versions - Latest release: over 2 years ago - 1 dependent repositories - 266 downloads last month - 27 stars on GitHub - 1 maintainer
boututils 0.2.1
Python utilities for BOUT++12 versions - Latest release: 7 months ago - 3 dependent packages - 4 dependent repositories - 1.07 thousand downloads last month - 1 stars on GitHub - 4 maintainers
Top 6.5% on pypi.org
12 versions - Latest release: 7 months ago - 1 dependent package - 5 dependent repositories - 1.03 thousand downloads last month - 0 stars on GitHub - 4 maintainers
boutdata 0.2.1
Python package for collecting BOUT++ data12 versions - Latest release: 7 months ago - 1 dependent package - 5 dependent repositories - 1.03 thousand downloads last month - 0 stars on GitHub - 4 maintainers
taupe 1.2.0
Taupe: a tool to extract URLs from your personal Twitter archive4 versions - Latest release: over 1 year ago - 41 downloads last month - 27 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
9 versions - Latest release: 10 months ago - 1 dependent package - 4 dependent repositories - 998 downloads last month - 88 stars on GitHub - 1 maintainer
cyac 1.9
High performance Trie and Ahocorasick automata (AC automata) for python9 versions - Latest release: 10 months ago - 1 dependent package - 4 dependent repositories - 998 downloads last month - 88 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
34 versions - Latest release: 3 months ago - 6 dependent repositories - 844 downloads last month - 117 stars on GitHub - 1 maintainer
sayn 0.6.13
Data-modelling and processing framework for automating Python and SQL tasks34 versions - Latest release: 3 months ago - 6 dependent repositories - 844 downloads last month - 117 stars on GitHub - 1 maintainer
journalpdfscraper 0.2.1
A project to check if articles are free or paid3 versions - Latest release: about 3 years ago - 1 dependent repositories - 6 downloads last month - 1 stars on GitHub - 1 maintainer
Related Keywords
python
9
web-scraping
5
python3
5
data-analysis
5
data-science
4
data-engineering
3
nlp
3
data
3
machine-learning
3
bigdata
3
article-extractor
2
data-cleaning
2
data-cleaner
2
dask-cudf
2
dask
2
cudf
2
big-data-cleaning
2
data-profiling
2
data-cleansing
2
data-wrangling
2
datacleaner
2
sql
2
classification
2
hive
2
twitter
2
text-extraction
2
scraping
2
data-mining
2
elt
2
scraper
2
parser
2
webscraping
2
data-extractor
2
jsonpath
2
json
2
spark
2
pyspark
2
data-transformation
2
data-preparation
2
data-exploration
2
keyword-extraction
2
search-in-text
2
data-visualization
2
physics
2
plasma
2
bout
2
bout++
2
stock-market
2
stock-screener
2
captcha
2
html
2
extract
2
captcha-solver
2
automation
1
python-client
1
digitization
1
image-processing
1
promptapi
1
scrape
1
parse
1
download
1
search
1
api-marketplace
1
api-wrapper
1
css-selector
1
css-selector-parser
1
image-scraper
1
double-array-trie
1
scraper-api
1
web-scraper
1
apachespark
1
article-extracting
1
datascience
1
news
1
dewiktionary
1
german-language
1
analytics
1
wiktionary-dump
1
wiktionary-parser
1
cloudkit
1
german
1
data-recovery
1
icloud
1
xml
1
wiktionary
1
data-modeling
1
icloud-access
1
icloud-api
1
trie
1
etl
1
aws-athena
1
aws
1
athena
1
web-scraping-python
1
extraction
1
poland
1
beeline
1
movies
1
data-integration
1
data-migration
1