Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data-extraction" keyword

Top 9.7% on pypi.org
libextract 0.0.12
A HT/XML web scraping tool
4 versions - Latest release: almost 9 years ago - 5 dependent repositories - 23 downloads last month - 500 stars on GitHub - 1 maintainer
superpipe-py 0.1.8
build unstructured to structured data transformation pipelines
7 versions - Latest release: 22 days ago - 238 downloads last month - 98 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
flashtext 2.7
Extract/Replaces keywords in sentences.
18 versions - Latest release: about 6 years ago - 19 dependent packages - 208 dependent repositories - 1.71 million downloads last month - 5,539 stars on GitHub - 1 maintainer
bbva2pandas 1.1.3
Parse BBVA monthly reports directly to a Dataframe
6 versions - Latest release: 2 months ago - 1 dependent repositories - 164 downloads last month - 5 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
vnstock 0.2.8 💰
Vietnam Stock Market Data
41 versions - Latest release: 7 months ago - 4 dependent packages - 3 dependent repositories - 20.7 thousand downloads last month - 258 stars on GitHub - 1 maintainer
scrappeycom 0.3.8
An API wrapper for Scrappey.com written in Python (cloudflare bypass & solver)
11 versions - Latest release: 11 months ago - 75 downloads last month - 10 stars on GitHub - 1 maintainer
Top 3.7% on pypi.org
amazoncaptcha 0.5.11
"Pure Python, lightweight, Pillow-based solver for the Amazon text captcha."
48 versions - Latest release: about 1 year ago - 3 dependent packages - 19 dependent repositories - 48.6 thousand downloads last month - 409 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
pyoptimus 0.1.0
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion.
32 versions - Latest release: over 1 year ago - 1 dependent repositories - 349 downloads last month - 1,441 stars on GitHub - 2 maintainers
vnstock3 0.3.0.1 💰
A comprehensive and transparent solution for Vietnamese stock market analysis.
2 versions - Latest release: 5 days ago - 298 downloads last month - 379 stars on GitHub - 1 maintainer
complex-parser 0.0.2
A versatile Python package for data extraction from JSON-like structures with user-defined format...
3 versions - Latest release: 2 months ago - 55 downloads last month - 3 stars on GitHub - 1 maintainer
tcx-extract 0.1.2
A speed-optimized tcx data extractor.
4 versions - Latest release: 2 months ago - 28 downloads last month - 0 stars on GitHub - 1 maintainer
tap-planetscaleapi 0.1.1 💰
`tap-planetscaleapi` is a Singer tap for PlanetScaleAPI, built with the Meltano Singer SDK.
3 versions - Latest release: 3 months ago - 22 downloads last month - 0 stars on GitHub - 1 maintainer
yirabot 1.0.9
YiraBot: Simplifying Web Scraping for All. A user-friendly tool for developers and enthusiasts, o...
20 versions - Latest release: 2 months ago - 172 downloads last month - 11 stars on GitHub - 1 maintainer
xtweet 1.0.2
Es una biblioteca que te permite interactuar de manera eficiente con la API de Twitter.
3 versions - Latest release: 10 months ago - 21 downloads last month - 2 stars on GitHub - 1 maintainer
arachnio 0.0.0
Client library for interacting with Arachnio API
1 version - Latest release: about 1 year ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
zapr-athena-client 0.1
It is a python library to run the presto query on the AWS Athena.
1 version - Latest release: about 3 years ago - 1 dependent repositories - 5 downloads last month - 1 stars on GitHub - 1 maintainer
wiktionary-de-parser 0.11.5
Extracts data from German Wiktionary dump files.
33 versions - Latest release: 3 months ago - 2 dependent repositories - 2.14 thousand downloads last month - 24 stars on GitHub - 1 maintainer
ricloud 3.2.0
Python client for Reincubate's ricloud API.
43 versions - Latest release: about 4 years ago - 2 dependent repositories - 115 downloads last month - 90 stars on GitHub - 2 maintainers
plotdigitizer 0.2.3
Extract raw data from plots images
11 versions - Latest release: over 1 year ago - 2 dependent repositories - 334 downloads last month - 100 stars on GitHub - 1 maintainer
pa-scraper 0.2.4
Python wrapper for Prompt API's Scraper API
9 versions - Latest release: over 3 years ago - 1 dependent repositories - 43 downloads last month - 5 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
optimuspyspark 2.2.32
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion wi...
83 versions - Latest release: almost 4 years ago - 8 dependent repositories - 10.7 thousand downloads last month - 1,441 stars on GitHub - 2 maintainers
newshound 0.0.1 💰
A future news extractor package for Python 3
1 version - Latest release: over 2 years ago - 1 dependent repositories - 37 downloads last month - 29 stars on GitHub - 1 maintainer
jsonpath-extractor 0.9.0
A selector expression for extracting data from JSON.
16 versions - Latest release: 10 months ago - 1 dependent repositories - 7.21 thousand downloads last month - 36 stars on GitHub - 1 maintainer
inparse 0.1.1
Collaborative AI for Web Scraping, Data Extraction and Crawling,Knowledge Graph
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 25 downloads last month - 15 stars on GitHub - 1 maintainer
hivehoney 1.0.4
Client-less data retrieval from Hive.
5 versions - Latest release: over 5 years ago - 1 dependent repositories - 18 downloads last month - 3 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
hext 1.0.8
A module and command-line utility to extract structured data from HTML
16 versions - Latest release: 6 months ago - 4 dependent repositories - 676 downloads last month - 51 stars on GitHub - 1 maintainer
filmweb 0.9
Export movie ratings from filmweb.pl
8 versions - Latest release: 5 months ago - 1 dependent repositories - 50 downloads last month - 12 stars on GitHub - 1 maintainer
data-extractor 0.10.2
Combine XPath, CSS Selectors and JSONPath for Web data extracting.
41 versions - Latest release: over 2 years ago - 1 dependent repositories - 266 downloads last month - 27 stars on GitHub - 1 maintainer
boututils 0.2.1
Python utilities for BOUT++
12 versions - Latest release: 7 months ago - 3 dependent packages - 4 dependent repositories - 1.07 thousand downloads last month - 1 stars on GitHub - 4 maintainers
Top 6.5% on pypi.org
boutdata 0.2.1
Python package for collecting BOUT++ data
12 versions - Latest release: 7 months ago - 1 dependent package - 5 dependent repositories - 1.03 thousand downloads last month - 0 stars on GitHub - 4 maintainers
taupe 1.2.0
Taupe: a tool to extract URLs from your personal Twitter archive
4 versions - Latest release: over 1 year ago - 41 downloads last month - 27 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
cyac 1.9
High performance Trie and Ahocorasick automata (AC automata) for python
9 versions - Latest release: 10 months ago - 1 dependent package - 4 dependent repositories - 998 downloads last month - 88 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
sayn 0.6.13
Data-modelling and processing framework for automating Python and SQL tasks
34 versions - Latest release: 3 months ago - 6 dependent repositories - 844 downloads last month - 117 stars on GitHub - 1 maintainer
journalpdfscraper 0.2.1
A project to check if articles are free or paid
3 versions - Latest release: about 3 years ago - 1 dependent repositories - 6 downloads last month - 1 stars on GitHub - 1 maintainer