An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

Top 1.1% on pypi.org
Top 0.1% downloads on pypi.org
Top 0.1% dependent packages on pypi.org
Top 0.1% dependent repos on pypi.org
Top 3.7% forks on pypi.org
Top 0.6% docker downloads on pypi.org

pypi.org : smart-open

Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)

Registry - Source - Documentation - JSON
purl: pkg:pypi/smart-open
Keywords: file streaming , s3 , hdfs , gcs , azure blob storage , boto , bz2 , file , gzip-stream , hacktoberfest , python , streaming , streaming-data , webhdfs
License: MIT
Latest release: 4 months ago
First release: over 10 years ago
Dependent packages: 248
Dependent repositories: 10,157
Downloads: 46,621,959 last month
Stars: 3,101 on GitHub
Forks: 378 on GitHub
Docker dependents: 1,264
Docker downloads: 986,441,609
Total Commits: 911
Committers: 127
Average commits per author: 7.173
Development Distribution Score (DDS): 0.717
More commit stats: commits.ecosyste.ms
See more repository details: repos.ecosyste.ms
Funding links: https://github.com/sponsors/piskvorky
Last synced: about 2 hours ago

self-healing-driver removed
1 version
Top 4.1% on pypi.org
astro-sdk-python 1.8.1
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python...
51 versions - Latest release: 10 months ago - 2 dependent packages - 7 dependent repositories - 84.7 thousand downloads last month - 335 stars on GitHub - 4 maintainers
xingu 1.7.3
Automated ML model training and packaging
49 versions - Latest release: 11 months ago - 788 downloads last month - 3 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
weasel 0.4.1
Weasel: A small and easy workflow system
13 versions - Latest release: 11 months ago - 5 dependent packages - 2 dependent repositories - 6.94 million downloads last month - 80 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
featuretools 1.31.0
a framework for automated feature engineering
105 versions - Latest release: 11 months ago - 23 dependent packages - 286 dependent repositories - 88.2 thousand downloads last month - 7,148 stars on GitHub - 8 maintainers
records-mover 1.6.4
Records mover is a command-line tool and Python library you can use to move relational data from ...
26 versions - Latest release: 11 months ago - 1 dependent repositories - 1.78 thousand downloads last month - 37 stars on GitHub - 2 maintainers
Top 3.5% on pypi.org
woodwork 0.31.0
a data typing library for machine learning
61 versions - Latest release: 11 months ago - 11 dependent packages - 33 dependent repositories - 61.8 thousand downloads last month - 151 stars on GitHub - 7 maintainers
leya 0.1.5
A coding assistant to help with repository management and code queries.
6 versions - Latest release: 12 months ago - 239 downloads last month - 0 stars on GitHub - 1 maintainer
fog-x 0.1.0 💰
An Efficient and Scalable Data Collection and Management Framework For Robotics Learning
4 versions - Latest release: 12 months ago - 129 downloads last month - 12 stars on GitHub - 1 maintainer
test-option-deps 3.0.0
Testing optional dependencies
4 versions - Latest release: about 1 year ago - 160 downloads last month - 1 maintainer
llm-datasets 0.0.3
A collection of datasets for language model training including scripts for downloading, preproces...
1 version - Latest release: about 1 year ago - 71 downloads last month - 56 stars on GitHub - 1 maintainer
tap-filesanywhere 0.0.1
`tap-filesanywhere` is a Singer tap for extracting 'files from anywhere', built with the Meltano ...
1 version - Latest release: about 1 year ago - 46 downloads last month - 1 maintainer
hf-tap-filesanywhere 0.0.1 removed
`tap-filesanywhere` is a Singer tap for FilesAnywhere, built with the Meltano Singer SDK.
1 version - Latest release: about 1 year ago - 1 maintainer
super-config 0.0.3
Parse bank cheques
3 versions - Latest release: about 1 year ago - 77 downloads last month - 1 stars on GitHub - 1 maintainer
computing-toolbox 1.6.11
Computing Toolbox for daily computations
61 versions - Latest release: about 1 year ago - 1.22 thousand downloads last month - 1 maintainer
cpkil 0.2.3
CPR Python Package
15 versions - Latest release: about 1 year ago - 437 downloads last month - 1 maintainer
s3tethys 0.1.4
Python S3 tools for tethys using smart_open
13 versions - Latest release: about 1 year ago - 2 dependent packages - 373 downloads last month - 0 stars on GitHub - 1 maintainer
cloudtik 1.6.0
CloudTik: a cloud scale platform for distributed analytics and AI on public clouds
17 versions - Latest release: about 1 year ago - 1 dependent repositories - 1.12 thousand downloads last month - 2 stars on GitHub - 1 maintainer
hammadml-gpu 0.0.14
Hammad Python ~ Machine Learning
2 versions - Latest release: about 1 year ago - 101 downloads last month - 2 stars on GitHub - 1 maintainer
playwright-request 1.5.0
Playwright request to make regular request for sites that blocks regular requests like www.amazon...
16 versions - Latest release: about 1 year ago - 411 downloads last month - 1 maintainer
board-game-scraper 2.22.0 💰
Board games data scraping and processing from BoardGameGeek and more!
58 versions - Latest release: about 1 year ago - 5 dependent repositories - 915 downloads last month - 23 stars on gitlab.com - 1 maintainer
scraperx 0.7.1
ScraperX SDK
72 versions - Latest release: about 1 year ago - 1 dependent repositories - 1.54 thousand downloads last month - 53 stars on GitHub - 1 maintainer
pashehnet 0.1.1
Sensor network simulator publishing to MQTT broker
3 versions - Latest release: about 1 year ago - 117 downloads last month - 1 stars on GitHub - 1 maintainer
claws 0.0.21
The Crossref Labs AWS tooling system.
21 versions - Latest release: over 1 year ago - 4 dependent packages - 850 downloads last month - 1 maintainer
Top 2.5% on pypi.org
pathy 0.11.0
pathlib.Path subclasses for local and cloud bucket storage
30 versions - Latest release: over 1 year ago - 25 dependent packages - 3,362 dependent repositories - 8.94 million downloads last month - 170 stars on GitHub - 1 maintainer
chicken-coop 0.0.5
An environment for reproducing dominance hierarchies in RL agents
5 versions - Latest release: over 1 year ago - 216 downloads last month - 1 maintainer
lm-datasets 0.0.2
A collection of datasets for language model training including scripts for downloading, preproces...
2 versions - Latest release: over 1 year ago - 71 downloads last month - 56 stars on GitHub - 1 maintainer
machine-learning-with-graph 0.0.3
A comprehensive package for graph-based machine learning algorithms.
3 versions - Latest release: over 1 year ago - 159 downloads last month - 2 maintainers
ai-services 0.5.5
"A simple web framework based on Sanic"
30 versions - Latest release: over 1 year ago - 1.1 thousand downloads last month - 4 stars on GitHub - 1 maintainer
asfsmd 1.4.1
ASF Sentinel-1 Metadata Download tool
4 versions - Latest release: over 1 year ago - 146 downloads last month - 13 stars on GitHub - 1 maintainer
drop-backend 1.1.2
API and Command line tools for building drop
8 versions - Latest release: over 1 year ago - 1 dependent package - 160 downloads last month - 1 maintainer
ml-starter 0.1.91 removed
ML project template repository
148 versions - Latest release: over 1 year ago - 4 dependent packages - 2 dependent repositories - 595 downloads last month - 9 stars on GitHub - 1 maintainer
elx 0.2.0 💰
A lightweight Python interface for extracting and loading using the Singer.io spec.
8 versions - Latest release: over 1 year ago - 329 downloads last month - 3,198 stars on GitHub - 1 maintainer
epochraft 0.1.0.dev20231107
Supercharge Your LLM Training with Checkpointable Data Loading
10 versions - Latest release: over 1 year ago - 455 downloads last month - 28 stars on GitHub - 1 maintainer
efemarai 0.4.2
A CLI and SDK for interacting with the Efemarai ML testing platform.
35 versions - Latest release: over 1 year ago - 1 dependent repositories - 345 downloads last month - 24 stars on GitHub - 1 maintainer
visionplus-lib 0.0.1 removed
Vision+ Team Data Custom Package
1 version - Latest release: over 1 year ago - 1 maintainer
vplus-library 0.0.1 removed
Vision+ Team Data Custom Package
1 version - Latest release: over 1 year ago - 1 maintainer
smashed 0.21.5
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields ext...
64 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 6.79 thousand downloads last month - 30 stars on GitHub - 2 maintainers
use-mp-tta 1.0.3
init TTA_prepro_Data_method
4 versions - Latest release: over 1 year ago - 75 downloads last month - 1 maintainer
use-method-tta-ph 2.0.0.1
init TTA_prepro_Data_method
11 versions - Latest release: over 1 year ago - 266 downloads last month - 1 maintainer
dlm-matrix 0.7.10 removed
Divergent Language Matrix
25 versions - Latest release: over 1 year ago - 1.34 thousand downloads last month - 0 stars on GitHub - 1 maintainer
quicklab 0.1.3
Start Jupyter Lab sessions on the cloud
2 versions - Latest release: almost 2 years ago - 106 downloads last month - 1 maintainer
scale-lidar-io 1.2.5
Lidar data conversion helpers
26 versions - Latest release: almost 2 years ago - 1 dependent repositories - 2.24 thousand downloads last month - 2 maintainers
short-poetry 0.1.0
1 version - Latest release: almost 2 years ago - 51 downloads last month - 1 maintainer
dioptra 1.0.29
Client library to log data to Dioptra API
78 versions - Latest release: almost 2 years ago - 1 dependent repositories - 2.02 thousand downloads last month - 4 stars on GitHub - 1 maintainer
plynx 1.11.1
ML platform
57 versions - Latest release: almost 2 years ago - 1 dependent repositories - 845 downloads last month - 309 stars on GitHub - 1 maintainer
jsonl2 1.1.14
Json line library
9 versions - Latest release: almost 2 years ago - 385 downloads last month - 1 maintainer
gces-trab1 1.0.0
Biblioteca do trabalho prático de GCES PUC Minas
2 versions - Latest release: almost 2 years ago - 96 downloads last month - 1 maintainer
smart-bucket 0.1.0
Bidirectional synchronization between local directory and s3 bucket.
1 version - Latest release: almost 2 years ago - 42 downloads last month - 0 stars on GitHub - 1 maintainer
spec-pilot 0.3.4
A command-line tool for generating and managing OpenAPI specifications
8 versions - Latest release: almost 2 years ago - 188 downloads last month - 0 stars on GitHub - 1 maintainer
s3dbm 0.0.8
A python dbm-style interface to S3
8 versions - Latest release: almost 2 years ago - 217 downloads last month - 0 stars on GitHub - 1 maintainer
simple-elmo 0.9.2
Handy library to work with pre-trained ELMo embeddings in TensorFlow
18 versions - Latest release: almost 2 years ago - 2 dependent packages - 1 dependent repositories - 832 downloads last month - 52 stars on GitHub - 1 maintainer
uphill 0.1.2
make data preparation more friendly
3 versions - Latest release: almost 2 years ago - 104 downloads last month - 18 stars on GitHub - 1 maintainer
s7r 0.1.9
An easy to use job runner.
9 versions - Latest release: almost 2 years ago - 273 downloads last month - 0 stars on GitHub - 1 maintainer
paaaaath 0.2.7
a useful alternative Path object
12 versions - Latest release: about 2 years ago - 1 dependent repositories - 436 downloads last month - 5 stars on GitHub - 1 maintainer
vocably 0.0.9
Vocably is a Natural Language Framework written in Python for Language based Tasks.
7 versions - Latest release: about 2 years ago - 273 downloads last month - 2 stars on GitHub - 1 maintainer
Top 5.7% on pypi.org
codefast 0.9.29
A package for faster coding.
194 versions - Latest release: about 2 years ago - 28 dependent packages - 2 dependent repositories - 1.26 thousand downloads last month - 1 maintainer
nlp-cryptography 0.1.0 removed
NLP Cryptography
1 version - Latest release: about 2 years ago - 1 maintainer
apache-airflow-provider-transfers 0.1.0
This project contains the Universal Transfer Operator which can transfer all the data that could ...
1 version - Latest release: about 2 years ago - 81 downloads last month - 335 stars on GitHub - 1 maintainer
sitefab 1.2.1679848743
State of the art static website generator for humans
31 versions - Latest release: about 2 years ago - 1 dependent repositories - 862 downloads last month - 12 stars on GitHub - 1 maintainer
cloud-dataplug 0.0.2
Pluggable data types for cloud-native scientific workloads
2 versions - Latest release: about 2 years ago - 17 downloads last month - 3 stars on GitHub - 1 maintainer
doesnt-git-easier 0.0.4
Making it easy to read and write files to Git in a Pythonic way with context managers and the Git...
4 versions - Latest release: about 2 years ago - 177 downloads last month - 0 stars on GitHub - 1 maintainer
dofast 1.2.18
214 versions - Latest release: about 2 years ago - 1 dependent package - 38 downloads last month - 1 maintainer
hygia 0.2.2
A short description of the package.
14 versions - Latest release: about 2 years ago - 543 downloads last month - 4 stars on GitHub - 1 maintainer
s3head 0.2.0
head command for AWS S3 objects
3 versions - Latest release: about 2 years ago - 133 downloads last month - 0 stars on GitHub - 1 maintainer
m4-utils 0.0.15
Biblioteca com funções de uso comum em projetos de aprendizado de máquina e ciencia de dados.
17 versions - Latest release: about 2 years ago - 713 downloads last month - 1 maintainer
ethereum-tools 0.1.6
High-level tools and library to interact with Ethereum
7 versions - Latest release: about 2 years ago - 1 dependent repositories - 542 downloads last month - 4 stars on GitHub - 1 maintainer
trabalho-individual-2022-2-jvsdurso 0.1.1
Dependências deste projeto.
2 versions - Latest release: about 2 years ago - 95 downloads last month - 1 stars on GitHub - 1 maintainer
gces-kevin-180042386 0.4.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
3 versions - Latest release: about 2 years ago - 125 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-2022-2-wesleysantos00 0.1.0
Sistema que trata de uma biblioteca python para executar pipelines de dados de forma customizável...
1 version - Latest release: about 2 years ago - 39 downloads last month - 6 stars on GitHub - 1 maintainer
interbase-code 0.1.2
Projeto GCES
3 versions - Latest release: about 2 years ago - 154 downloads last month - 6 stars on GitHub - 1 maintainer
gcs-2022-2-trabalho-individual-yc427 0.1.2
Sistema que trata de uma biblioteca python para executar pipelines de dados de forma customizável...
3 versions - Latest release: about 2 years ago - 138 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-de-gces 0.1.5
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
5 versions - Latest release: about 2 years ago - 178 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-2022-2-170105342 1.0.5
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
2 versions - Latest release: about 2 years ago - 61 downloads last month - 6 stars on GitHub - 1 maintainer
gces-denysrogeres 1.0.0
Trabalho desenvolvido na matéria de Gestão Configuração e Evolução de Software da Universidade de...
3 versions - Latest release: about 2 years ago - 124 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-gces-2022-2-luisglins 0.1.3
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
4 versions - Latest release: about 2 years ago - 155 downloads last month - 6 stars on GitHub - 1 maintainer
teste-gces 0.2.7
Trabalho desenvolvido na matéria de Gestão Configuração e Evolução de Software da Universidade de...
7 versions - Latest release: about 2 years ago - 363 downloads last month - 6 stars on GitHub - 1 maintainer
application-gces-2-2022-douglasmonteles 0.2.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
2 versions - Latest release: about 2 years ago - 107 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-2022-2 0.0.16
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
4 versions - Latest release: about 2 years ago - 175 downloads last month - 6 stars on GitHub - 1 maintainer
2022-2-gces-ifpf 0.3.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
2 versions - Latest release: about 2 years ago - 96 downloads last month - 6 stars on GitHub - 1 maintainer
eduardo-gces-poetry 0.1.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
1 version - Latest release: about 2 years ago - 52 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-2022-2-gces 0.1.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
6 versions - Latest release: about 2 years ago - 213 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-final-gces-erick-levy 0.1.2
Trabalho final da materia gces
3 versions - Latest release: about 2 years ago - 120 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-gces-2022 0.1.8
trabalho-individual-2022-GCES
5 versions - Latest release: about 2 years ago - 142 downloads last month - 6 stars on GitHub - 1 maintainer
170051277-trab-final-gces 0.5.0
Pacote utilizado para o deploy do trabalho final da disciplina Gerência de Configuração e Evoluçã...
3 versions - Latest release: about 2 years ago - 203 downloads last month - 0 stars on GitHub - 1 maintainer
poetry-190048221-rodrigo 0.1.0
1 version - Latest release: about 2 years ago - 51 downloads last month - 1 maintainer
trabalho-gces 0.3.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
3 versions - Latest release: about 2 years ago - 115 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-2022-2-lameque 0.1.5
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
9 versions - Latest release: about 2 years ago - 345 downloads last month - 6 stars on GitHub - 1 maintainer
gces-bib 1.0.0
Pacote de dependências Python do projeto.
3 versions - Latest release: about 2 years ago - 132 downloads last month - 1 maintainer
poetry-docs 0.1.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
6 versions - Latest release: about 2 years ago - 260 downloads last month - 6 stars on GitHub - 1 maintainer
gces-poetry 0.1.2
3 versions - Latest release: about 2 years ago - 79 downloads last month - 1 maintainer
170051277-pypi-package 0.2.0 removed
Lorem ipsum dolor sit amet
2 versions - Latest release: about 2 years ago
trabalho-individual-gces-thiagooliveira 0.1.0
Trabalho individual de GCES 2022/2 de Thiago França
1 version - Latest release: about 2 years ago - 58 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-poetry 0.1.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
1 version - Latest release: about 2 years ago - 52 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-gces-170051277 0.1.0 removed
Lorem ipsum dolor sit amet
1 version - Latest release: about 2 years ago
trabalho-individual-2022-2-adrian-160000572 0.1.2
Lib to manage metadata of databases to be consumed by data analysis tools
2 versions - Latest release: about 2 years ago - 50 downloads last month - 6 stars on GitHub - 1 maintainer
poetry-vital14 0.1.0
1 version - Latest release: about 2 years ago - 58 downloads last month - 1 maintainer
trabalho-individual-de-gces-2022-2 0.1.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
1 version - Latest release: about 2 years ago - 35 downloads last month - 6 stars on GitHub - 1 maintainer
gces-teste 0.10.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/1
10 versions - Latest release: about 2 years ago - 275 downloads last month - 6 stars on GitHub - 1 maintainer
yaml-parser-gces-italo 0.4.0
A biblioteca desenvolvida auxilia desenvolvedores a explorar os dados com funções essenciais para...
3 versions - Latest release: about 2 years ago - 93 downloads last month - 6 stars on GitHub - 1 maintainer