Top 0.1% downloads on pypi.org
Top 0.1% dependent packages on pypi.org
Top 0.1% dependent repos on pypi.org
Top 3.7% forks on pypi.org
Top 0.6% docker downloads on pypi.org
pypi.org : smart-open
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
Registry
-
Source
- Documentation
- JSON
purl: pkg:pypi/smart-open
Keywords:
file streaming
, s3
, hdfs
, gcs
, azure blob storage
, boto
, bz2
, file
, gzip-stream
, hacktoberfest
, python
, streaming
, streaming-data
, webhdfs
License: MIT
Latest release: 4 months ago
First release: over 10 years ago
Dependent packages: 248
Dependent repositories: 10,157
Downloads: 46,621,959 last month
Stars: 3,101 on GitHub
Forks: 378 on GitHub
Docker dependents: 1,264
Docker downloads: 986,441,609
Total Commits: 911
Committers: 127
Average commits per author: 7.173
Development Distribution Score (DDS): 0.717
More commit stats: commits.ecosyste.ms
See more repository details: repos.ecosyste.ms
Funding links: https://github.com/sponsors/piskvorky
Last synced: about 2 hours ago
self-healing-driver removed
1 versionastro-sdk-python 1.8.1
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python...51 versions - Latest release: 10 months ago - 2 dependent packages - 7 dependent repositories - 84.7 thousand downloads last month - 335 stars on GitHub - 4 maintainers
xingu 1.7.3
Automated ML model training and packaging49 versions - Latest release: 11 months ago - 788 downloads last month - 3 stars on GitHub - 1 maintainer
weasel 0.4.1
Weasel: A small and easy workflow system13 versions - Latest release: 11 months ago - 5 dependent packages - 2 dependent repositories - 6.94 million downloads last month - 80 stars on GitHub - 1 maintainer
featuretools 1.31.0
a framework for automated feature engineering105 versions - Latest release: 11 months ago - 23 dependent packages - 286 dependent repositories - 88.2 thousand downloads last month - 7,148 stars on GitHub - 8 maintainers
records-mover 1.6.4
Records mover is a command-line tool and Python library you can use to move relational data from ...26 versions - Latest release: 11 months ago - 1 dependent repositories - 1.78 thousand downloads last month - 37 stars on GitHub - 2 maintainers
woodwork 0.31.0
a data typing library for machine learning61 versions - Latest release: 11 months ago - 11 dependent packages - 33 dependent repositories - 61.8 thousand downloads last month - 151 stars on GitHub - 7 maintainers
leya 0.1.5
A coding assistant to help with repository management and code queries.6 versions - Latest release: 12 months ago - 239 downloads last month - 0 stars on GitHub - 1 maintainer
fog-x 0.1.0 💰
An Efficient and Scalable Data Collection and Management Framework For Robotics Learning4 versions - Latest release: 12 months ago - 129 downloads last month - 12 stars on GitHub - 1 maintainer
test-option-deps 3.0.0
Testing optional dependencies4 versions - Latest release: about 1 year ago - 160 downloads last month - 1 maintainer
llm-datasets 0.0.3
A collection of datasets for language model training including scripts for downloading, preproces...1 version - Latest release: about 1 year ago - 71 downloads last month - 56 stars on GitHub - 1 maintainer
tap-filesanywhere 0.0.1
`tap-filesanywhere` is a Singer tap for extracting 'files from anywhere', built with the Meltano ...1 version - Latest release: about 1 year ago - 46 downloads last month - 1 maintainer
hf-tap-filesanywhere 0.0.1 removed
`tap-filesanywhere` is a Singer tap for FilesAnywhere, built with the Meltano Singer SDK.1 version - Latest release: about 1 year ago - 1 maintainer
super-config 0.0.3
Parse bank cheques3 versions - Latest release: about 1 year ago - 77 downloads last month - 1 stars on GitHub - 1 maintainer
computing-toolbox 1.6.11
Computing Toolbox for daily computations61 versions - Latest release: about 1 year ago - 1.22 thousand downloads last month - 1 maintainer
cpkil 0.2.3
CPR Python Package15 versions - Latest release: about 1 year ago - 437 downloads last month - 1 maintainer
s3tethys 0.1.4
Python S3 tools for tethys using smart_open13 versions - Latest release: about 1 year ago - 2 dependent packages - 373 downloads last month - 0 stars on GitHub - 1 maintainer
cloudtik 1.6.0
CloudTik: a cloud scale platform for distributed analytics and AI on public clouds17 versions - Latest release: about 1 year ago - 1 dependent repositories - 1.12 thousand downloads last month - 2 stars on GitHub - 1 maintainer
hammadml-gpu 0.0.14
Hammad Python ~ Machine Learning2 versions - Latest release: about 1 year ago - 101 downloads last month - 2 stars on GitHub - 1 maintainer
playwright-request 1.5.0
Playwright request to make regular request for sites that blocks regular requests like www.amazon...16 versions - Latest release: about 1 year ago - 411 downloads last month - 1 maintainer
board-game-scraper 2.22.0 💰
Board games data scraping and processing from BoardGameGeek and more!58 versions - Latest release: about 1 year ago - 5 dependent repositories - 915 downloads last month - 23 stars on gitlab.com - 1 maintainer
scraperx 0.7.1
ScraperX SDK72 versions - Latest release: about 1 year ago - 1 dependent repositories - 1.54 thousand downloads last month - 53 stars on GitHub - 1 maintainer
pashehnet 0.1.1
Sensor network simulator publishing to MQTT broker3 versions - Latest release: about 1 year ago - 117 downloads last month - 1 stars on GitHub - 1 maintainer
claws 0.0.21
The Crossref Labs AWS tooling system.21 versions - Latest release: over 1 year ago - 4 dependent packages - 850 downloads last month - 1 maintainer
pathy 0.11.0
pathlib.Path subclasses for local and cloud bucket storage30 versions - Latest release: over 1 year ago - 25 dependent packages - 3,362 dependent repositories - 8.94 million downloads last month - 170 stars on GitHub - 1 maintainer
chicken-coop 0.0.5
An environment for reproducing dominance hierarchies in RL agents5 versions - Latest release: over 1 year ago - 216 downloads last month - 1 maintainer
lm-datasets 0.0.2
A collection of datasets for language model training including scripts for downloading, preproces...2 versions - Latest release: over 1 year ago - 71 downloads last month - 56 stars on GitHub - 1 maintainer
machine-learning-with-graph 0.0.3
A comprehensive package for graph-based machine learning algorithms.3 versions - Latest release: over 1 year ago - 159 downloads last month - 2 maintainers
ai-services 0.5.5
"A simple web framework based on Sanic"30 versions - Latest release: over 1 year ago - 1.1 thousand downloads last month - 4 stars on GitHub - 1 maintainer
asfsmd 1.4.1
ASF Sentinel-1 Metadata Download tool4 versions - Latest release: over 1 year ago - 146 downloads last month - 13 stars on GitHub - 1 maintainer
drop-backend 1.1.2
API and Command line tools for building drop8 versions - Latest release: over 1 year ago - 1 dependent package - 160 downloads last month - 1 maintainer
ml-starter 0.1.91 removed
ML project template repository148 versions - Latest release: over 1 year ago - 4 dependent packages - 2 dependent repositories - 595 downloads last month - 9 stars on GitHub - 1 maintainer
elx 0.2.0 💰
A lightweight Python interface for extracting and loading using the Singer.io spec.8 versions - Latest release: over 1 year ago - 329 downloads last month - 3,198 stars on GitHub - 1 maintainer
epochraft 0.1.0.dev20231107
Supercharge Your LLM Training with Checkpointable Data Loading10 versions - Latest release: over 1 year ago - 455 downloads last month - 28 stars on GitHub - 1 maintainer
efemarai 0.4.2
A CLI and SDK for interacting with the Efemarai ML testing platform.35 versions - Latest release: over 1 year ago - 1 dependent repositories - 345 downloads last month - 24 stars on GitHub - 1 maintainer
visionplus-lib 0.0.1 removed
Vision+ Team Data Custom Package1 version - Latest release: over 1 year ago - 1 maintainer
vplus-library 0.0.1 removed
Vision+ Team Data Custom Package1 version - Latest release: over 1 year ago - 1 maintainer
smashed 0.21.5
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields ext...64 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 6.79 thousand downloads last month - 30 stars on GitHub - 2 maintainers
use-mp-tta 1.0.3
init TTA_prepro_Data_method4 versions - Latest release: over 1 year ago - 75 downloads last month - 1 maintainer
use-method-tta-ph 2.0.0.1
init TTA_prepro_Data_method11 versions - Latest release: over 1 year ago - 266 downloads last month - 1 maintainer
dlm-matrix 0.7.10 removed
Divergent Language Matrix25 versions - Latest release: over 1 year ago - 1.34 thousand downloads last month - 0 stars on GitHub - 1 maintainer
quicklab 0.1.3
Start Jupyter Lab sessions on the cloud2 versions - Latest release: almost 2 years ago - 106 downloads last month - 1 maintainer
scale-lidar-io 1.2.5
Lidar data conversion helpers26 versions - Latest release: almost 2 years ago - 1 dependent repositories - 2.24 thousand downloads last month - 2 maintainers
short-poetry 0.1.0
1 version - Latest release: almost 2 years ago - 51 downloads last month - 1 maintainerdioptra 1.0.29
Client library to log data to Dioptra API78 versions - Latest release: almost 2 years ago - 1 dependent repositories - 2.02 thousand downloads last month - 4 stars on GitHub - 1 maintainer
plynx 1.11.1
ML platform57 versions - Latest release: almost 2 years ago - 1 dependent repositories - 845 downloads last month - 309 stars on GitHub - 1 maintainer
jsonl2 1.1.14
Json line library9 versions - Latest release: almost 2 years ago - 385 downloads last month - 1 maintainer
gces-trab1 1.0.0
Biblioteca do trabalho prático de GCES PUC Minas2 versions - Latest release: almost 2 years ago - 96 downloads last month - 1 maintainer
smart-bucket 0.1.0
Bidirectional synchronization between local directory and s3 bucket.1 version - Latest release: almost 2 years ago - 42 downloads last month - 0 stars on GitHub - 1 maintainer
spec-pilot 0.3.4
A command-line tool for generating and managing OpenAPI specifications8 versions - Latest release: almost 2 years ago - 188 downloads last month - 0 stars on GitHub - 1 maintainer
s3dbm 0.0.8
A python dbm-style interface to S38 versions - Latest release: almost 2 years ago - 217 downloads last month - 0 stars on GitHub - 1 maintainer
simple-elmo 0.9.2
Handy library to work with pre-trained ELMo embeddings in TensorFlow18 versions - Latest release: almost 2 years ago - 2 dependent packages - 1 dependent repositories - 832 downloads last month - 52 stars on GitHub - 1 maintainer
uphill 0.1.2
make data preparation more friendly3 versions - Latest release: almost 2 years ago - 104 downloads last month - 18 stars on GitHub - 1 maintainer
s7r 0.1.9
An easy to use job runner.9 versions - Latest release: almost 2 years ago - 273 downloads last month - 0 stars on GitHub - 1 maintainer
paaaaath 0.2.7
a useful alternative Path object12 versions - Latest release: about 2 years ago - 1 dependent repositories - 436 downloads last month - 5 stars on GitHub - 1 maintainer
vocably 0.0.9
Vocably is a Natural Language Framework written in Python for Language based Tasks.7 versions - Latest release: about 2 years ago - 273 downloads last month - 2 stars on GitHub - 1 maintainer
codefast 0.9.29
A package for faster coding.194 versions - Latest release: about 2 years ago - 28 dependent packages - 2 dependent repositories - 1.26 thousand downloads last month - 1 maintainer
nlp-cryptography 0.1.0 removed
NLP Cryptography1 version - Latest release: about 2 years ago - 1 maintainer
apache-airflow-provider-transfers 0.1.0
This project contains the Universal Transfer Operator which can transfer all the data that could ...1 version - Latest release: about 2 years ago - 81 downloads last month - 335 stars on GitHub - 1 maintainer
sitefab 1.2.1679848743
State of the art static website generator for humans31 versions - Latest release: about 2 years ago - 1 dependent repositories - 862 downloads last month - 12 stars on GitHub - 1 maintainer
cloud-dataplug 0.0.2
Pluggable data types for cloud-native scientific workloads2 versions - Latest release: about 2 years ago - 17 downloads last month - 3 stars on GitHub - 1 maintainer
doesnt-git-easier 0.0.4
Making it easy to read and write files to Git in a Pythonic way with context managers and the Git...4 versions - Latest release: about 2 years ago - 177 downloads last month - 0 stars on GitHub - 1 maintainer
dofast 1.2.18
214 versions - Latest release: about 2 years ago - 1 dependent package - 38 downloads last month - 1 maintainerhygia 0.2.2
A short description of the package.14 versions - Latest release: about 2 years ago - 543 downloads last month - 4 stars on GitHub - 1 maintainer
s3head 0.2.0
head command for AWS S3 objects3 versions - Latest release: about 2 years ago - 133 downloads last month - 0 stars on GitHub - 1 maintainer
m4-utils 0.0.15
Biblioteca com funções de uso comum em projetos de aprendizado de máquina e ciencia de dados.17 versions - Latest release: about 2 years ago - 713 downloads last month - 1 maintainer
ethereum-tools 0.1.6
High-level tools and library to interact with Ethereum7 versions - Latest release: about 2 years ago - 1 dependent repositories - 542 downloads last month - 4 stars on GitHub - 1 maintainer
trabalho-individual-2022-2-jvsdurso 0.1.1
Dependências deste projeto.2 versions - Latest release: about 2 years ago - 95 downloads last month - 1 stars on GitHub - 1 maintainer
gces-kevin-180042386 0.4.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/13 versions - Latest release: about 2 years ago - 125 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-2022-2-wesleysantos00 0.1.0
Sistema que trata de uma biblioteca python para executar pipelines de dados de forma customizável...1 version - Latest release: about 2 years ago - 39 downloads last month - 6 stars on GitHub - 1 maintainer
interbase-code 0.1.2
Projeto GCES3 versions - Latest release: about 2 years ago - 154 downloads last month - 6 stars on GitHub - 1 maintainer
gcs-2022-2-trabalho-individual-yc427 0.1.2
Sistema que trata de uma biblioteca python para executar pipelines de dados de forma customizável...3 versions - Latest release: about 2 years ago - 138 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-de-gces 0.1.5
Enunciado e código fonte do Trabalho Individual de GCES 2021/15 versions - Latest release: about 2 years ago - 178 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-2022-2-170105342 1.0.5
Enunciado e código fonte do Trabalho Individual de GCES 2021/12 versions - Latest release: about 2 years ago - 61 downloads last month - 6 stars on GitHub - 1 maintainer
gces-denysrogeres 1.0.0
Trabalho desenvolvido na matéria de Gestão Configuração e Evolução de Software da Universidade de...3 versions - Latest release: about 2 years ago - 124 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-gces-2022-2-luisglins 0.1.3
Enunciado e código fonte do Trabalho Individual de GCES 2021/14 versions - Latest release: about 2 years ago - 155 downloads last month - 6 stars on GitHub - 1 maintainer
teste-gces 0.2.7
Trabalho desenvolvido na matéria de Gestão Configuração e Evolução de Software da Universidade de...7 versions - Latest release: about 2 years ago - 363 downloads last month - 6 stars on GitHub - 1 maintainer
application-gces-2-2022-douglasmonteles 0.2.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/12 versions - Latest release: about 2 years ago - 107 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-2022-2 0.0.16
Enunciado e código fonte do Trabalho Individual de GCES 2021/14 versions - Latest release: about 2 years ago - 175 downloads last month - 6 stars on GitHub - 1 maintainer
2022-2-gces-ifpf 0.3.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/12 versions - Latest release: about 2 years ago - 96 downloads last month - 6 stars on GitHub - 1 maintainer
eduardo-gces-poetry 0.1.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/11 version - Latest release: about 2 years ago - 52 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-2022-2-gces 0.1.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/16 versions - Latest release: about 2 years ago - 213 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-final-gces-erick-levy 0.1.2
Trabalho final da materia gces3 versions - Latest release: about 2 years ago - 120 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-gces-2022 0.1.8
trabalho-individual-2022-GCES5 versions - Latest release: about 2 years ago - 142 downloads last month - 6 stars on GitHub - 1 maintainer
170051277-trab-final-gces 0.5.0
Pacote utilizado para o deploy do trabalho final da disciplina Gerência de Configuração e Evoluçã...3 versions - Latest release: about 2 years ago - 203 downloads last month - 0 stars on GitHub - 1 maintainer
poetry-190048221-rodrigo 0.1.0
1 version - Latest release: about 2 years ago - 51 downloads last month - 1 maintainertrabalho-gces 0.3.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/13 versions - Latest release: about 2 years ago - 115 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-2022-2-lameque 0.1.5
Enunciado e código fonte do Trabalho Individual de GCES 2021/19 versions - Latest release: about 2 years ago - 345 downloads last month - 6 stars on GitHub - 1 maintainer
gces-bib 1.0.0
Pacote de dependências Python do projeto.3 versions - Latest release: about 2 years ago - 132 downloads last month - 1 maintainer
poetry-docs 0.1.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/16 versions - Latest release: about 2 years ago - 260 downloads last month - 6 stars on GitHub - 1 maintainer
gces-poetry 0.1.2
3 versions - Latest release: about 2 years ago - 79 downloads last month - 1 maintainer170051277-pypi-package 0.2.0 removed
Lorem ipsum dolor sit amet2 versions - Latest release: about 2 years ago
trabalho-individual-gces-thiagooliveira 0.1.0
Trabalho individual de GCES 2022/2 de Thiago França1 version - Latest release: about 2 years ago - 58 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-poetry 0.1.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/11 version - Latest release: about 2 years ago - 52 downloads last month - 6 stars on GitHub - 1 maintainer
trabalho-individual-gces-170051277 0.1.0 removed
Lorem ipsum dolor sit amet1 version - Latest release: about 2 years ago
trabalho-individual-2022-2-adrian-160000572 0.1.2
Lib to manage metadata of databases to be consumed by data analysis tools2 versions - Latest release: about 2 years ago - 50 downloads last month - 6 stars on GitHub - 1 maintainer
poetry-vital14 0.1.0
1 version - Latest release: about 2 years ago - 58 downloads last month - 1 maintainertrabalho-individual-de-gces-2022-2 0.1.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/11 version - Latest release: about 2 years ago - 35 downloads last month - 6 stars on GitHub - 1 maintainer
gces-teste 0.10.0
Enunciado e código fonte do Trabalho Individual de GCES 2021/110 versions - Latest release: about 2 years ago - 275 downloads last month - 6 stars on GitHub - 1 maintainer
yaml-parser-gces-italo 0.4.0
A biblioteca desenvolvida auxilia desenvolvedores a explorar os dados com funções essenciais para...3 versions - Latest release: about 2 years ago - 93 downloads last month - 6 stars on GitHub - 1 maintainer
Check this option to include packages that no longer depend on this package in their latest version but previously did.