Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org "data-generation" keyword

Top 1.8% on pypi.org
sdv 1.13.1
Generate synthetic data for single table, multi table and sequential data
133 versions - Latest release: 5 days ago - 25 dependent packages - 36 dependent repositories - 48.7 thousand downloads last month - 2,161 stars on GitHub - 9 maintainers
Top 7.7% on pypi.org
pydbgen 1.0.5
Random database/dataframe generator
1 version - Latest release: about 6 years ago - 12 dependent repositories - 369 downloads last month - 292 stars on GitHub - 1 maintainer
Top 5.7% on pypi.org
dbldatagen 0.3.6
Databricks Labs - PySpark Synthetic Data Generator
13 versions - Latest release: 3 months ago - 1 dependent package - 2 dependent repositories - 173 thousand downloads last month - 268 stars on GitHub - 1 maintainer
sparkdantic 0.20.5
A pydantic -> spark schema library
30 versions - Latest release: 2 months ago - 32.9 thousand downloads last month - 268 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
be-great 0.0.7
Generating Realistic Tabular Data using Large Language Models
7 versions - Latest release: 9 months ago - 2 dependent packages - 1 dependent repository - 2.5 thousand downloads last month - 235 stars on GitHub - 2 maintainers
absynthe 0.0.3
A (branching) Behaviour Synthesizer
3 versions - Latest release: almost 5 years ago - 1 dependent repository - 73 downloads last month - 8 stars on GitHub - 1 maintainer
openmixup 0.2.9
Mixup for Supervision, Semi- and Self-Supervision Learning Toolbox and Benchmark
2 versions - Latest release: 2 months ago - 32 downloads last month - 499 stars on GitHub - 1 maintainer
sdgne 4.0.0
Synthetic Data Generation and Evaluation
7 versions - Latest release: 3 months ago - 40 downloads last month - 1 star on GitHub - 1 maintainer
Top 1.9% on pypi.org
ctgan 0.10.1
Create tabular synthetic data using a conditional GAN
68 versions - Latest release: 8 days ago - 13 dependent packages - 29 dependent repositories - 99 thousand downloads last month - 1,147 stars on GitHub - 8 maintainers
Top 6.3% on pypi.org
hypothesis-graphql 0.11.0 💰
Hypothesis strategies for GraphQL queries
23 versions - Latest release: 6 months ago - 1 dependent package - 109 dependent repositories - 111 thousand downloads last month - 40 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
copulas 0.11.0
Create tabular synthetic data using copulas-based modeling.
49 versions - Latest release: about 1 month ago - 12 dependent packages - 30 dependent repositories - 130 thousand downloads last month - 506 stars on GitHub - 8 maintainers
datamaker-faker 0.0.1
A python lib for data generation
1 version - Latest release: about 2 months ago - 272 downloads last month - 2 stars on GitHub - 1 maintainer
milkstraw-client 1.0.2
Generate synthetic data with a simple python client for milkstraw.ai
5 versions - Latest release: 9 months ago - 20 downloads last month - 13 stars on GitHub - 1 maintainer
augmentum 0.1.2
A library for doing image augmentation
2 versions - Latest release: about 1 year ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
xtuning 0.0.0
Fine-tuning, evaluation and data generation for LLMs
1 version - Latest release: about 1 year ago - 3 downloads last month - 1 maintainer
coota 1.0.3
A powerful data-generating library.
12 versions - Latest release: almost 2 years ago - 1 dependent repository - 39 downloads last month - 2 stars on GitHub - 2 maintainers
synthia 1.1.0
Multidimensional synthetic data generation in Python
6 versions - Latest release: over 2 years ago - 1 dependent package - 2 dependent repositories - 54 downloads last month - 52 stars on GitHub - 1 maintainer
signalz 0.1.1
Synthetic data generators in Python
9 versions - Latest release: over 7 years ago - 2 dependent repositories - 97 downloads last month - 14 stars on GitHub - 1 maintainer
qgeneration 0.1a3
Data generation project
4 versions - Latest release: almost 7 years ago - 1 dependent repository - 41 downloads last month - 0 stars on GitHub - 1 maintainer
noisemix 0.1.1
NoiseMix is a library for data generation for text datasets.
2 versions - Latest release: about 6 years ago - 1 dependent repository - 22 downloads last month - 41 stars on GitHub - 1 maintainer
hypervector-wrapper 0.0.15
Python wrapper to use the Hypervector API. Better data tests
14 versions - Latest release: about 3 years ago - 1 dependent repository - 74 downloads last month - 9 stars on GitHub - 1 maintainer
fastent 0.7.3
Automated Custom NER tool
20 versions - Latest release: almost 6 years ago - 1 dependent repository - 82 downloads last month - 7 stars on GitHub - 1 maintainer
fake-data-for-learning 0.4.4
Sample interesting fake data for machine and human learning
10 versions - Latest release: 10 months ago - 1 dependent repository - 83 downloads last month - 7 stars on GitHub - 1 maintainer
edo 0.3.6
Generating artificial datasets through evolution.
15 versions - Latest release: over 3 years ago - 2 dependent repositories - 81 downloads last month - 13 stars on GitHub - 1 maintainer
dummy-file-generator 1.1.21
dummy csv, flat, json text file generator, typical usage scenario can be load / stress / performa...
21 versions - Latest release: over 1 year ago - 1 dependent repository - 104 downloads last month - 3 stars on GitHub - 1 maintainer
django-data-seeder 0.2.0
A data seeder for models for Django.
4 versions - Latest release: over 4 years ago - 4 dependent repositories - 168 downloads last month - 9 stars on GitHub - 1 maintainer
genalog 0.1.0
Tools for generating analog document (images) from raw text
3 versions - Latest release: almost 3 years ago - 1 dependent repository - 48 downloads last month - 293 stars on GitHub - 1 maintainer
xturing 0.1.8
Fine-tuning, evaluation and data generation for LLMs
19 versions - Latest release: 9 months ago - 321 downloads last month - 1 maintainer
Top 4.5% on pypi.org
deepecho 0.6.0
Create sequential synthetic data of mixed types using a GAN.
24 versions - Latest release: about 1 month ago - 3 dependent packages - 12 dependent repositories - 95.1 thousand downloads last month - 88 stars on GitHub - 7 maintainers
kerasgen 0.1.3
A Keras/Tensorflow compatible image data generator for creating balanced batches
4 versions - Latest release: almost 2 years ago - 1 dependent repository - 31 downloads last month - 14 stars on GitHub - 1 maintainer
realtabformer 0.1.7
A novel method for generating tabular and relational data using language models.
32 versions - Latest release: 23 days ago - 1 dependent repository - 1.3 thousand downloads last month - 183 stars on GitHub - 1 maintainer
py-synd 0.0.1 removed
SYNthetic Data generation for complex tabular datasets
1 version - Latest release: 12 months ago - 2 stars on GitHub