Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "data-generation" keyword
Top 1.8% on pypi.org
sdv 1.13.1
Generate synthetic data for single table, multi table and sequential data
133 versions - Latest release: 5 days ago - 25 dependent packages - 36 dependent repositories - 48.7 thousand downloads last month - 2,161 stars on GitHub - 9 maintainers
Top 7.7% on pypi.org
pydbgen 1.0.5
Random database/dataframe generator
1 version - Latest release: about 6 years ago - 12 dependent repositories - 369 downloads last month - 292 stars on GitHub - 1 maintainer
Top 5.7% on pypi.org
dbldatagen 0.3.6
Databricks Labs - PySpark Synthetic Data Generator
13 versions - Latest release: 3 months ago - 1 dependent package - 2 dependent repositories - 173 thousand downloads last month - 268 stars on GitHub - 1 maintainer
sparkdantic 0.20.5
A pydantic -> spark schema library
30 versions - Latest release: 2 months ago - 32.9 thousand downloads last month - 268 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
be-great 0.0.7
Generating Realistic Tabular Data using Large Language Models
7 versions - Latest release: 9 months ago - 2 dependent packages - 1 dependent repository - 2.5 thousand downloads last month - 235 stars on GitHub - 2 maintainers
absynthe 0.0.3
A (branching) Behaviour Synthesizer
3 versions - Latest release: almost 5 years ago - 1 dependent repository - 73 downloads last month - 8 stars on GitHub - 1 maintainer
openmixup 0.2.9
Mixup for Supervision, Semi- and Self-Supervision Learning Toolbox and Benchmark
2 versions - Latest release: 2 months ago - 32 downloads last month - 499 stars on GitHub - 1 maintainer
sdgne 4.0.0
Synthetic Data Generation and Evaluation
7 versions - Latest release: 3 months ago - 40 downloads last month - 1 star on GitHub - 1 maintainer
Top 1.9% on pypi.org
ctgan 0.10.1
Create tabular synthetic data using a conditional GAN
68 versions - Latest release: 8 days ago - 13 dependent packages - 29 dependent repositories - 99 thousand downloads last month - 1,147 stars on GitHub - 8 maintainers
Top 6.3% on pypi.org
hypothesis-graphql 0.11.0 💰
Hypothesis strategies for GraphQL queries
23 versions - Latest release: 6 months ago - 1 dependent package - 109 dependent repositories - 111 thousand downloads last month - 40 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
copulas 0.11.0
Create tabular synthetic data using copulas-based modeling.
49 versions - Latest release: about 1 month ago - 12 dependent packages - 30 dependent repositories - 130 thousand downloads last month - 506 stars on GitHub - 8 maintainers
datamaker-faker 0.0.1
A python lib for data generation
1 version - Latest release: about 2 months ago - 272 downloads last month - 2 stars on GitHub - 1 maintainer
milkstraw-client 1.0.2
Generate synthetic data with a simple python client for milkstraw.ai
5 versions - Latest release: 9 months ago - 20 downloads last month - 13 stars on GitHub - 1 maintainer
augmentum 0.1.2
A library for doing image augmentation
2 versions - Latest release: about 1 year ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
xtuning 0.0.0
Fine-tuning, evaluation and data generation for LLMs
1 version - Latest release: about 1 year ago - 3 downloads last month - 1 maintainer
coota 1.0.3
A powerful data-generating library.
12 versions - Latest release: almost 2 years ago - 1 dependent repository - 39 downloads last month - 2 stars on GitHub - 2 maintainers
synthia 1.1.0
Multidimensional synthetic data generation in Python
6 versions - Latest release: over 2 years ago - 1 dependent package - 2 dependent repositories - 54 downloads last month - 52 stars on GitHub - 1 maintainer
signalz 0.1.1
Synthetic data generators in Python
9 versions - Latest release: over 7 years ago - 2 dependent repositories - 97 downloads last month - 14 stars on GitHub - 1 maintainer
qgeneration 0.1a3
Data generation project
4 versions - Latest release: almost 7 years ago - 1 dependent repository - 41 downloads last month - 0 stars on GitHub - 1 maintainer
noisemix 0.1.1
NoiseMix is a library for data generation for text datasets.
2 versions - Latest release: about 6 years ago - 1 dependent repository - 22 downloads last month - 41 stars on GitHub - 1 maintainer
hypervector-wrapper 0.0.15
Python wrapper to use the Hypervector API. Better data tests
14 versions - Latest release: about 3 years ago - 1 dependent repository - 74 downloads last month - 9 stars on GitHub - 1 maintainer
fastent 0.7.3
Automated Custom NER tool
20 versions - Latest release: almost 6 years ago - 1 dependent repository - 82 downloads last month - 7 stars on GitHub - 1 maintainer
fake-data-for-learning 0.4.4
Sample interesting fake data for machine and human learning
10 versions - Latest release: 10 months ago - 1 dependent repository - 83 downloads last month - 7 stars on GitHub - 1 maintainer
edo 0.3.6
Generating artificial datasets through evolution.
15 versions - Latest release: over 3 years ago - 2 dependent repositories - 81 downloads last month - 13 stars on GitHub - 1 maintainer
dummy-file-generator 1.1.21
dummy csv, flat, json text file generator, typical usage scenario can be load / stress / performa...
21 versions - Latest release: over 1 year ago - 1 dependent repository - 104 downloads last month - 3 stars on GitHub - 1 maintainer
django-data-seeder 0.2.0
A data seeder for models for Django.
4 versions - Latest release: over 4 years ago - 4 dependent repositories - 168 downloads last month - 9 stars on GitHub - 1 maintainer
genalog 0.1.0
Tools for generating analog document (images) from raw text
3 versions - Latest release: almost 3 years ago - 1 dependent repository - 48 downloads last month - 293 stars on GitHub - 1 maintainer
xturing 0.1.8
Fine-tuning, evaluation and data generation for LLMs
19 versions - Latest release: 9 months ago - 321 downloads last month - 1 maintainer
Top 4.5% on pypi.org
deepecho 0.6.0
Create sequential synthetic data of mixed types using a GAN.
24 versions - Latest release: about 1 month ago - 3 dependent packages - 12 dependent repositories - 95.1 thousand downloads last month - 88 stars on GitHub - 7 maintainers
kerasgen 0.1.3
A Keras/Tensorflow compatible image data generator for creating balanced batches
4 versions - Latest release: almost 2 years ago - 1 dependent repository - 31 downloads last month - 14 stars on GitHub - 1 maintainer
realtabformer 0.1.7
A novel method for generating tabular and relational data using language models.
32 versions - Latest release: 23 days ago - 1 dependent repository - 1.3 thousand downloads last month - 183 stars on GitHub - 1 maintainer
py-synd 0.0.1 removed
SYNthetic Data generation for complex tabular datasets
1 version - Latest release: 12 months ago - 2 stars on GitHub
Related Keywords
synthetic-data (14)
python (12)
data-science (8)
tabular-data (6)
deep-learning (6)
machine-learning (6)
synthetic-data-generation (5)
data-generator (5)
data-augmentation (4)
pytorch (3)
time-series (3)
synthetic-dataset-generation (3)
synthetic (3)
generative-adversarial-network (3)
nlp (3)
data (3)
testing (3)
faker (3)
tabular data (2)
statistics (2)
spark-streaming (2)
spark (2)
pyspark (2)
deltalake (2)
data generation (2)
data-privacy (2)
language models (2)
deep learning (2)
transformers (2)
augmentation (2)
python3 (2)
test-data-generator (2)
natural-language-processing (2)
synthetic data (2)
sdv (2)
gan (2)
generative-ai (2)
generative-model (2)
distributed (2)
training (2)
database (2)
evaluation (2)
finetuning (2)
delta-live-tables (2)
datagenerator (2)
datageneration (2)
datagen (2)
databricks (2)
llm (2)
generator (2)
fake-data (2)
named-entity-recognition (1)
named-entities (1)
data-annotation (1)
NER (1)
data-science-testing (1)
data-driven-tests (1)
text-datasets (1)
dependency-modeling (1)
finance (1)
fasttext (1)
test (1)
fpca (1)
fixtures (1)
functional-data (1)
noise (1)
generators (1)
oversampling (1)
artificial (1)
signals (1)
principal-component-analysis (1)
xarray (1)
weather (1)
spacy (1)
synthetic-images (1)
text-alignment (1)
deepecho (1)
DeepEcho (1)
data-generators (1)
keras (1)
keras-tensorflow (1)
tensorflow (1)
triplet (1)
triplet-loss (1)
triplet-neural-network (1)
REaLTabFormer (1)
seq2seq model (1)
synthetic data generation (1)
gpt (1)
gpt-2 (1)
seq2seq-model (1)
sequential-data (1)
fake-data-generator (1)
evolutionary algorithm (1)
artificial data (1)
evolution (1)
evolutionary-algorithms (1)
optimisation (1)
csv (1)
dummy-csv (1)