Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "synthetic-data" keyword

Top 8.6% on pypi.org
sdgym 0.7.0
Benchmark tabular synthetic data generators using a variety of datasets
28 versions - Latest release: 11 months ago - 3 dependent repositories - 411 downloads last month - 243 stars on GitHub - 6 maintainers
discus 0.1.3b0
Generate high-quality data to unlock all AI possibilities
3 versions - Latest release: 8 months ago - 1 dependent repositories - 21 downloads last month - 62 stars on GitHub - 1 maintainer
syntheval 1.4.1
A package for evaluating synthetic data fidelity on various performance dimensions.
8 versions - Latest release: 3 months ago - 1 dependent package - 102 downloads last month - 3 stars on GitHub - 1 maintainer
conflowgen 2.1.1
A generator for synthetic container flows at maritime container terminals with a focus on yard op...
8 versions - Latest release: 5 months ago - 1 dependent repositories - 65 downloads last month - 11 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
sdv 1.13.1
Generate synthetic data for single table, multi table and sequential data
133 versions - Latest release: 5 days ago - 25 dependent packages - 36 dependent repositories - 48.7 thousand downloads last month - 2,161 stars on GitHub - 9 maintainers
Top 1.3% on pypi.org
mimesis 16.0.0
Mimesis: Fake Data Generator.
58 versions - Latest release: about 2 months ago - 20 dependent packages - 363 dependent repositories - 468 thousand downloads last month - 4,313 stars on GitHub - 2 maintainers
faker-file 0.17.11
Generate files with fake data.
68 versions - Latest release: 6 months ago - 1 dependent package - 4.93 thousand downloads last month - 82 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
pydbgen 1.0.5
Random database/dataframe generator
1 version - Latest release: about 6 years ago - 12 dependent repositories - 369 downloads last month - 292 stars on GitHub - 1 maintainer
edsl 0.1.22
Create and analyze LLM-based surveys
28 versions - Latest release: 7 days ago - 1.03 thousand downloads last month - 20 stars on GitHub - 3 maintainers
Top 5.7% on pypi.org
dbldatagen 0.3.6
Databricks Labs - PySpark Synthetic Data Generator
13 versions - Latest release: 3 months ago - 1 dependent package - 2 dependent repositories - 173 thousand downloads last month - 268 stars on GitHub - 1 maintainer
sparkdantic 0.20.5
A pydantic -> spark schema library
30 versions - Latest release: 2 months ago - 32.9 thousand downloads last month - 268 stars on GitHub - 1 maintainer
flip-data 0.2.1
Generate thousands of new 2D images from a small batch of objects and backgrounds.
10 versions - Latest release: over 2 years ago - 1 dependent repositories - 111 downloads last month - 298 stars on GitHub - 2 maintainers
blendersynth 0.2.4
Synthetic Rendering for Blender
19 versions - Latest release: 5 months ago - 80 downloads last month - 35 stars on GitHub - 1 maintainer
xrfeitoria 0.6.2
OpenXRLab Synthetic Data Rendering Toolbox
6 versions - Latest release: about 1 month ago - 50 downloads last month - 169 stars on GitHub - 3 maintainers
syndiffix 1.0.1
Python implementation of the SynDiffix synthetic data generation mechanism.
6 versions - Latest release: about 2 months ago - 56 downloads last month - 3 stars on GitHub - 1 maintainer
syngen 0.8.1
The tool uncovers patterns, trends, and correlations hidden within your production datasets.
236 versions - Latest release: 4 days ago - 2.82 thousand downloads last month - 14 stars on GitHub - 1 maintainer
privateai 1.3.1
A thin client for communicating with the Private AI de-identication API.
1 version - Latest release: 9 months ago - 18 downloads last month - 18 stars on GitHub - 1 maintainer
privateai-client 3.8.1
A thin client for communicating with the Private AI de-identication API.
20 versions - Latest release: about 1 month ago - 1 dependent repositories - 697 downloads last month - 18 stars on GitHub - 2 maintainers
Top 7.8% on pypi.org
ydata-synthetic 1.4.0
Synthetic data generation methods with different synthetization methods.
31 versions - Latest release: 16 days ago - 2 dependent packages - 1 dependent repositories - 6.1 thousand downloads last month - 1,313 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
sdmetrics 0.14.1
Metrics for Synthetic Data Generation Projects
86 versions - Latest release: 8 days ago - 5 dependent packages - 13 dependent repositories - 104 thousand downloads last month - 192 stars on GitHub - 10 maintainers
blenderline 0.1.6
A pipeline for generating synthetic production line images
7 versions - Latest release: 10 months ago - 80 downloads last month - 14 stars on GitHub - 1 maintainer
asm1-influent-generator 0.0.6
A Python Module to generate Arrays in the Activated Sludge Model 1 (ASM1) format.
4 versions - Latest release: 10 days ago - 599 downloads last month - 542 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
gretel-synthetics 0.22.10
Synthetic Data Generation with optional Differential Privacy
59 versions - Latest release: 7 days ago - 2 dependent packages - 4 dependent repositories - 5.62 thousand downloads last month - 542 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
gretel-client 0.18.2
Balance, anonymize, and share your data. With privacy guarantees.
99 versions - Latest release: 7 days ago - 1 dependent package - 2 dependent repositories - 3.17 thousand downloads last month - 43 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
table-evaluator 1.6.1
A package to evaluate how close a synthetic data set is to real data.
31 versions - Latest release: 9 months ago - 3 dependent packages - 5 dependent repositories - 1.79 thousand downloads last month - 74 stars on GitHub - 1 maintainer
synthcity 0.2.10
Synthetic data generator and evaluator!
22 versions - Latest release: 3 months ago - 3 dependent packages - 1 dependent repositories - 1.84 thousand downloads last month - 363 stars on GitHub - 2 maintainers
sciphi-synthesizer 1.0.5
Synthesizer: A Framework for LLM Powered Data.
6 versions - Latest release: 5 months ago - 298 downloads last month - 588 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
ctgan 0.10.1
Create tabular synthetic data using a conditional GAN
68 versions - Latest release: 8 days ago - 13 dependent packages - 29 dependent repositories - 99 thousand downloads last month - 1,147 stars on GitHub - 8 maintainers
dpart 0.1.2
dpart: General, flexible, and scalable framework for differentially private synthetic data genera...
2 versions - Latest release: over 1 year ago - 1 dependent repositories - 29 downloads last month - 22 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
copulas 0.11.0
Create tabular synthetic data using copulas-based modeling.
49 versions - Latest release: about 1 month ago - 12 dependent packages - 30 dependent repositories - 130 thousand downloads last month - 506 stars on GitHub - 8 maintainers
bamt-light 0.0.2
data modeling and analysis tool based on Bayesian networks
3 versions - Latest release: 4 months ago - 1 dependent package - 27 downloads last month - 113 stars on GitHub - 1 maintainer
anonymeter 1.0.0
Measure singling out, linkability, and inference risk for synthetic data.
3 versions - Latest release: 4 months ago - 1 dependent package - 1 dependent repositories - 642 downloads last month - 57 stars on GitHub - 1 maintainer
non-parametric-multivariate-data-generator 0.0.3
A small example package
3 versions - Latest release: 10 months ago - 28 downloads last month - 3 stars on GitHub - 1 maintainer
datamaker-faker 0.0.1
A python lib for data generation
1 version - Latest release: about 2 months ago - 272 downloads last month - 2 stars on GitHub - 1 maintainer
tno.sdg.tabular.eval.utility-metrics 0.3.0
Utility metrics for tabular data
1 version - Latest release: 3 months ago - 8 downloads last month - 2 stars on GitHub - 1 maintainer
sdv-installer 0.0.0.dev2
Package to install SDV Enterprise.
3 versions - Latest release: 3 months ago - 101 downloads last month - 1 maintainer
datadreamer.dev 0.35.0
Prompt. Generate Synthetic Data. Train & Align Models.
34 versions - Latest release: 20 days ago - 1.05 thousand downloads last month - 648 stars on GitHub - 1 maintainer
sciphi 0.1.7
SciPhi: A Framework for LLM Powered Data.
7 versions - Latest release: 7 months ago - 47 downloads last month - 588 stars on GitHub - 1 maintainer
metasyn 1.0.0
Package for creating synthetic datasets while preserving privacy.
5 versions - Latest release: 8 days ago - 1 dependent package - 1 dependent repositories - 486 downloads last month - 29 stars on GitHub - 1 maintainer
pygraft 0.0.3
PyGraft: Configurable Generation of Schemas and Knowledge Graphs at Your Fingertips
3 versions - Latest release: 9 months ago - 48 downloads last month - 634 stars on GitHub - 1 maintainer
sim4rec 0.0.2
Simulator for recommendation algorithms
2 versions - Latest release: 10 months ago - 40 downloads last month - 43 stars on GitHub - 1 maintainer
milkstraw-client 1.0.2
Generate synthetic data with a simple python client for milkstraw.ai
5 versions - Latest release: 9 months ago - 20 downloads last month - 13 stars on GitHub - 1 maintainer
tinto-prueba 0.0.2
TINTO package test
2 versions - Latest release: about 1 year ago - 25 downloads last month - 2 stars on GitHub - 1 maintainer
tintonera-prueba 0.0.1
TINTO package test
1 version - Latest release: about 1 year ago - 22 downloads last month - 2 stars on GitHub - 1 maintainer
tsgm 0.0.5
Time Series Generative Modelling Framework
5 versions - Latest release: about 2 months ago - 395 downloads last month - 94 stars on GitHub - 1 maintainer
ml-impute 0.0.7
A package for synthetic data generation for imputation using single and multiple imputation methods.
4 versions - Latest release: about 1 year ago - 32 downloads last month - 3 stars on GitHub - 1 maintainer
keshik 0.0.5
A package which lets users oversample class imbalanced(binary) data by Denoising Diffusion Probab...
2 versions - Latest release: over 1 year ago - 14 downloads last month - 4 stars on GitHub - 1 maintainer
simba-ml 0.0.1
Simulation-Based Machine Learning
21 versions - Latest release: over 1 year ago - 133 downloads last month - 0 stars on GitHub - 1 maintainer
metasynth 0.5.999
[Inactive] Package for creating synthetic datasets while preserving privacy.
10 versions - Latest release: 8 months ago - 39 downloads last month - 29 stars on GitHub - 1 maintainer
dpsdv 0.0.3 💰
Creating a Differential Privacy securing Synthetic Data Generation for tabular, relational and ti...
4 versions - Latest release: almost 2 years ago - 32 downloads last month - 5 stars on GitHub - 1 maintainer
zpy-zumo 1.4.0
Create synthetic data with Blender.
49 versions - Latest release: almost 3 years ago - 1 dependent repositories - 478 downloads last month - 291 stars on GitHub - 2 maintainers
workforcesim 0.4.22
Synaptans WorkforceSim is a free open-source web app for simulating the complex dynamics of a fac...
8 versions - Latest release: about 1 year ago - 1 dependent repositories - 26 downloads last month - 2 stars on GitHub - 1 maintainer
unreal 0.1.1
A Pythonic implementation of R package conjurer
2 versions - Latest release: over 3 years ago - 5 dependent repositories - 513 downloads last month - 0 stars on GitHub - 1 maintainer
Top 7.9% on pypi.org
tgan 0.1.0
Generative adversarial training for synthesizing tabular data
2 versions - Latest release: about 5 years ago - 13 dependent repositories - 306 downloads last month - 267 stars on GitHub - 2 maintainers
synthia 1.1.0
Multidimensional synthetic data generation in Python
6 versions - Latest release: over 2 years ago - 1 dependent package - 2 dependent repositories - 54 downloads last month - 52 stars on GitHub - 1 maintainer
Top 4.6% on pypi.org
smogn 0.1.2
A Python implementation of Synthetic Minority Over-Sampling Technique for Regression with Gaussia...
6 versions - Latest release: about 4 years ago - 2 dependent packages - 9 dependent repositories - 1.67 thousand downloads last month - 288 stars on GitHub - 1 maintainer
sdnist 2.3.0
SDNist: Deidentified Data Report Generator
10 versions - Latest release: 11 months ago - 1 dependent repositories - 233 downloads last month - 30 stars on GitHub - 2 maintainers
price-process 1.1.3
Library for generating various stochastic price sequences
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 4 downloads last month - 0 stars on GitHub - 1 maintainer
plaitpy 0.1.1
a fake data generator
12 versions - Latest release: over 6 years ago - 1 dependent repositories - 43 downloads last month - 425 stars on GitHub - 1 maintainer
mesh-to-depth 0.1.2
Generate depth maps, given a mesh and camera parameters
3 versions - Latest release: about 3 years ago - 1 dependent repositories - 26 downloads last month - 30 stars on GitHub - 1 maintainer
leopardi 0.2.24
An extensible library for generating 3D synthetic data with Blender that just works.
12 versions - Latest release: about 2 years ago - 1 dependent repositories - 24 downloads last month - 5 stars on GitHub - 1 maintainer
hypervector-wrapper 0.0.15
Python wrapper to use the Hypervector API. Better data tests
14 versions - Latest release: about 3 years ago - 1 dependent repositories - 74 downloads last month - 9 stars on GitHub - 1 maintainer
hawks 0.2.0
A package for generating synthetic clusters, with parameters to customize different aspects of th...
4 versions - Latest release: over 4 years ago - 1 dependent repositories - 45 downloads last month - 22 stars on GitHub - 1 maintainer
datagene 0.0.3
Data Comparison Toolbox with Transformation and Similarity Analysis
3 versions - Latest release: about 4 years ago - 1 dependent repositories - 24 downloads last month - 191 stars on GitHub - 1 maintainer
domias 0.0.5
DOMIAS, a density-based MIA model that aims to infer membership by targeting local overfitting of...
5 versions - Latest release: about 1 year ago - 52 downloads last month - 6 stars on GitHub - 2 maintainers
Top 9.8% on pypi.org
augraphy 8.2.6 💰
Augmentation pipeline for rendering synthetic paper printing and scanning processes
26 versions - Latest release: 5 months ago - 1 dependent repositories - 1.14 thousand downloads last month - 289 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
blenderproc 2.7.1
A procedural Blender pipeline for photorealistic training image generation
17 versions - Latest release: about 1 month ago - 3 dependent repositories - 90.3 thousand downloads last month - 2,381 stars on GitHub - 3 maintainers
Top 9.9% on pypi.org
bamt 1.2.71
data modeling and analysis tool based on Bayesian networks
41 versions - Latest release: 4 months ago - 2 dependent packages - 1 dependent repositories - 339 downloads last month - 113 stars on GitHub - 2 maintainers
genalog 0.1.0
Tools for generating analog document (images) from raw text
3 versions - Latest release: almost 3 years ago - 1 dependent repositories - 48 downloads last month - 293 stars on GitHub - 1 maintainer
dp-cgans 0.0.6
A library to generate synthetic tabular or RDF data using Conditional Generative Adversary Networ...
6 versions - Latest release: 6 months ago - 1 dependent repositories - 72 downloads last month - 29 stars on GitHub - 1 maintainer
exhibit 0.9.8
Command line tool to generate anonymised demonstrator data
9 versions - Latest release: 6 months ago - 2 dependent repositories - 109 downloads last month - 5 stars on GitHub - 1 maintainer
Top 4.5% on pypi.org
deepecho 0.6.0
Create sequential synthetic data of mixed types using a GAN.
24 versions - Latest release: about 1 month ago - 3 dependent packages - 12 dependent repositories - 95.1 thousand downloads last month - 88 stars on GitHub - 7 maintainers
synloc 0.1.2
A Python package to create synthetic data from a locally estimated distributions.
5 versions - Latest release: over 1 year ago - 1 dependent repositories - 21 downloads last month - 2 stars on GitHub - 1 maintainer
medkit-learn 0.1.0
Medical sequential decision making simulation tools.
1 version - Latest release: over 2 years ago - 1 dependent repositories - 18 downloads last month - 23 stars on GitHub - 4 maintainers
bpycv3d 1.1.0
Blender Python Package for extracting internal data from blender scenes for 3d related data gener...
2 versions - Latest release: over 1 year ago - 13 downloads last month - 5 stars on GitHub - 1 maintainer
faker-file-qt 0.1.4
PyQT UI for faker-file.
5 versions - Latest release: 8 months ago - 50 downloads last month - 0 stars on GitHub - 1 maintainer
medigan 1.0.0
medigan is a modular open-source Python library that provides an interface to multiple generative...
3 versions - Latest release: over 1 year ago - 1 dependent repositories - 76 downloads last month - 106 stars on GitHub - 2 maintainers
realtabformer 0.1.7
A novel method for generating tabular and relational data using language models.
32 versions - Latest release: 23 days ago - 1 dependent repositories - 1.3 thousand downloads last month - 183 stars on GitHub - 1 maintainer
artext 0.2.9
Probabilistic Noising of Natural Language
10 versions - Latest release: about 4 years ago - 1 dependent repositories - 80 downloads last month - 6 stars on GitHub - 1 maintainer
ktdg 0.1.18
Library to simulate knowledge tracing datasets
15 versions - Latest release: about 1 year ago - 1 dependent repositories - 98 downloads last month - 0 stars on GitLab.com - 1 maintainer
workforcesim-neuraxenetica 0.1.83 removed
a very short description.
1 version - Latest release: about 2 years ago - 1 stars on GitHub
py-synd 0.0.1 removed
SYNthetic Data generation for complex tabular datasets
1 version - Latest release: 12 months ago - 2 stars on GitHub
Related Keywords
python 31 machine-learning 20 synthetic-dataset-generation 18 data-generation 14 deep-learning 14 tabular-data 13 synthetic 10 privacy 10 data-augmentation 8 artificial-intelligence 8 data 8 generative-adversarial-network 7 synthetic-data-generation 7 blender 7 data-science 6 differential-privacy 6 generative-ai 5 generative-model 5 synthetic data 5 generation 5 time-series 5 faker 5 anonymization 4 sdv 4 rendering 4 fake-data 4 gan 4 computer-vision 4 pytorch 4 gans 4 testing 4 timeseries 4 simulation 4 single-table 3 multi-table 3 ai 3 deep learning 3 machine learning 3 synthetic-data-generator 3 finance 3 openai 3 gdpr 3 gpt 3 datagenerator 3 datageneration 3 deep-neural-networks 3 generator 3 open-source 3 redaction 2 redact 2 hippa 2 dataset-generation 2 python3 2 tensorflow2 2 training-data 2 tabular datasets 2 dlp 2 deidentification 2 de-identification 2 tabular 2 vae 2 depth-images 2 object-detection 2 spark-streaming 2 spark 2 pyspark 2 deltalake 2 delta-live-tables 2 datagen 2 databricks 2 llm 2 disclosure 2 data-generator 2 workplace 2 sensitivity-analysis 2 prescriptive-analytics 2 predictive-analytics 2 modelling 2 human-resources 2 hrm 2 blender-addon 2 augmentation 2 data-privacy 2 convolutional-neural-network 2 python-script 2 tidy-data 2 tidy-data-into-images 2 GAN 2 clustering 2 tensorflow 2 privacy-enhancing-technologies 2 evaluation 2 agents 2 copulas 2 bayesian-networks 2 mixed-data 2 parameters-learning 2 structure-learning 2 dataset 2 synthetic data generation 2