Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
-
Updated
May 5, 2025 - Python
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.
A procedural Blender pipeline for photorealistic training image generation
Python Library for Causal and Probabilistic Modeling using Bayesian Networks
Synthetic data generation for tabular data
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
SDG is a specialized framework designed to generate high-quality structured tabular data.
Conditional GAN for generating synthetic tabular data.
Synthetic data curation for post-training and structured data extraction
A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
Synthetic data generators for structured and unstructured text, featuring differentially private learning.
A multi-purpose LLM framework for RAG and data creation.
A library to model multivariate data using copulas.
A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.
Synthetic Data SDK ✨
Add a description, image, and links to the synthetic-data topic page so that developers can more easily learn about it.
To associate your repository with the synthetic-data topic, visit your repo's landing page and select "manage topics."