SynthoHive Documentation

SynthoHive is a production-grade synthetic data engine that generates high-fidelity, privacy-preserving synthetic data for complex relational databases. It maintains referential integrity across multi-table schemas, preserves statistical correlations, and provides automated PII handling.
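
To make "referential integrity" concrete, here is a minimal sketch of the property a multi-table generator has to preserve: every foreign key in a child table must point at an existing primary key in its parent table. It uses plain Python only and does not call the SynthoHive API. The table names, column names, and the `orphaned_foreign_keys` helper are hypothetical and chosen for illustration.

```python
# Illustrative only: hypothetical tables as lists of dicts, not SynthoHive output.

def orphaned_foreign_keys(parent_rows, child_rows, pk, fk):
    """Return sorted child foreign-key values with no matching parent primary key."""
    parent_keys = {row[pk] for row in parent_rows}
    return sorted({row[fk] for row in child_rows if row[fk] not in parent_keys})


# Hypothetical parent table.
customers = [{"customer_id": 1}, {"customer_id": 2}]

# Hypothetical child table; the last row references a customer that does not exist.
orders = [
    {"order_id": 10, "customer_id": 1},
    {"order_id": 11, "customer_id": 2},
    {"order_id": 12, "customer_id": 3},
]

orphans = orphaned_foreign_keys(customers, orders, "customer_id", "customer_id")
print(orphans)  # an empty list would mean referential integrity holds
```

A check like this could be run against any generated parent/child pair as a quick sanity test; an empty result means no child row references a missing parent.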

What you'll find

Quick install

pip install synthohive pyspark pandas pyarrow

Build docs locally

pip install ".[docs]"
mkdocs serve

Deploy to GitHub Pages:

mkdocs gh-deploy --force