RonanStokes_DB
Databricks Employee
Databricks Employee

Hi @Nick_Hughes 
This may be late for your scenario - but hopefully others facing similar issues will find it useful.

You can specify how data is generated in `dbldatagen` using rules in the data generation spec. If rules are specified for data generation, you can generate a wide variety of data. 

The document (located at https://databrickslabs.github.io/dbldatagen/public_docs/index.html) gives many examples. 

There is an example similar to what you are requesting (multi tables with cross referencing) located at: https://databrickslabs.github.io/dbldatagen/public_docs/multi_table_data.html

If you are seeing only 1s and 0s when rules are specified, or simply have suggestions on ways to improve this - feel free to open an issue in the issues page of the main GitHub site at https://github.com/databrickslabs/dbldatagen

Regards Ronan Stokes (Databricks)