Great post! I did some further reading while going through each topic, and I would like to add a few things here.
1. Anonymization
I wouldn't use uuid() like this. Using a hashing function would be better to ensure consistency across multiple runs. F.sha2(F.con...
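To illustrate the consistency point, here is a minimal plain-Python sketch using `hashlib` rather than Spark. It is not the truncated expression from the comment above; the `pseudonymize` helper, the `SALT` value, and the sample email addresses are all hypothetical names chosen for the example. It only shows that a hash of the same input is stable across runs, while a random UUID is not.

```python
import hashlib
import uuid

# Hypothetical salt for illustration; a real one would live in a secret store.
SALT = "example-salt"


def pseudonymize(value: str, salt: str = SALT) -> str:
    """Return a deterministic SHA-256 hex digest of the salted value."""
    return hashlib.sha256(f"{salt}|{value}".encode("utf-8")).hexdigest()


# Hashing the same input twice (think: two pipeline runs) gives the same token.
first_run = pseudonymize("alice@example.com")
second_run = pseudonymize("alice@example.com")

# A different input gives a different token.
other_user = pseudonymize("bob@example.com")

# Random UUIDs differ on every call, so they cannot be reproduced on a rerun.
random_a = str(uuid.uuid4())
random_b = str(uuid.uuid4())

print(first_run == second_run)  # hashes match across calls
print(random_a == random_b)     # random UUIDs do not
```

Because the token depends only on the input and the salt, joins between tables anonymized in separate runs should still line up, which a freshly generated UUID per row cannot offer.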
It is a great article, and I am excited for the next parts. I am not sure about having these metadata tables in the lakehouse. It forces us to build a data pipeline for the metadata table. Isn't it better to just use a transactional database like mongo or...