cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
cancel
Showing results for 
Search instead for 
Did you mean: 

delta table separate gold zone by different tenant

alexgv12
New Contributor III

Hello, currently we have a process that builds with delta table the zones of bronze, silver and when it reaches gold we must create specific zones for each client because the schema changes, for this we create databases and separate tables, but when we are doing this process it takes a long time and in many occasions communication is lost in the notebook and the execution fails, what good practice, or alternatives could we do to improve our flow performance and also that metrics could be used to define the characteristics and number of workers to scale. currently we are processing 30 tables and each one must be generated for 800 tenant so it would be the execution of 24000 notebooks.image

2 REPLIES 2

Noopur_Nigam
Valued Contributor II
Valued Contributor II

Hi @alexander grajales vanegas​ Are you creating all the databases and tables in gold zone manually?If so, please check out DLT https://docs.databricks.com/data-engineering/delta-live-tables/index.html, it will take care of your complete pipeline by itself.

Hi @Noopur Nigam​ , currently the creation is an automatic process that we implement in pyspark and using jinja to build the different zones, but it is very interesting this alternative that you show me, I will review the documentation to see what we could use or take it into account for future projects, thank you very much.

Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.