Hello everyone,
I am not sure whether this has already been asked, but I am trying to find a way to write a single Python (or Scala) script that lists all the schemas and then runs OPTIMIZE, VACUUM, and ANALYZE TABLE on every table in them.
Most of the examples I find online require the user to name the schema explicitly. Is it possible to avoid that, so the script can run daily and maintain every table it discovers without any schema being hard-coded?
https://www.modern-dataengineering.com/post/azure-databricks-how-to-optimize-all-your-tables
The post linked above is the closest I have come to such a script.
Has anyone done something similar? Any help would be much appreciated.
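
To make the question concrete, here is a minimal, untested sketch of the loop I have in mind. It assumes a Databricks notebook or job where a `spark` session already exists, that the tables are Delta tables, and that the default 168-hour (7-day) VACUUM retention is acceptable. The helper names (`maintenance_statements`, `maintain_all`, `run_on_spark`) are made up for illustration and do not come from the linked post.

```python
from typing import Callable, Iterable, List, Tuple


def maintenance_statements(table: str, retain_hours: int = 168) -> List[str]:
    """Build the three maintenance statements for one fully qualified table."""
    return [
        f"OPTIMIZE {table}",
        f"VACUUM {table} RETAIN {retain_hours} HOURS",
        f"ANALYZE TABLE {table} COMPUTE STATISTICS",
    ]


def maintain_all(
    list_schemas: Callable[[], Iterable[str]],
    list_tables: Callable[[str], Iterable[str]],
    run_sql: Callable[[str], object],
) -> List[Tuple[str, str]]:
    """Run maintenance on every table of every schema.

    Returns (statement, error message) pairs for statements that failed,
    so one bad table (e.g. a non-Delta table) does not stop the whole run.
    """
    failures: List[Tuple[str, str]] = []
    for schema in list_schemas():
        for table in list_tables(schema):
            qualified = f"`{schema}`.`{table}`"
            for stmt in maintenance_statements(qualified):
                try:
                    run_sql(stmt)
                except Exception as exc:
                    failures.append((stmt, str(exc)))
    return failures


def run_on_spark(spark) -> List[Tuple[str, str]]:
    """Wire the loop to a live SparkSession (assumed to be provided by Databricks)."""

    def schemas() -> List[str]:
        # Schemas visible in the current catalog.
        return [db.name for db in spark.catalog.listDatabases()]

    def tables(schema: str) -> List[str]:
        # Skip temporary views and views; OPTIMIZE and VACUUM only apply to tables.
        return [
            t.name
            for t in spark.catalog.listTables(schema)
            if not t.isTemporary and t.tableType != "VIEW"
        ]

    return maintain_all(schemas, tables, spark.sql)
```

The idea would be to schedule a job that calls `run_on_spark(spark)` once a day and logs whatever it returns. Keeping the schema and table discovery behind plain callables is only there so the loop can be tried without a cluster.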