Hi @zyang,
To sync data and tables/views between production and test workspaces in Azure, the recommended approach is to use the Databricks Sync (DBSync) project, which is an object synchronization tool that backs up, restores, and syncs Databricks workspaces. Using the Data Explorer upgrade wizard, you can use the DBSync project to copy complete schemas (databases) and multiple external tables from your default Hive metastore to the Unity Catalog metastore.
Here are the steps you can follow:
1. Use the DBSync project to sync the production workspace to the test workspace.
2. After the sync succeeds, go to the test workspace and click Queries.
3. Click Create Query and choose the name of the SQL warehouse.
4. Enter a query, for example SELECT * FROM database.table
.
5. Click Run to display the ingested data. Suppose you cannot get the complete list of tables/views in the production database by using "show tables in database" in the test workspace. In that case, you can use the Databricks REST API to get the list of tables/views in the production database and then create them in the test workspace. You can also use the DBSync project to copy the entire database from production to test, including all tables/views.