
Migrate a workspace from one AWS account to another AWS account

Heisenberg
New Contributor II

Hi everyone,

We have a Databricks workspace in an AWS account that we need to migrate to a new AWS account.

The workspace has many managed tables, workflows, saved queries, and notebooks that need to be migrated, so we are looking for an efficient way to replicate the complete setup in the new workspace.

If the table data can't be migrated directly because of how managed tables are stored in S3, we could use Deep Clone or Delta Sharing to copy the data over.

It would be especially helpful if we could replicate the table structures, workflows, queries, permissions, etc.

Appreciate your suggestions!

Thank you!

 

2 REPLIES

katherine561
New Contributor II

To migrate your Databricks workspace from one AWS account to another, start by exporting the notebooks, workflows, and saved query configurations using the Databricks REST API or CLI. Use Deep Clone or Delta Sharing to transfer the managed table data, and consider AWS DataSync or Glue for large datasets. Import the configurations into the Databricks workspace in the new AWS account, then recreate the permissions there, either manually or with a script. Test thoroughly to confirm that functionality and permissions are correct, and document the migration process for future reference. It is also worth engaging Databricks and AWS support for advice tailored to your specific requirements.
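
As an illustration of the export/import step for notebooks, here is a minimal sketch in Python. It assumes the Workspace API endpoints `/api/2.0/workspace/export` and `/api/2.0/workspace/import` with personal access token authentication. The host, token, and notebook paths are placeholders, and the helper names are hypothetical, so treat this as a starting point rather than a complete migration tool.

```python
import base64
import json
import urllib.parse
import urllib.request


def build_export_request(host, token, path, fmt="SOURCE"):
    """Build a GET request that exports one notebook from the source workspace."""
    query = urllib.parse.urlencode({"path": path, "format": fmt})
    url = f"{host.rstrip('/')}/api/2.0/workspace/export?{query}"
    return urllib.request.Request(url, headers={"Authorization": f"Bearer {token}"})


def decode_export_response(body):
    """The export response carries the notebook as base64 in the 'content' field."""
    return base64.b64decode(json.loads(body)["content"])


def build_import_request(host, token, path, source, language="PYTHON"):
    """Build a POST request that imports notebook source into the target workspace."""
    payload = {
        "path": path,
        "format": "SOURCE",
        "language": language,  # assumed to match the language of the exported notebook
        "content": base64.b64encode(source).decode("ascii"),
        "overwrite": True,
    }
    url = f"{host.rstrip('/')}/api/2.0/workspace/import"
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
    return urllib.request.Request(
        url, data=json.dumps(payload).encode("utf-8"), headers=headers, method="POST"
    )
```

Each request can be sent with `urllib.request.urlopen(request)`. For a whole workspace you would loop over the notebook paths, for example from a workspace listing, and export each one from the source host before importing it into the target host.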

Thanks for the response.

A follow-up question related to the export/import of Workflows and Saved queries using REST APIs...

Let's say I have a Databricks workflow that uses saved SQL queries through SQL tasks. The JSON config of this workflow shows that it refers to the saved query by its query ID.
If I use the API to export and import the saved queries and workflows, won't this query ID (which is generated by Databricks) change in the target environment, breaking the query reference in the workflow?
What is the best way to export and import such workflows and queries?
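
One possible way to handle the ID problem, sketched here under assumptions: import the queries first, build a mapping from old query IDs to new ones by matching on query name, and rewrite the workflow JSON before creating the job in the target workspace. The sketch assumes the job settings follow the Jobs API shape where a SQL task stores its reference at `tasks[].sql_task.query.query_id`, and that the query listings have been reduced to dicts with `id` and `name` keys. The function names are hypothetical, and matching by name only works if query names are unique.

```python
import copy


def build_id_map(source_queries, target_queries):
    """Map source query IDs to target query IDs by matching on query name."""
    target_by_name = {q["name"]: q["id"] for q in target_queries}
    return {
        q["id"]: target_by_name[q["name"]]
        for q in source_queries
        if q["name"] in target_by_name
    }


def remap_query_ids(job_settings, id_map):
    """Return a copy of the job settings with SQL task query IDs rewritten."""
    remapped = copy.deepcopy(job_settings)
    for task in remapped.get("tasks", []):
        query = task.get("sql_task", {}).get("query")
        if query is None:
            continue  # not a SQL query task, leave it untouched
        old_id = query["query_id"]
        if old_id not in id_map:
            # Fail loudly instead of creating a job with a dangling reference.
            raise KeyError(f"No target query found for source query ID {old_id}")
        query["query_id"] = id_map[old_id]
    return remapped
```

Other workspace-specific IDs in the same JSON, such as the SQL warehouse ID used by the task, would likely need the same kind of remapping.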
