Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Migrate a workspace from one AWS account to another AWS account

Heisenberg
New Contributor II

Hi everyone,

We have a Databricks workspace in an AWS account that we need to migrate to a new AWS account.

The workspace has a lot of managed tables, workflows, saved queries, and notebooks that need to be migrated, so we are looking for an efficient approach to replicate the complete setup in the new workspace.

If the table data can't be migrated directly because of how managed tables are stored in S3, we can consider using Deep Clone or Delta Sharing to copy the data over.
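To make the Deep Clone idea concrete, here is a minimal sketch that only generates the SQL statements. It assumes three-level Unity Catalog names and that the source tables are readable from wherever the statements run; the function name and the catalog, schema, and table names are made up for illustration.

```python
def deep_clone_statements(tables, source_catalog, target_catalog):
    """Build one DEEP CLONE statement per (schema, table) pair.

    Assumption: both catalogs are visible from the workspace that
    runs the statements. All names here are hypothetical.
    """
    statements = []
    for schema, table in tables:
        source = f"`{source_catalog}`.`{schema}`.`{table}`"
        target = f"`{target_catalog}`.`{schema}`.`{table}`"
        statements.append(
            f"CREATE OR REPLACE TABLE {target} DEEP CLONE {source}"
        )
    return statements


if __name__ == "__main__":
    for stmt in deep_clone_statements(
        [("sales", "orders"), ("sales", "customers")],
        source_catalog="old_catalog",
        target_catalog="new_catalog",
    ):
        print(stmt)
```

In a notebook, each generated statement could then be run with `spark.sql(stmt)`.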

It would be helpful if we could replicate the table structure, workflows, queries, permissions, etc.

Appreciate your suggestions!

Thank you!

 

2 REPLIES

katherine561
New Contributor II

For a streamlined migration of your Databricks workspace from one AWS account to another, start by exporting the notebooks, workflow configurations, and saved queries using the Databricks REST API or CLI. Use Deep Clone or Delta Sharing to transfer the managed table data, and consider AWS DataSync or Glue for large datasets. Import the configurations into the Databricks workspace in the new AWS account, then recreate the permissions, ideally with a script so they match the source. Test thoroughly to confirm that everything works and that permissions are correct, and document the migration process for future reference. Engage Databricks and AWS support for guidance specific to your requirements.
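As a rough illustration of the export step, the sketch below builds a request for the Workspace API export endpoint (`GET /api/2.0/workspace/export`) and decodes the base64 `content` field of its JSON response. The host, token, and notebook path are placeholders, and the request is only constructed here, not sent.

```python
import base64
import urllib.parse
import urllib.request


def build_export_request(host, token, notebook_path, fmt="SOURCE"):
    """Build (but do not send) a notebook export request.

    host, token and notebook_path are placeholders to replace with
    real values from the source workspace.
    """
    query = urllib.parse.urlencode({"path": notebook_path, "format": fmt})
    url = f"{host.rstrip('/')}/api/2.0/workspace/export?{query}"
    return urllib.request.Request(
        url, headers={"Authorization": f"Bearer {token}"}
    )


def decode_export(response_json):
    """Decode the base64 'content' field of an export response."""
    return base64.b64decode(response_json["content"]).decode("utf-8")


if __name__ == "__main__":
    req = build_export_request(
        "https://example.cloud.databricks.com", "<token>", "/Users/me/etl"
    )
    print(req.full_url)
```

Sending the request with `urllib.request.urlopen(req)` and parsing the body as JSON would give the dictionary that `decode_export` expects.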

Thanks for the response.

A follow-up question related to the export/import of Workflows and Saved queries using REST APIs...

Let's say I have a Databricks workflow that uses saved SQL queries in SQL tasks. The JSON config of this workflow shows that it refers to each saved query by its query ID.
If I use the API to export and import the saved queries and workflows, won't this query ID (which is generated by Databricks) change in the target environment, breaking the query reference in the workflow?
What is the best way to export and import such workflows and queries?
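One possible way to handle this, sketched under assumptions rather than as a confirmed best practice: import the saved queries first, build a mapping from old query IDs to new ones (for example by matching query names, which assumes the names are unique), and rewrite the IDs in the exported job settings before creating the job in the target workspace. The sketch assumes the SQL task reference sits at `tasks[].sql_task.query.query_id` in the job JSON; the function name and the IDs below are hypothetical.

```python
import copy


def remap_query_ids(job_settings, id_map):
    """Return a copy of job_settings with saved-query IDs rewritten.

    id_map maps source-workspace query IDs to target-workspace IDs.
    IDs without a mapping are left unchanged and reported back so
    they can be fixed before the job is created.
    """
    remapped = copy.deepcopy(job_settings)
    missing = []
    for task in remapped.get("tasks", []):
        query = task.get("sql_task", {}).get("query")
        if not query:
            continue  # not a SQL task that references a saved query
        old_id = query.get("query_id")
        if old_id in id_map:
            query["query_id"] = id_map[old_id]
        else:
            missing.append(old_id)
    return remapped, missing


if __name__ == "__main__":
    exported = {
        "name": "nightly_report",
        "tasks": [
            {"task_key": "run_query",
             "sql_task": {"query": {"query_id": "old-123"}}},
            {"task_key": "notebook_step",
             "notebook_task": {"notebook_path": "/Shared/etl"}},
        ],
    }
    new_settings, unmapped = remap_query_ids(exported, {"old-123": "new-456"})
    print(new_settings["tasks"][0]["sql_task"]["query"]["query_id"])
    print(unmapped)
```

Returning the unmapped IDs instead of raising keeps the function usable for a dry run across all workflows before anything is imported.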
