Migrate a workspace from one AWS account to another AWS account
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-23-2024 10:18 AM
Hi everyone,
We have a Databricks workspace in an AWS account that we need to migrate to a new AWS account.
The workspace has a lot of managed tables, workflows, saved queries, notebooks which need to be migrated, so looking for an efficient approach to replicate the complete setup in the new workspace.
If table data can't be migrated due to the nature of how managed tables are stored in S3, we can think of using Deep clone or Delta sharing to copy over the data.
What would be helpful is, if we can replicate the table structure, workflows, queries, permissions, etc.
Appreciate your suggestions!
Thank you!
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-25-2024 10:58 PM
For a streamlined migration of your Databricks workspace from one AWS account to another, start by exporting notebook, workflow, and saved query configurations using Databricks REST API or CLI. Employ Deep Clone or Delta Sharing for managed table data transfer, considering AWS DataSync or Glue for large datasets. Import configurations into the new AWS account's Databricks workspace and recreate or script permissions alignment. Test thoroughly, ensuring correct functionality and permissions, and document the migration process for future reference. Engage with Databricks and AWS support for tailored insights based on your specific requirements.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-26-2024 08:58 AM
Thanks for the response.
A follow-up question related to the export/import of Workflows and Saved queries using REST APIs...
Let's say I have a databricks workflow which makes use of Saved SQL queries using SQL tasks. The JSON config of this workflow shows that it refers to the saved query using Query ID.
If I use API to export and import the saved queries and workflows, won't this query ID (which is Databricks generated) change in my target environment breaking the query reference in the workflow?
What is the best way to export and import such workflows and queries?

