Databricks Community

Asterol · ‎07-24-2023

Hey,

I've created a schema with few tables with historical data (prod), now I would like to have a Dev/testing environment with exactly the same data.

What do you recommend? CTAS? Shallow clone? Deep clone? I wonder if shallow clone would be sufficient, what is your experience?

Tharun-Kumar · ‎07-24-2023

@Asterol

If you would like to have the same data for your Dev/testing environment, I would recommend using Deep Clone. Deep clone copies the metadata and creates an independent copy of the table data.

Shallow clone only copies the metadata and will have a pointer to your production data. Shallow clone only helps for short-lived use cases.

If you end goal is to create a Dev/testing/production environment, Deep Clone is the way forward. This would help you in using and maintaining them independently.

Databricks Community

Creating a test schema - what is the best practice?

The Next Wave of Enterprise AI | Webinar

🌟 Community Pulse: Your Weekly Roundup! June 29 – July 05, 2026

📌‌ Complete Your Profile – Help Others Get to Know You

Solution Accelerator Series | Identify Fraud With Geospatial Analytics and AI

Databricks Community Champion - June 2026 - Amira Bedhiafi