cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

When developing a Delta Live Table, can I test my code in the notebook?

User16826992666
Valued Contributor

Just wondering about the dev flow when building Delta Live Tables. I write my code in the notebook so it would be useful to be able to test it out from within that environment.

5 REPLIES 5

Srikanth_Gupta_
Valued Contributor

I don't think Delta Live Table code is interactive, we need to create Pipeline and test the code.

sajith_appukutt
Honored Contributor II

By test , did you mean running them from notebooks ( as against an IDE ) or, unit testing from notebooks.

If it is the first, checkout example notebooks. If it is the latter you could abstract out transformations into separate functions ( that accepts a dataframe and returns a dataframe) and have them unit tested using standard unit testing practices.

aladda
Honored Contributor II
Honored Contributor II

The delta live tables code in a notebook is essentially a template for how you want your data transformed. The Pipelines feature is the execution/operationlization component for that notebook.

With respect to being able to test the notebook, one option could be to setup up a Dev/Test Pipeline that lands the data in a dev/test target database run validation queries either using a different databricks notebook or Databricks SQL and then push it to production with a different pipeline that send data to a production target database

martinez
New Contributor III

Hey ๐Ÿค—

As a fellow developer, I can definitely relate to the question User16826992666 has raised regarding testing code in a notebook when developing a Delta Live Table.

In my experience, notebooks are a fantastic tool for exploratory data analysis and prototyping code. When it comes to testing your code for Delta Live Tables, you can certainly leverage the notebook environment to your advantage. It allows you to execute code cells and observe the results, making it convenient for iterative development.

I would recommend using the notebook to validate your code logic, verify data transformations, and ensure that your Delta Live Table behaves as expected. You can simulate the expected behavior of your live table by creating sample data or using subsets of your actual data. Tutorial here: https://andersenlab.com/services/quality-assurance-services

However, keep in mind that notebooks may not fully replicate the production environment, especially when it comes to handling large-scale data or simulating real-time data updates. So, while the notebook is great for initial testing and debugging, it's essential to conduct thorough testing and performance evaluation in a dedicated development or staging environment before deploying your Delta Live Table to production.

Happy coding and testing! โœ๏ธ

Lakshay
Esteemed Contributor
Esteemed Contributor

You need to create a DLT pipeline to test the code. 

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโ€™t want to miss the chance to attend and share knowledge.

If there isnโ€™t a group near you, start one and help create a community that brings people together.

Request a New Group