Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Apply expectations only if column exists

Hoviedo
New Contributor III

Hi, is there any way to apply an expectation only if that column exists? I am creating multiple DLT tables with the same Python function, so I would like to create different expectations based on the table name. Currently I can only create expectations for columns that exist in all of the tables created by the Python function.

thanks

4 REPLIES

Walter_C
Databricks Employee

To apply expectations only if a column exists in Delta Live Tables (DLT), you can use the @dlt.expect decorator conditionally within your Python function. Here is a step-by-step approach to achieve this:

  1. Check if the Column Exists: Before applying the expectation, check whether the column is present in the DataFrame's schema.
  2. Apply Expectations Conditionally: Apply the @dlt.expect decorator only if the column is present.
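One way to realize these steps is to build a dictionary of expectations from only the columns that actually exist, and apply them all at once. This is a sketch, not a Databricks-verified implementation: the helper name `expectations_for` and the candidate constraints are hypothetical, and only the helper below is plain Python; the DLT wiring is described in the note after the code.

```python
# Sketch: keep a catalog of candidate expectations, each tied to the
# column it requires, and select only those whose column is present.
# The constraints below are illustrative examples, not from the thread.

CANDIDATE_EXPECTATIONS = {
    # expectation name: (required column, SQL constraint)
    "valid_rescued_data": ("_rescued_data", "_rescued_data IS NULL"),
    "distance_positive": ("Distance", "Distance >= 0"),
}

def expectations_for(columns):
    """Return only the expectations whose required column is present."""
    return {
        name: constraint
        for name, (required_col, constraint) in CANDIDATE_EXPECTATIONS.items()
        if required_col in columns
    }
```

At pipeline definition time you could then inspect the source schema and apply every matching expectation in one go with DLT's `dlt.expect_all` decorator, e.g. `@dlt.expect_all(expectations_for(df.columns))`, so the set of expectations varies per table while still being declared at parse time.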

Hoviedo
New Contributor III

Hi Walter, sorry what would it look like in the code?

I can do this in the Python function (but it does nothing):

if "Distance" in df.columns:
    dlt.expect("Distance is positive", "Distance >= 0")
but I am not sure how I can apply the same thing with the decorator.
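For context, a plain-Python sketch (using a toy decorator, not any Databricks API): the `@` syntax is only sugar for calling the decorator as a function, so a decorator such as `dlt.expect("name", "constraint")` can also be applied conditionally after the `def` by calling it directly on the function.

```python
# Sketch: @decorator syntax is sugar for a function call, so any
# decorator can be applied conditionally after the function is defined.

def shout(fn):
    """A toy decorator standing in for dlt.expect(...)."""
    def wrapper(*args, **kwargs):
        return fn(*args, **kwargs).upper()
    return wrapper

def greet():
    return "hello"

# Equivalent to decorating greet with @shout, but done only when wanted:
use_decorator = True
if use_decorator:
    greet = shout(greet)
```

In the DLT case the same pattern would look like `read_changes_from_adl2 = dlt.expect("distance is positive", "distance >= 0")(read_changes_from_adl2)`, guarded by the column check, before the function is handed to the pipeline.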

Hoviedo
New Contributor III
def get_changes_from_raw(table_name):
    @dlt.table(
        name=f"{table_name}_changes",
        comment=f"New {table_name} data incrementally ingested from cloud object storage landing zone",
    )
    @dlt.expect("valid_rescued_data", "_rescued_data is null")
    def read_changes_from_adl2():
        df = spark.read...

        if "distance" in df.columns:
            dlt.expect("distance is positive", "distance >= 0")

        return df

Fevemania
New Contributor II

Hi @Walter_C ,

I wonder how the step-by-step approach works.

As far as I know, decorators (@dlt.expect) are processed when the function code is first read and the pipeline's structure is being defined: the parsing phase. But the check for whether the column exists happens while Spark is actually processing data: the execution phase. As you can see below, if the column doesn't exist, the Lakeflow Declarative Pipeline fails at the initialization phase.
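The timing difference can be demonstrated in plain Python (a sketch with a toy decorator, independent of DLT): a decorator body runs the moment the function is defined, which is why anything expressed as a decorator is evaluated during the pipeline's parsing phase, before any data is read.

```python
# Sketch: decorators execute at function *definition* time, not call
# time, which is why @dlt.expect is evaluated while the pipeline is
# being parsed rather than while data is being processed.

definition_time_log = []

def record(fn):
    definition_time_log.append(fn.__name__)  # runs at definition time
    return fn

@record
def my_table():
    return "rows"

# definition_time_log already contains "my_table" even though
# my_table() has never been called.
```

This is also why a column check placed inside the table function's body comes too late to influence which expectations the pipeline registers: the decorators have already run by then.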

Screenshot 2025-07-11 at 11.54.08 AM.png

Screenshot 2025-07-11 at 11.55.40 AM.png




The method proposed by @Hoviedo might not work well: when I tried the same thing, I found that the expectation metrics in the UI disappear.
Before (using the decorator):

Screenshot 2025-07-11 at 11.27.06 AM.png


After (using the non-decorator approach):

Screenshot 2025-07-11 at 11.29.40 AM.png
