cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
cancel
Showing results for 
Search instead for 
Did you mean: 

How to enable CDF when saveAsTable from pyspark code?

drag7ter
New Contributor III

I'm running this code in databricks notebook and I want the table from dataframe in catalog were created with CDF enables. When I run the code table hasn't exited yet.

This code doesn't create a table with enables CDF. It doesn't add:

delta.enableChangeDataFeed = true
 df \
.write \
.format("delta") \
.mode("overwrite") \
.option("overwriteSchema", "true") \
.option("delta.enableChangeDataFeed", "true") \
.saveAsTable(my_table_name_in_catalog): 

 Or it is possible only enable it via Spark SQL, after table already exists?

1 ACCEPTED SOLUTION

Accepted Solutions

raphaelblg
Contributor III
Contributor III

Hello @drag7ter ,

I don't see anything wrong with your approach, check my repro:

raphaelblg_0-1712936628123.png

 

 

Best regards,

Raphael Balogo
Sr. Technical Solutions Engineer
Databricks

View solution in original post

2 REPLIES 2

raphaelblg
Contributor III
Contributor III

Hello @drag7ter ,

I don't see anything wrong with your approach, check my repro:

raphaelblg_0-1712936628123.png

 

 

Best regards,

Raphael Balogo
Sr. Technical Solutions Engineer
Databricks

drag7ter
New Contributor III

Hi @raphaelblg in your screenshot cdcTest table already exists in unity catalog, or it will be created for the first time, running your code? It is really strange as I'm not able to enable from the pyspark code, only after tables created ALTER TABLES works.

"delta.enableChangeDataFeed", "true"