Databricks Community

561064 · ‎12-01-2023

Process to export a delta table is taking ~2hrs.

Delta table has 66 partitions with total size of ~6gb, 4million rows and 270 columns.

Used below command

df.coalesce(1).write.csv("path")

what are my options to reduce the time?

Dribka · ‎12-02-2023

A very interesting task in front of you.... let me know how you solve it!

561064 · ‎12-04-2023

Hi Kainz,

None of the options I tried helped as the challenge is not reading but writing it to a one CSV file. df.repartition(numFiles).write.csv("path") has consumed the same amount of time as 'df.coalesce(1).write.csv("path")' in my case.

any other options I can explore?

Databricks Community

Exporting delta table to one CSV

Connect with Databricks Users in Your Area

Databricks Learning Festival (Virtual): 15 January - 31 January 2025

Milestone: DatabricksTV Reaches 100 Videos!

Announcing the new Meta Llama 3.3 model on Databricks

Databricks Community Champion - December 2024 - Sujesh Menon

Dotmatics and Databricks Partner to Advance Scientific Intelligence in Life Sciences