Databricks Community

databicky · ‎01-22-2023

i am loading the 1billion data from spark dataframe into target table, but in the 7.3 cluster it takes 3 hours to complete but after migrated to 10.4 cluster its taking 8 hours to complete , how can i reduce the time duration

Debayan · ‎01-24-2023

Hi, Please refer https://docs.databricks.com/clusters/cluster-config-best-practices.html for best practises for cluster configurations. Please let us know if this helps.

jose_gonzalez · ‎01-24-2023

Hi @Mohammed sadamusean,

Could you provide more details on what are you doing? What type of transformations/actions are you doing? whats your source and sink? batch or streaming? all that information will help.

databicky · ‎01-24-2023

i have data in adls, i load thise data into multiple dataframes in the databricks notebook, from the final dataframe i am loading data into final target table based on the dataframe tempview, usually it takes 3 in 7.3 cluster but in 10.4 cluster it take around 8 hours , 1 billion records is there