cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Writing data to RDS table taking more time

Policepatil
New Contributor III

Hi, 

Cluster Configuration details:

Policepatil_0-1693458570195.png

RDS Configuration Details:

Policepatil_1-1693458629845.png

I have 30 files, each file having 540000 records

I read all files and created one dataframe.

When i write dataframe(16,200,000 records) to a table it take more time nearly more than 1 hour (sometime it will fail saying "Connection time out error")

When i read all 30 files in multithreading and write dataframes to table (30 threads, 30 dataframes, each dataframe having 540000 records) it takes nearly 30 minutes without any error.

I want understand why writing one dataframe takes more time?

 

 

0 REPLIES 0

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now