Topics with Label: Garbage Collection

Forum Posts

by aschiff • Contributor II

09-23-2022 12:43:14 PM

725169 Views
33 replies
5 kudos

GC Driver Error

I am using a cluster in Databricks to connect to a Tableau workbook through the JDBC connector. My Tableau workbook has been unable to load because resources are not available through the data connection. I went to look at the driver log for my clus...

Data Engineering

Latest Reply

galang123
New Contributor II

07-30-2024 6:49:54 AM

5 kudos

yesasd

32 More Replies

by nolanlavender00 • New Contributor

02-23-2023 3:46:34 PM

1855 Views
1 reply
0 kudos

Garbage Collection on AutoLoader

Once a week, I get very long run times with AutoLoader. The Spark job says it is done, but garbage collection keeps rising on the driver. I assume this is because of the backfill interval that I am using with FileNotification Type. I have this set to...

Data Engineering

Latest Reply

Anonymous
Not applicable

04-24-2023 3:26:01 AM

0 kudos

Hi @nolanlavender008 Hope everything is going great. Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us...

by nolanlavender00 • New Contributor

02-10-2023 11:39:10 AM

7001 Views
2 replies
0 kudos

How to control garbage collection while using Autoloader File Notification?

I am using Autoloader to load files from a directory. I have set up File Notification with the Event Subscription. I have a backfill interval set to 1 day and have not run the stream for a week. There should be only about 100 new files to pick up an...

Data Engineering

Latest Reply

Anonymous
Not applicable

04-10-2023 12:23:03 AM

0 kudos

Hi @nolanlavender008 Thank you for posting your question in our community! We are happy to assist you. To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answ...

1 More Replies

by HariharaSam • Contributor

09-15-2022 6:47:57 AM

3129 Views
3 replies
0 kudos

DRIVER Garbage Collection

Does anyone know how to fix this?

Data Engineering

Latest Reply

Anonymous
Not applicable

09-28-2022 1:06:32 AM

0 kudos

Hi @Hariharan Sambath Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you....

2 More Replies

by sanchit_popli • New Contributor II

08-22-2022 2:43:15 PM

1882 Views
0 replies
0 kudos

How can I process 3.5 GB GZ (~90 GB) nested JSON files and convert them to tabular format with less processing time and optimized cost in Azure Databricks?

I have a total of 5000 files (nested JSON, ~3.5 GB). I have written code that converts the JSON to a table in minutes (for JSON sizes up to 1 GB), but when I try to process the 3.5 GB GZ JSON, it mostly fails because of garbage collection. ...

Data Engineering

by User16826994223 • Databricks Employee

06-22-2021 6:08:09 AM

4741 Views
2 replies
0 kudos

Resolved! Garbage Collection optimization

I have a case where garbage collection is taking a long time, and I want to optimize it for better performance.

Data Engineering

Latest Reply

sean_owen
Databricks Employee

06-22-2021 9:06:59 AM

0 kudos

You can also tune the JVM's GC parameters directly, if you mean the pauses are too long. Set "spark.executor.extraJavaOptions", but it does require knowing a thing or two about how to tune for what performance goal.
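
For anyone wondering what that setting might look like in practice, here is a rough sketch in Python that assembles a value for "spark.executor.extraJavaOptions". The helper names (`build_gc_options`, `spark_conf_entries`) are made up for illustration, and the flag values (G1 collector, 200 ms pause target, 35% initiating occupancy) are placeholder assumptions rather than recommendations from the reply above. The right values depend on your workload and performance goal.

```python
# Sketch only: build a JVM GC option string for Spark executors.
# The specific flag values are illustrative assumptions, not tuned advice.

def build_gc_options(max_pause_ms: int = 200,
                     initiating_occupancy_pct: int = 35) -> str:
    """Return a space-separated string of HotSpot G1 GC flags."""
    flags = [
        "-XX:+UseG1GC",                                   # select the G1 collector
        f"-XX:MaxGCPauseMillis={max_pause_ms}",           # pause-time target (a goal, not a guarantee)
        f"-XX:InitiatingHeapOccupancyPercent={initiating_occupancy_pct}",  # when concurrent marking starts
        "-verbose:gc",                                    # log GC events so the effect can be checked
    ]
    return " ".join(flags)


def spark_conf_entries(gc_options: str) -> dict:
    """Map the option string to the Spark property named in the reply."""
    return {"spark.executor.extraJavaOptions": gc_options}


conf = spark_conf_entries(build_gc_options())

# Print in "key value" form, one property per line.
for key, value in conf.items():
    print(f"{key} {value}")
```

Because these are JVM startup options, they would need to be in place before the executors launch (for example in the cluster's Spark configuration or via `--conf` at submit time); changing them in a running session should not be expected to take effect.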

1 More Replies

Databricks Community
