Topics with Label: Memory management

by SaravananPalani • New Contributor II

08-23-2018 4:08:35 AM

18124 Views
8 replies
9 kudos

Is there any way to monitor the CPU, disk and memory usage of a cluster while a job is running?

I am looking for something preferably similar to Windows task manager which we can use for monitoring the CPU, memory and disk usage for local desktop.

Data Engineering

18124 Views
8 replies
9 kudos

08-23-2018 4:08:35 AM

View Replies

Latest Reply

hitech88
New Contributor II

02-04-2023 11:57:28 AM

9 kudos

Some important info to look in Gangalia UI in CPU, memory and server load charts to spot the problem:CPU chart :User %Idle %High percentage of user % indicates heavy CPU usage in the cluster.Memory chart : Use %Free %Swap % If you see purple line ove...

9 kudos

02-04-2023 11:57:28 AM

7 More Replies

by Michael_Galli • Contributor II

04-22-2022 3:00:10 AM

2210 Views
1 replies
1 kudos

Resolved! Pipelines with alot of Spark Caching - best practices for cleanup?

We have the situation where many concurrent Azure Datafactory Notebooks are running in one single Databricks Interactive Cluster (Azure E8 Series Driver, 1-10 E4 Series Drivers autoscaling).Each notebook reads data, does a dataframe.cache(), just to ...

Data Engineering

2210 Views
1 replies
1 kudos

04-22-2022 3:00:10 AM

View Replies

Latest Reply

Hubert-Dudek
Esteemed Contributor III

04-22-2022 3:16:05 AM

1 kudos

This cache is dynamically saved to disk if there is no place in memory. So I don't see it as an issue. However, the best practice is to use "unpersist()" method in your code after caching. As in the example below, my answer, the cache/persist method ...

1 kudos

04-22-2022 3:16:05 AM