How to incorporate these GC options into my Databricks cluster? (spark.executor.extraJavaOptions)
12-16-2022 07:49 AM
I want to try incorporating these options into my Databricks cluster.
spark.driver.extraJavaOptions -XX:+UseG1GC -XX:+G1SummarizeConcMark
spark.executor.extraJavaOptions -XX:+UseG1GC -XX:+G1SummarizeConcMark
If I put them under Compute -> Cluster -> Configuration -> Advanced Options -> Spark -> Spark config, my cluster fails to boot.
If I add them to Compute -> Cluster -> Configuration -> Advanced Options -> Spark -> Environment variables, the UI says it's an illegal key-value pair and won't proceed.
- Labels: Cluster, Databricks Cluster, Options
12-17-2022 10:47 PM
Hey @Andrew Fogarty, I think these options are only for the spark-submit command, not for the cluster UI.
Please have a look at this doc - http://progexc.blogspot.com/2014/12/spark-configuration-mess-solved.html
spark.executor.extraJavaOptions
A string of extra JVM options to pass to executors. This is intended to be set by users. For instance, GC settings or other logging. Note that it is illegal to set Spark properties or maximum heap size (-Xmx) settings with this option. Spark properties should be set using a SparkConf object or the spark-defaults.conf file used with the spark-submit script. Maximum heap size settings can be set with spark.executor.memory. The following symbols, if present, will be interpolated: {{APP_ID}} will be replaced by application ID and {{EXECUTOR_ID}} will be replaced by executor ID. For example, to enable verbose gc logging to a file named for the executor ID of the app in /tmp, pass a 'value' of:
-verbose:gc -Xloggc:/tmp/{{APP_ID}}-{{EXECUTOR_ID}}.gc
spark.executor.defaultJavaOptions will be prepended to this configuration.
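To make the two rules in that description concrete, here is a rough Python sketch. It is not Spark's own code. The function names `check_java_options` and `render_java_options` and the sample application and executor IDs are made up for illustration. The placeholders `{{APP_ID}}` and `{{EXECUTOR_ID}}` are the ones named in the Spark configuration docs. The sketch rejects the values the description calls illegal, then fills in the placeholders the way the description says they are interpolated.

```python
# Placeholders named in the Spark configuration docs for extraJavaOptions.
APP_ID = "{{APP_ID}}"
EXECUTOR_ID = "{{EXECUTOR_ID}}"


def check_java_options(options: str) -> None:
    """Reject the kinds of values the quoted description calls illegal.

    Hypothetical helper for illustration; Spark performs its own validation.
    """
    for token in options.split():
        if token.startswith("-Xmx"):
            raise ValueError("use spark.executor.memory instead of " + token)
        if token.startswith("-Dspark."):
            raise ValueError("set Spark properties via SparkConf, not " + token)


def render_java_options(options: str, app_id: str, executor_id: str) -> str:
    """Mimic the placeholder interpolation described above (illustrative only)."""
    check_java_options(options)
    return options.replace(APP_ID, app_id).replace(EXECUTOR_ID, executor_id)


template = "-verbose:gc -Xloggc:/tmp/{{APP_ID}}-{{EXECUTOR_ID}}.gc"
# Sample IDs below are invented for the example.
print(render_java_options(template, "app-20221217-0001", "3"))
```

With the sample IDs above, each executor would get its own GC log file name under /tmp, which is the point of the placeholders.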
You can explore more here: https://spark.apache.org/docs/latest/configuration.html
Thanks
Aviral