โ05-05-2022 11:23 PM
Hi,
I am executing a simple job in Databricks for which I am getting below error. I increased the Driver size still I faced same issue.
Spark config :
from pyspark.sql import SparkSession
spark_session = SparkSession.builder.appName("Demand Forecasting").config("spark.yarn.executor.memoryOverhead", 2048).getOrCreate()
Driver and worker node type -r5.2xlarge
10 worker nodes.
Error Log:
Caused by: org.apache.spark.sql.execution.OutOfMemorySparkException: Size of broadcasted table far exceeds estimates and exceeds limit of spark.driver.maxResultSize=4294967296.
โ06-02-2022 01:51 AM
Hi @Kaniz Fatmaโ ,
Switching the runtime version to 10.4 fixed the issue for me.
Thanks,
Chandan
โ05-05-2022 11:54 PM
looking at the error message you try to broadcast a large table. Remove the broadcast statement on the large table and you will be fine.
โ05-08-2022 12:05 PM
HI @Werner Stinckensโ ,
I am getting the above issue while writing a Spark DF as a parquet file to AWS S3. Not doing any broadcast join actually.
Thanks,
Chandan
โ05-06-2022 09:04 AM
In my opinion on databricks, you don't need to specify (spark_session = SparkSession.builder.appName("Demand Forecasting").config("spark.yarn.executor.memoryOverhead", 2048).getOrCreate()) and rest is as @Werner Stinckensโ said
โ05-08-2022 12:05 PM
I am getting the above issue while writing a Spark DF as a parquet file to AWS S3. Not doing any broadcast join actually.
โ05-11-2022 06:25 AM
As Hubert mentioned: you should not create a spark session on databricks, it is provided.
The fact you do not broadcast manually makes me think Spark uses a broadcastjoin.
There is a KB about issues with that:
https://kb.databricks.com/sql/bchashjoin-exceeds-bcjointhreshold-oom.html
Can you check if it is applicable?
โ06-02-2022 01:51 AM
Hi @Kaniz Fatmaโ ,
Switching the runtime version to 10.4 fixed the issue for me.
Thanks,
Chandan
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโt want to miss the chance to attend and share knowledge.
If there isnโt a group near you, start one and help create a community that brings people together.
Request a New Group