Data Engineering

Getting broadcast join errors

jose_gonzalez
Moderator

I would like to know how to disable broadcast join in my job to avoid this error message. Is there a Spark configuration?

1 ACCEPTED SOLUTION


jose_gonzalez
Moderator

You can disable broadcast join by adding the following Spark configuration to your notebook:

spark.conf.set("spark.sql.autoBroadcastJoinThreshold", -1)

You can also add this configuration to your cluster:

spark.sql.autoBroadcastJoinThreshold -1

This lets you disable broadcast join at either the notebook level or the cluster level.
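For the notebook-level setting, here is a minimal sketch of how the change could be wrapped so the previous value can be restored later. The helper names `disable_auto_broadcast` and `restore_auto_broadcast` are hypothetical (they are not part of Spark), and the sketch assumes `spark` is the session object that a Databricks notebook already provides. The value is passed as the string "-1", which should behave the same as the integer form shown above.

```python
# Hypothetical helpers, not part of the Spark API. They only call
# spark.conf.get / spark.conf.set on the session that is passed in.

BROADCAST_KEY = "spark.sql.autoBroadcastJoinThreshold"


def disable_auto_broadcast(spark):
    """Disable automatic broadcast joins for this session.

    Returns the previous threshold so it can be restored later.
    """
    previous = spark.conf.get(BROADCAST_KEY)
    # -1 turns off automatic broadcasting; passed as a string here.
    spark.conf.set(BROADCAST_KEY, "-1")
    return previous


def restore_auto_broadcast(spark, previous):
    """Put back the threshold returned by disable_auto_broadcast."""
    spark.conf.set(BROADCAST_KEY, previous)
```

In a notebook you could call `previous = disable_auto_broadcast(spark)` before the failing job and `restore_auto_broadcast(spark, previous)` afterwards. Note that this threshold only controls automatic broadcasting: a join that carries an explicit broadcast hint can still be planned as a broadcast join.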

