Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Is the limit per "table/dataframe" or for all tables/dataframes put together? The driver collects the data from all executors (those that hold the respective table or DataFrame) and distributes it to all executors. When will the memory be released in bo...
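For a Spark SQL broadcast join the cleanup is automatic (Spark's context cleaner drops the broadcast once it is no longer referenced), but with explicit broadcast variables the caller can release the memory directly. A minimal PySpark sketch; the lookup data is hypothetical:

# Hypothetical lookup data; explicit broadcast variables can be released manually,
# unlike the broadcasts Spark creates internally for SQL broadcast joins.
lookup = spark.sparkContext.broadcast({"a": 1, "b": 2})

# ... use lookup.value inside tasks ...

lookup.unpersist()  # drop the cached copies on the executors
lookup.destroy()    # also release the driver-side copy; lookup is unusable afterwards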
I have a statement like this with PySpark:

target_tbl.alias("target") \
    .merge(stage_df.hint("broadcast").alias("source"), merge_join_expr) \
    .whenMatchedUpdateAll() \
    .whenNotMatchedInsertAll() \
    .w...
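The rest of the chain is cut off above. For reference, a minimal self-contained sketch of a complete version of this merge, assuming the chain finishes with execute() and that the table name and join condition are placeholders:

from delta.tables import DeltaTable

target_tbl = DeltaTable.forName(spark, "dev.silver.target")  # hypothetical table name
merge_join_expr = "target.id = source.id"                    # hypothetical join key

(target_tbl.alias("target")
    .merge(stage_df.hint("broadcast").alias("source"), merge_join_expr)
    .whenMatchedUpdateAll()
    .whenNotMatchedInsertAll()
    .execute())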
Hi @Nikolay Chalkanov, Thank you for posting your question in our community! We are happy to assist you. To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best ans...
I am trying to do a streaming merge between Delta tables using this guide - https://docs.delta.io/latest/delta-update.html#upsert-from-streaming-queries-using-foreachbatch

Our Code Sample (Java):

Dataset<Row> sourceDf = sparkSession
...
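The Java sample above is truncated. To keep the examples in one language, here is a minimal PySpark sketch of the same foreachBatch upsert pattern from the linked guide; the paths and the join key are placeholder assumptions:

from delta.tables import DeltaTable

def upsert_to_delta(micro_batch_df, batch_id):
    # Merge one micro-batch of source rows into the target Delta table.
    target = DeltaTable.forPath(spark, "/tmp/delta/target")  # hypothetical path
    (target.alias("t")
        .merge(micro_batch_df.alias("s"), "t.id = s.id")     # hypothetical key
        .whenMatchedUpdateAll()
        .whenNotMatchedInsertAll()
        .execute())

(spark.readStream.format("delta").load("/tmp/delta/source")  # hypothetical path
    .writeStream
    .foreachBatch(upsert_to_delta)
    .outputMode("update")
    .start())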
Yes, there are a couple of limitations. Please find the details below:
> It will not perform a broadcast join if the table has 512 million or more rows
> It will not perform a broadcast join if the table is larger than 8 GB
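Given those hard limits, one defensive pattern is to apply the broadcast hint only when the source is known to be small enough, so oversized sources fall back to Spark's default join strategy instead of failing. A minimal sketch; stage_df, target_tbl, merge_join_expr, and the row-count guard are illustrative assumptions:

# Guard the hint against the 512-million-row limit; note that count()
# itself triggers a Spark job on the source DataFrame.
row_count = stage_df.count()
source = stage_df.hint("broadcast") if row_count < 512_000_000 else stage_df

(target_tbl.alias("target")
    .merge(source.alias("source"), merge_join_expr)
    .whenMatchedUpdateAll()
    .whenNotMatchedInsertAll()
    .execute())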