<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Executor OOM Error with AQE enabled in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/executor-oom-error-with-aqe-enabled/m-p/118676#M45679</link>
    <description>&lt;P&gt;no, we dont want to remove broadcast hint, as it works without problems in DBR 10.4, and there is a lot of memory availble for it.&lt;/P&gt;</description>
    <pubDate>Fri, 09 May 2025 12:06:01 GMT</pubDate>
    <dc:creator>alsetr</dc:creator>
    <dc:date>2025-05-09T12:06:01Z</dc:date>
    <item>
      <title>Executor OOM Error with AQE enabled</title>
      <link>https://community.databricks.com/t5/data-engineering/executor-oom-error-with-aqe-enabled/m-p/118672#M45676</link>
      <description>&lt;P&gt;We have Databricks Spark Job. After migration from Databricks Runtime 10.4 to 15.4 one of our Spark jobs which uses broadcast hint started to fail with error:&lt;/P&gt;&lt;P&gt;```&lt;BR /&gt;ERROR Executor: Exception in task 2.0 in stage 371.0 (TID 16912)&lt;BR /&gt;org.apache.spark.memory.SparkOutOfMemoryError: [EXECUTOR_BROADCAST_JOIN_OOM] There is not enough memory to build the broadcast relation LongToUnsafeRowMap. Relation Size = 1462.4 MiB. Total memory used by this task = 1526.4 MiB. Executor Memory Manager Metrics: onHeapExecutionMemoryUsed = 2.4 GiB, offHeapExecutionMemoryUsed = 0.0 B, onHeapStorageMemoryUsed = 472.5 MiB, offHeapStorageMemoryUsed = 0.0 B. [sparkPlanId: Some(44226)] SQLSTATE: 53200&lt;BR /&gt;```&lt;BR /&gt;This job fails regardless resources we use, it fails even with Standard_D8s_v3 worker nodes, which has 32GB RAM.&lt;BR /&gt;Also before the error we have log message which show that there is enough memory.&lt;BR /&gt;```&lt;BR /&gt;INFO MemoryStore: Block broadcast_188 stored as values in memory (estimated size 359.3 KiB, free 24.0 GiB)&lt;BR /&gt;```&lt;/P&gt;&lt;P&gt;Looks like this is Adaptive Query Execution issue, as disabling it solves the problem.&lt;/P&gt;&lt;P&gt;Could anybody advise how to overcome this issue without disabling AQE?&lt;/P&gt;</description>
      <pubDate>Fri, 09 May 2025 11:22:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/executor-oom-error-with-aqe-enabled/m-p/118672#M45676</guid>
      <dc:creator>alsetr</dc:creator>
      <dc:date>2025-05-09T11:22:07Z</dc:date>
    </item>
    <item>
      <title>Re: Executor OOM Error with AQE enabled</title>
      <link>https://community.databricks.com/t5/data-engineering/executor-oom-error-with-aqe-enabled/m-p/118673#M45677</link>
      <description>&lt;P&gt;have you tried removing the broadcast hint?&lt;BR /&gt;In recent versions of databricks runtime a lot of optimizations have been added.&lt;BR /&gt;Also: do you use the photon engine?&lt;/P&gt;</description>
      <pubDate>Fri, 09 May 2025 11:54:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/executor-oom-error-with-aqe-enabled/m-p/118673#M45677</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2025-05-09T11:54:53Z</dc:date>
    </item>
    <item>
      <title>Re: Executor OOM Error with AQE enabled</title>
      <link>https://community.databricks.com/t5/data-engineering/executor-oom-error-with-aqe-enabled/m-p/118676#M45679</link>
      <description>&lt;P&gt;no, we dont want to remove broadcast hint, as it works without problems in DBR 10.4, and there is a lot of memory availble for it.&lt;/P&gt;</description>
      <pubDate>Fri, 09 May 2025 12:06:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/executor-oom-error-with-aqe-enabled/m-p/118676#M45679</guid>
      <dc:creator>alsetr</dc:creator>
      <dc:date>2025-05-09T12:06:01Z</dc:date>
    </item>
    <item>
      <title>Re: Executor OOM Error with AQE enabled</title>
      <link>https://community.databricks.com/t5/data-engineering/executor-oom-error-with-aqe-enabled/m-p/118677#M45680</link>
      <description>&lt;P&gt;ok, that is up to you.&lt;/P&gt;&lt;P&gt;An executor will not be able to take all the ram.&lt;BR /&gt;you can try to work with the spark.executor parameters.&lt;/P&gt;</description>
      <pubDate>Fri, 09 May 2025 12:12:18 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/executor-oom-error-with-aqe-enabled/m-p/118677#M45680</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2025-05-09T12:12:18Z</dc:date>
    </item>
    <item>
      <title>Re: Executor OOM Error with AQE enabled</title>
      <link>https://community.databricks.com/t5/data-engineering/executor-oom-error-with-aqe-enabled/m-p/123420#M47004</link>
      <description>&lt;P&gt;I found similar issue&lt;BR /&gt;&lt;A href="https://kb.databricks.com/python/job-fails-with-not-enough-memory-to-build-the-hash-map-error" target="_blank"&gt;https://kb.databricks.com/python/job-fails-with-not-enough-memory-to-build-the-hash-map-error&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Looks like the reason of error is a bug in new Databricks feature which is called&amp;nbsp;&lt;SPAN&gt;executor-side broadcast (ebj, executor broadcast join) which was introduced in DBR 11.3.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Unfortunately could not find the way how to disable this feature, so keeping AQE disable so far.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 01 Jul 2025 11:06:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/executor-oom-error-with-aqe-enabled/m-p/123420#M47004</guid>
      <dc:creator>alsetr</dc:creator>
      <dc:date>2025-07-01T11:06:16Z</dc:date>
    </item>
  </channel>
</rss>

