<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Job Cluster Reuse in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/job-cluster-reuse/m-p/144237#M52292</link>
    <description>&lt;P&gt;Adding link to doc for reference:&lt;BR /&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/jobs/compute#share-compute-across-tasks" target="_self"&gt;https://learn.microsoft.com/en-us/azure/databricks/jobs/compute#share-compute-across-tasks&lt;/A&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 16 Jan 2026 12:23:25 GMT</pubDate>
    <dc:creator>SparkMan</dc:creator>
    <dc:date>2026-01-16T12:23:25Z</dc:date>
    <item>
      <title>Job Cluster Reuse</title>
      <link>https://community.databricks.com/t5/data-engineering/job-cluster-reuse/m-p/144231#M52290</link>
      <description>&lt;P&gt;Hi, I have a job where a job cluster is reused twice for task A and task C. Between A and C, task B runs for 4 hours on a different interactive cluster. The issue here is that the job cluster doesn't terminate as soon as Task A is completed and sits idle for 4 hrs. Is this expected behaviour when reusing a job cluster? Do we need to make sure that job cluster is reused only for consecutive tasks?&lt;/P&gt;&lt;P&gt;TASK A -&amp;gt; TASK B -&amp;gt; TASK C&lt;/P&gt;&lt;P&gt;More on job cluster reuse:&lt;/P&gt;&lt;P&gt;&lt;A title="https://community.databricks.com/t5/technical-blog/maximizing-resource-utilisation-with-cluster-reuse/ba-p/64331" href="https://community.databricks.com/t5/technical-blog/maximizing-resource-utilisation-with-cluster-reuse/ba-p/64331" target="_blank" rel="noreferrer noopener"&gt;https://community.databricks.com/t5/technical-blog/maximizing-resource-utilisation-with-cluster-reu…&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV class=""&gt;&amp;nbsp;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV class=""&gt;&amp;nbsp;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Jan 2026 11:16:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/job-cluster-reuse/m-p/144231#M52290</guid>
      <dc:creator>SparkMan</dc:creator>
      <dc:date>2026-01-16T11:16:19Z</dc:date>
    </item>
    <item>
      <title>Re: Job Cluster Reuse</title>
      <link>https://community.databricks.com/t5/data-engineering/job-cluster-reuse/m-p/144236#M52291</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/174905"&gt;@SparkMan&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;This is expected behavior with Databricks job cluster reuse unless you change your job/task configuration. Look at following documentation entry:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="szymon_dybczak_0-1768565474054.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/23030i9429B28BB825E869/image-size/medium?v=v2&amp;amp;px=400" role="button" title="szymon_dybczak_0-1768565474054.png" alt="szymon_dybczak_0-1768565474054.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;So with your flow you have something like this:&lt;/P&gt;&lt;P&gt;Task A (job cluster) → Task B (interactive cluster) → Task C (job cluster)&lt;/P&gt;&lt;P&gt;If Task A and Task C share the same job cluster, the cluster will stay alive and idle during Task B’s execution - because Databricks doesn’t consider the job cluster “done” until all tasks that refer to it (A and C) have run. That’s why you see ~4 hours of idle time&lt;/P&gt;</description>
      <pubDate>Fri, 16 Jan 2026 12:12:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/job-cluster-reuse/m-p/144236#M52291</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2026-01-16T12:12:07Z</dc:date>
    </item>
    <item>
      <title>Re: Job Cluster Reuse</title>
      <link>https://community.databricks.com/t5/data-engineering/job-cluster-reuse/m-p/144237#M52292</link>
      <description>&lt;P&gt;Adding link to doc for reference:&lt;BR /&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/jobs/compute#share-compute-across-tasks" target="_self"&gt;https://learn.microsoft.com/en-us/azure/databricks/jobs/compute#share-compute-across-tasks&lt;/A&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Jan 2026 12:23:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/job-cluster-reuse/m-p/144237#M52292</guid>
      <dc:creator>SparkMan</dc:creator>
      <dc:date>2026-01-16T12:23:25Z</dc:date>
    </item>
  </channel>
</rss>

