<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Autoscaling with the autoloader without SDP in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/autoscaling-with-the-autoloader-without-sdp/m-p/156637#M54459</link>
    <description>&lt;P&gt;Hi there,&lt;/P&gt;&lt;P&gt;I have a question regarding the autoloader without SDP and auto-scaling of clusters. I'm reading the following in the docs:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;A href="https://docs.databricks.com/aws/en/structured-streaming/production" target="_blank" rel="noopener"&gt;Production considerations for Structured Streaming | Databricks on AWS:&lt;/A&gt;&lt;BR /&gt;&lt;EM&gt;Do not enable autoscaling for compute for&amp;nbsp;Structured Streaming&amp;nbsp;jobs.&lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;A style="font-family: inherit; background-color: #ffffff;" href="https://docs.databricks.com/aws/en/ingestion/cloud-object-storage/auto-loader/production" target="_blank" rel="noopener"&gt;Configure Auto Loader for production workloads | Databricks on AWS:&lt;/A&gt;&amp;nbsp;&lt;EM&gt;Enhanced autoscaling implements optimization of streaming workloads and adds enhancements to improve the performance of batch workloads. Enhanced autoscaling optimizes costs by adding or removing machines as the workload changes.&lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;But also:&amp;nbsp;&lt;EM&gt;Compute auto-scaling has limitations when scaling down cluster size for structured streaming workloads. Databricks recommends using&amp;nbsp;&lt;A class="" href="https://docs.databricks.com/aws/en/ldp/" target="_blank" rel="noopener"&gt;Lakeflow Spark Declarative Pipelines&lt;/A&gt;&amp;nbsp;with&amp;nbsp;&lt;A class="" href="https://docs.databricks.com/aws/en/ldp/auto-scaling" target="_blank" rel="noopener"&gt;enhanced autoscaling&lt;/A&gt;&amp;nbsp;for streaming workloads.&lt;/EM&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;SPAN&gt;We don't use SDP because of serverless limitations. Is it not advised to use enhanced autoscaling for non-SDP jobs? And why is that?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 12 May 2026 06:45:22 GMT</pubDate>
    <dc:creator>HTD360</dc:creator>
    <dc:date>2026-05-12T06:45:22Z</dc:date>
    <item>
      <title>Autoscaling with the autoloader without SDP</title>
      <link>https://community.databricks.com/t5/data-engineering/autoscaling-with-the-autoloader-without-sdp/m-p/156637#M54459</link>
      <description>&lt;P&gt;Hi there,&lt;/P&gt;&lt;P&gt;I have a question regarding the autoloader without SDP and auto-scaling of clusters. I'm reading the following in the docs:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;A href="https://docs.databricks.com/aws/en/structured-streaming/production" target="_blank" rel="noopener"&gt;Production considerations for Structured Streaming | Databricks on AWS:&lt;/A&gt;&lt;BR /&gt;&lt;EM&gt;Do not enable autoscaling for compute for&amp;nbsp;Structured Streaming&amp;nbsp;jobs.&lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;A style="font-family: inherit; background-color: #ffffff;" href="https://docs.databricks.com/aws/en/ingestion/cloud-object-storage/auto-loader/production" target="_blank" rel="noopener"&gt;Configure Auto Loader for production workloads | Databricks on AWS:&lt;/A&gt;&amp;nbsp;&lt;EM&gt;Enhanced autoscaling implements optimization of streaming workloads and adds enhancements to improve the performance of batch workloads. Enhanced autoscaling optimizes costs by adding or removing machines as the workload changes.&lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;But also:&amp;nbsp;&lt;EM&gt;Compute auto-scaling has limitations when scaling down cluster size for structured streaming workloads. Databricks recommends using&amp;nbsp;&lt;A class="" href="https://docs.databricks.com/aws/en/ldp/" target="_blank" rel="noopener"&gt;Lakeflow Spark Declarative Pipelines&lt;/A&gt;&amp;nbsp;with&amp;nbsp;&lt;A class="" href="https://docs.databricks.com/aws/en/ldp/auto-scaling" target="_blank" rel="noopener"&gt;enhanced autoscaling&lt;/A&gt;&amp;nbsp;for streaming workloads.&lt;/EM&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;SPAN&gt;We don't use SDP because of serverless limitations. Is it not advised to use enhanced autoscaling for non-SDP jobs? And why is that?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 12 May 2026 06:45:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/autoscaling-with-the-autoloader-without-sdp/m-p/156637#M54459</guid>
      <dc:creator>HTD360</dc:creator>
      <dc:date>2026-05-12T06:45:22Z</dc:date>
    </item>
    <item>
      <title>Re: Autoscaling with the autoloader without SDP</title>
      <link>https://community.databricks.com/t5/data-engineering/autoscaling-with-the-autoloader-without-sdp/m-p/156640#M54460</link>
      <description>&lt;P&gt;And to add to the question. What if I have a job with 10 tasks that all use the autoloader. Would that benefit from auto-scaling?&lt;/P&gt;</description>
      <pubDate>Tue, 12 May 2026 06:54:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/autoscaling-with-the-autoloader-without-sdp/m-p/156640#M54460</guid>
      <dc:creator>HTD360</dc:creator>
      <dc:date>2026-05-12T06:54:38Z</dc:date>
    </item>
    <item>
      <title>Re: Autoscaling with the autoloader without SDP</title>
      <link>https://community.databricks.com/t5/data-engineering/autoscaling-with-the-autoloader-without-sdp/m-p/156679#M54465</link>
      <description>&lt;P&gt;Hello !&lt;/P&gt;&lt;P&gt;In DBKS we have 2&amp;nbsp;different autoscaling mechanisms here:&lt;/P&gt;&lt;P&gt;- normal compute autoscaling on a job or all purpose cluster : this is the autoscaling you enable on a classic job cluster by setting min or max workers and for structured streaming jobs, it is not recommended to enable compute autoscaling because scale down has limitations for streaming workloads. The cluster may not scale down as expected and if you want to resize you will experience latency especially for stateful streams.&lt;/P&gt;&lt;P&gt;- autoscaling for LSDP : this is a pipeline specific autoscaling mode that uses pipeline workload metrics such as task slot usage and queued tasks. It improves streaming workload optimization and can proactively shut down under used nodes while avoiding failed tasks during shutdown.&lt;/P&gt;&lt;P&gt;So shortly :&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN&gt;for non SDP continuous auto loader jobs you can use fixed size jobs compute&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;for non SDP available now auto loader jobs autoscaling can be reasonable&lt;/LI&gt;&lt;LI&gt;for streaming autoscaling with better scale down behavior LSDP autoscaling is the recommended option&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Tue, 12 May 2026 11:30:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/autoscaling-with-the-autoloader-without-sdp/m-p/156679#M54465</guid>
      <dc:creator>amirabedhiafi</dc:creator>
      <dc:date>2026-05-12T11:30:48Z</dc:date>
    </item>
    <item>
      <title>Re: Autoscaling with the autoloader without SDP</title>
      <link>https://community.databricks.com/t5/data-engineering/autoscaling-with-the-autoloader-without-sdp/m-p/156680#M54466</link>
      <description>&lt;P&gt;Hi, thank you for your answer. Could you elaborate a bit on this?&lt;BR /&gt;&lt;STRONG&gt;for non SDP available now auto loader jobs autoscaling can be reasonable&lt;BR /&gt;&lt;BR /&gt;&lt;/STRONG&gt;How do you decide on whether it is reasonable or not? Especially you said i&lt;SPAN&gt;t is not recommended to enable compute autoscaling because scale down has limitations for streaming workloads&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 12 May 2026 12:07:37 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/autoscaling-with-the-autoloader-without-sdp/m-p/156680#M54466</guid>
      <dc:creator>HTD360</dc:creator>
      <dc:date>2026-05-12T12:07:37Z</dc:date>
    </item>
  </channel>
</rss>

