<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Issue with Autoloader cleanSource=MOVE Not Working as Expected in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129246#M48476</link>
    <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/180607"&gt;@nikhilshetty4&lt;/a&gt;!&lt;/P&gt;
&lt;P&gt;To confirm, do the files show a non-null commit_time in cloud_files_state? They’ll only move to the archive location after this is set and the retention period has elapsed.&lt;/P&gt;</description>
    <pubDate>Fri, 22 Aug 2025 11:19:52 GMT</pubDate>
    <dc:creator>Advika</dc:creator>
    <dc:date>2025-08-22T11:19:52Z</dc:date>
    <item>
      <title>Issue with Autoloader cleanSource=MOVE Not Working as Expected</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129236#M48473</link>
      <description>&lt;DIV&gt;Hi everyone,&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;I've been exploring the cleanSource option in Autoloader to move files from the source to an archive location after they're processed and loaded into a table. I used the following simple code to test this functionality. While the code executes without any errors, the files remain in the source location and are not moved.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;source_path = "abfss://container@storage_acc.dfs.core.windows.net/source/files"&lt;/DIV&gt;&lt;DIV&gt;archive_path = "abfss://container@storage_acc.dfs.core.windows.net/archive/files"&lt;/DIV&gt;&lt;DIV&gt;schema_location = "abfss://container@storage_acc.dfs.core.windows.net/source/autoloader/schema"&lt;/DIV&gt;&lt;DIV&gt;checkpoint_location = "abfss://container@storage_acc.dfs.core.windows.net/source/autoloader/checkpoint"&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;df = (&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; spark.readStream&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; .format("cloudFiles")&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; .option("cloudFiles.format", "parquet")&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; .option("cloudFiles.schemaLocation", schema_location)&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; .option("cloudFiles.cleanSource", "MOVE")&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; .option("cloudFiles.cleanSource.moveDestination", archive_path)&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; .option("cloudFiles.includeExistingFiles", "true")&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; .load(source_path)&lt;/DIV&gt;&lt;DIV&gt;)&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;(&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; df.writeStream&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; .format("delta")&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; .option("checkpointLocation", checkpoint_location)&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; .outputMode("append")&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp; &amp;nbsp; .table("uc.schema.table_name")&lt;/DIV&gt;&lt;DIV&gt;)&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;When I run the query SELECT * FROM cloud_files_state(checkpoint_location), I notice that the archive_mode and move_location columns are NULL, even though I’ve explicitly set cleanSource to MOVE. I also tested the DELETE option with .option("cloudFiles.cleanSource.retentionDuration", "7 days"), but that didn’t work either.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;I came across a similar issue reported by another user using an S3 bucket as the source:&amp;nbsp;&lt;A title="Autoloader cleansource option does not take any effect" href="https://community.databricks.com/t5/data-engineering/autoloader-cleansource-option-does-not-take-any-effect/td-p/123436" target="_blank" rel="noopener"&gt;Autoloader cleansource option does not take any effect&lt;/A&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;I’ve tested this both in a notebook with a cluster running on Runtime 17.0 and using DLT with Runtime 16.4.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;Could someone help me understand if I’m missing something or if there are any prerequisites or configurations needed to make this work?&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;Thanks,&lt;/DIV&gt;&lt;DIV&gt;Nikhil&lt;/DIV&gt;</description>
      <pubDate>Fri, 22 Aug 2025 10:01:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129236#M48473</guid>
      <dc:creator>nikhilshetty4</dc:creator>
      <dc:date>2025-08-22T10:01:09Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Autoloader cleanSource=MOVE Not Working as Expected</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129239#M48474</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/180607"&gt;@nikhilshetty4&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;I think it might be some kind of bug related to that feature. You are not the first person to report that it doesn't work as expected:&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.databricks.com/t5/data-engineering/autoloader-move-file-to-archive-immediately-after-processing/td-p/120692" target="_blank"&gt;Autoloader move file to archive immediately after ... - Databricks Community - 120692&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 22 Aug 2025 10:21:35 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129239#M48474</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-08-22T10:21:35Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Autoloader cleanSource=MOVE Not Working as Expected</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129246#M48476</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/180607"&gt;@nikhilshetty4&lt;/a&gt;!&lt;/P&gt;
&lt;P&gt;To confirm, do the files show a non-null commit_time in cloud_files_state? They’ll only move to the archive location after this is set and the retention period has elapsed.&lt;/P&gt;</description>
      <pubDate>Fri, 22 Aug 2025 11:19:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129246#M48476</guid>
      <dc:creator>Advika</dc:creator>
      <dc:date>2025-08-22T11:19:52Z</dc:date>
    </item>
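    <!--
    The check suggested in the reply above could be run from a notebook roughly as follows. This is a minimal sketch, not an official recipe: build_state_query is a hypothetical helper, the column names are the ones quoted in this thread, and the commented spark.sql call assumes a Databricks notebook where spark is already defined.

```python
def build_state_query(checkpoint_location: str) -> str:
    """Build a query over cloud_files_state for one checkpoint (hypothetical helper)."""
    # Refuse paths that would break out of the SQL string literal.
    if "'" in checkpoint_location:
        raise ValueError("checkpoint_location must not contain single quotes")
    # Only the columns mentioned in this thread are selected.
    columns = "commit_time, archive_time, archive_mode, move_location"
    return f"SELECT {columns} FROM cloud_files_state('{checkpoint_location}')"


# Usage in a Databricks notebook (assumption: `spark` and
# `checkpoint_location` are defined as in the original post):
# spark.sql(build_state_query(checkpoint_location)).show(truncate=False)
```

    Rows with a timestamp in commit_time but NULL in the three archive columns are files that were ingested but not yet handled by cleanSource.
    -->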
    <item>
      <title>Re: Issue with Autoloader cleanSource=MOVE Not Working as Expected</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129250#M48477</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/152834"&gt;@Advika&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;Yes, the commit_time column contains valid timestamp values. However, the archive_time, archive_mode, and move_location columns are all showing null.&lt;/P&gt;</description>
      <pubDate>Fri, 22 Aug 2025 11:34:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129250#M48477</guid>
      <dc:creator>nikhilshetty4</dc:creator>
      <dc:date>2025-08-22T11:34:52Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Autoloader cleanSource=MOVE Not Working as Expected</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129251#M48478</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/180607"&gt;@nikhilshetty4&lt;/a&gt;, if the archive columns (archive_time, archive_mode, move_location) are null, it means the files haven’t been picked up by cleanSource for move/delete yet. Move/delete occurs only once commit_time is set, the retention period has passed, and the stream is actively processing. If the stream is stopped, cleanup won’t occur; it resumes the next time the stream runs and processes data.&lt;/P&gt;</description>
      <pubDate>Fri, 22 Aug 2025 12:08:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129251#M48478</guid>
      <dc:creator>Advika</dc:creator>
      <dc:date>2025-08-22T12:08:08Z</dc:date>
    </item>
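    <!--
    The three conditions described in the reply above can be written down as a small predicate. This is only a sketch of the rule as stated in this thread, not Auto Loader's actual implementation; is_due_for_cleanup and its parameters are hypothetical names chosen for illustration.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional


def is_due_for_cleanup(
    commit_time: Optional[datetime],
    archive_time: Optional[datetime],
    retention: timedelta,
    now: datetime,
    stream_running: bool,
) -> bool:
    """Return True if a file meets the conditions stated in the thread."""
    if archive_time is not None:
        return False  # already moved or deleted
    if commit_time is None:
        return False  # not committed yet
    if not stream_running:
        return False  # cleanup only happens while the stream is processing
    # The retention period must have fully elapsed since the commit.
    return now - commit_time >= retention


# Example: a file committed 10 minutes ago, checked against a 2-minute retention.
now = datetime(2025, 8, 22, 12, 45, tzinfo=timezone.utc)
committed = now - timedelta(minutes=10)
print(is_due_for_cleanup(committed, None, timedelta(minutes=2), now, True))
```
    -->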
    <item>
      <title>Re: Issue with Autoloader cleanSource=MOVE Not Working as Expected</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129257#M48480</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/152834"&gt;@Advika&lt;/a&gt;, I’ve tried setting the retention duration to 1 or 2 minutes and kept the stream running well beyond that time. Even when new files were processed while the stream was running, the files still weren’t moved to the archive location.&lt;BR /&gt;I've attached a screenshot of the cloud_files_state output:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-left" image-alt="image.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/19258i24F0649151D2D434/image-size/medium?v=v2&amp;amp;px=400" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;I also saw that the Autoloader documentation lists no time restriction for MOVE:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-left" image-alt="Screenshot 2025-08-22 175206.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/19256i4EB17CED59E1B7DA/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Screenshot 2025-08-22 175206.png" alt="Screenshot 2025-08-22 175206.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 22 Aug 2025 12:45:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129257#M48480</guid>
      <dc:creator>nikhilshetty4</dc:creator>
      <dc:date>2025-08-22T12:45:29Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Autoloader cleanSource=MOVE Not Working as Expected</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129269#M48484</link>
      <description>&lt;P&gt;Thanks for sharing the details,&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/180607"&gt;@nikhilshetty4&lt;/a&gt;.&lt;BR /&gt;I recommend raising a case with the &lt;A href="https://help.databricks.com/s/contact-us?ReqType=training" target="_blank"&gt;Databricks Support team&lt;/A&gt; and including all the relevant details. This will help them investigate and resolve the issue more quickly.&lt;/P&gt;</description>
      <pubDate>Fri, 22 Aug 2025 13:05:34 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129269#M48484</guid>
      <dc:creator>Advika</dc:creator>
      <dc:date>2025-08-22T13:05:34Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Autoloader cleanSource=MOVE Not Working as Expected</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129282#M48488</link>
      <description>&lt;P&gt;Got it, thanks! I'll raise a case with the Databricks Support team.&lt;/P&gt;</description>
      <pubDate>Fri, 22 Aug 2025 13:32:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/129282#M48488</guid>
      <dc:creator>nikhilshetty4</dc:creator>
      <dc:date>2025-08-22T13:32:19Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Autoloader cleanSource=MOVE Not Working as Expected</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/133095#M49723</link>
      <description>&lt;P&gt;Any update on this?&lt;/P&gt;</description>
      <pubDate>Fri, 26 Sep 2025 16:29:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/133095#M49723</guid>
      <dc:creator>Ozear</dc:creator>
      <dc:date>2025-09-26T16:29:13Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Autoloader cleanSource=MOVE Not Working as Expected</title>
      <link>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/147955#M52799</link>
      <description>&lt;P&gt;We're having the same issue: with a continuous trigger on a long-running stream, it took 10 hours to "MOVE" files out of the directory despite a retention period of 30s. Ideally, we'd want to move files out immediately, since incoming files slow down the autoloader in directory listing mode (yes, we're planning on moving to file notifications ASAP, but can't dedicate time to it while it would block a launch).&lt;/P&gt;</description>
      <pubDate>Tue, 10 Feb 2026 20:45:58 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/issue-with-autoloader-cleansource-move-not-working-as-expected/m-p/147955#M52799</guid>
      <dc:creator>kunalmishra9</dc:creator>
      <dc:date>2026-02-10T20:45:58Z</dc:date>
    </item>
  </channel>
</rss>

