<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to reset an autoloader? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-to-reset-an-autoloader/m-p/28801#M20573</link>
    <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;i'm using an autoloader with Azure Databricks:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;df = (spark.readStream.format("cloudFiles")&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;.options(**cloudfile)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;.load("abfss://dev@std******.dfs.core.windows.net/**/*****))&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;at my target checkpointLocation folder there are some files and subdirs created as a result.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;It will detect and process new files which is OK.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Also when I restart my cluster it will again process only the new files, which is OK.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;But if I want to restart the autoloader in order to re-process all files from the source folder again I could not find anything how to do so.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Can someone please give me a hint.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 10 Feb 2022 10:21:16 GMT</pubDate>
    <dc:creator>Scouty</dc:creator>
    <dc:date>2022-02-10T10:21:16Z</dc:date>
    <item>
      <title>How to reset an autoloader?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-reset-an-autoloader/m-p/28801#M20573</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;i'm using an autoloader with Azure Databricks:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;df = (spark.readStream.format("cloudFiles")&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;.options(**cloudfile)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;.load("abfss://dev@std******.dfs.core.windows.net/**/*****))&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;at my target checkpointLocation folder there are some files and subdirs created as a result.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;It will detect and process new files which is OK.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Also when I restart my cluster it will again process only the new files, which is OK.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;But if I want to restart the autoloader in order to re-process all files from the source folder again I could not find anything how to do so.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Can someone please give me a hint.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 10 Feb 2022 10:21:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-reset-an-autoloader/m-p/28801#M20573</guid>
      <dc:creator>Scouty</dc:creator>
      <dc:date>2022-02-10T10:21:16Z</dc:date>
    </item>
    <item>
      <title>Re: How to reset an autoloader?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-reset-an-autoloader/m-p/28802#M20574</link>
      <description>&lt;P&gt;Change the checkpoint location or delete the existing checkpoint location. &lt;/P&gt;&lt;P&gt;The new checkpoint location implies that previous stream has been abandoned  and new stream has been started.&lt;/P&gt;</description>
      <pubDate>Thu, 10 Feb 2022 12:58:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-reset-an-autoloader/m-p/28802#M20574</guid>
      <dc:creator>AmanSehgal</dc:creator>
      <dc:date>2022-02-10T12:58:00Z</dc:date>
    </item>
    <item>
      <title>Re: How to reset an autoloader?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-reset-an-autoloader/m-p/28803#M20575</link>
      <description>&lt;P&gt;@Aman Sehgal​&amp;nbsp;- My name is Piper, and I'm one of the moderators for Databricks. I wanted to jump in real quick to thank you for being so generous with your knowledge. &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; &lt;/P&gt;</description>
      <pubDate>Thu, 10 Feb 2022 14:55:58 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-reset-an-autoloader/m-p/28803#M20575</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-02-10T14:55:58Z</dc:date>
    </item>
  </channel>
</rss>

