<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Autoloader Checkpoint Fails and then the after changing the checkpoint path need to reload all d in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88197#M37503</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/119085"&gt;@Subhasis&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Could you provide more details? Like exact error message, you autoloader configuration etc. It will be easier for us to help you&lt;/P&gt;</description>
    <pubDate>Wed, 04 Sep 2024 05:53:13 GMT</pubDate>
    <dc:creator>szymon_dybczak</dc:creator>
    <dc:date>2024-09-04T05:53:13Z</dc:date>
    <item>
      <title>Autoloader Checkpoint Fails and then the after changing the checkpoint path need to reload all data</title>
      <link>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88193#M37500</link>
      <description>&lt;P&gt;Autoloader Checkpoint Fails and then the after changing the checkpoint path need to reload all data. I want to load all the data which are not processed . I don't want to relaod all the data.&lt;/P&gt;</description>
      <pubDate>Wed, 04 Sep 2024 05:31:32 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88193#M37500</guid>
      <dc:creator>Subhasis</dc:creator>
      <dc:date>2024-09-04T05:31:32Z</dc:date>
    </item>
    <item>
      <title>Re: Autoloader Checkpoint Fails and then the after changing the checkpoint path need to reload all d</title>
      <link>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88197#M37503</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/119085"&gt;@Subhasis&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Could you provide more details? Like exact error message, you autoloader configuration etc. It will be easier for us to help you&lt;/P&gt;</description>
      <pubDate>Wed, 04 Sep 2024 05:53:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88197#M37503</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2024-09-04T05:53:13Z</dc:date>
    </item>
    <item>
      <title>Re: Autoloader Checkpoint Fails and then the after changing the checkpoint path need to reload all d</title>
      <link>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88198#M37504</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/119085"&gt;@Subhasis&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;What do you exactly mean by "Autoloader Checkpoint fails"?&lt;BR /&gt;How did you change the checkpoint path? Simply by specifying a new path?&lt;BR /&gt;If yes, then it's normal that it will try to reload all data, as it sees that the checkpoint is empty thus it's trying to load everything it finds.&amp;nbsp;&lt;BR /&gt;What you could do is to specify&amp;nbsp;&lt;EM&gt;modifiedAfter&amp;nbsp;&lt;/EM&gt;option to set up a cutoff date.&lt;/P&gt;</description>
      <pubDate>Wed, 04 Sep 2024 05:54:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88198#M37504</guid>
      <dc:creator>daniel_sahal</dc:creator>
      <dc:date>2024-09-04T05:54:43Z</dc:date>
    </item>
    <item>
      <title>Re: Autoloader Checkpoint Fails and then the after changing the checkpoint path need to reload all d</title>
      <link>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88202#M37507</link>
      <description>&lt;P&gt;No such error is showing it is reading file but it is not writing the data into deltatable. Then when I identified it is not writing data I created a new checkpoint path. Then it is reloading all the data. How to avoid this situation . Modified after I used but then not able to identify from when the data is not writing since the job is not failing as such.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Sep 2024 06:28:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88202#M37507</guid>
      <dc:creator>Subhasis</dc:creator>
      <dc:date>2024-09-04T06:28:08Z</dc:date>
    </item>
    <item>
      <title>Re: Autoloader Checkpoint Fails and then the after changing the checkpoint path need to reload all d</title>
      <link>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88203#M37508</link>
      <description>&lt;P&gt;Do checkpoint has some benchmark capacity after that it stops writing data?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Sep 2024 06:30:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88203#M37508</guid>
      <dc:creator>Subhasis</dc:creator>
      <dc:date>2024-09-04T06:30:24Z</dc:date>
    </item>
    <item>
      <title>Re: Autoloader Checkpoint Fails and then the after changing the checkpoint path need to reload all d</title>
      <link>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88204#M37509</link>
      <description>&lt;P&gt;You can use cloud_files_state function to see what files has been processed by autoloader and saved in checkpoint.&lt;/P&gt;&lt;P&gt;I'm assuming that in your case you have some misconfiguration that's causing a problem&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://docs.databricks.com/en/sql/language-manual/functions/cloud_files_state.html" target="_blank" rel="noopener"&gt;cloud_files_state table-valued function | Databricks on AWS&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Sep 2024 06:36:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/autoloader-checkpoint-fails-and-then-the-after-changing-the/m-p/88204#M37509</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2024-09-04T06:36:53Z</dc:date>
    </item>
  </channel>
</rss>

