<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Auotoloader-&amp;quot;cloudFiles.backfillInterval&amp;quot; in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/auotoloader-quot-cloudfiles-backfillinterval-quot/m-p/50606#M1713</link>
    <description>&lt;P&gt;Hi &lt;A href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9" target="_blank"&gt;@Kaniz&lt;/A&gt; , Can you please answer follows question ,&lt;BR /&gt;1.Is the following code correct for specifying the .option("cloudFiles.backfillInterval", 300)?&lt;BR /&gt;df = spark.readStream.format("cloudFiles") \&lt;BR /&gt;.option("cloudFiles.format", "csv") \&lt;BR /&gt;.option("cloudFiles.schemaLocation", f"dbfs:/FileStore/xyz/back_fill_option/schema/backfill")\&lt;BR /&gt;.load(f"dbfs:/FileStore/xyz/back_fill_option/source")&lt;/P&gt;&lt;P&gt;df.writeStream \&lt;BR /&gt;.format("delta") \&lt;BR /&gt;.option("cloudFiles.backfillInterval", 300) \&lt;BR /&gt;.trigger(processingTime='3 minutes') \&lt;BR /&gt;.option("checkpointLocation", f"dbfs:/FileStore/xyz/back_fill_option/checkpoint/backfill") \&lt;BR /&gt;.table("back_fill_option")&lt;/P&gt;&lt;P&gt;2.If the autoloader streaming process begins at "2023-11-01T01:00:00" and you set .option("cloudFiles.backfillInterval", 300), does this mean that the backfillInterval will trigger at "2023-11-01T01:05:00"?&lt;BR /&gt;3.When you pass the option .trigger(processingTime='3 minutes'), it triggers the process every 3 minutes. If you also set backfillInterval to 2 minutes, does that mean the backfillInterval triggers every 2 minutes?&lt;BR /&gt;4.When you set the property processingTime to a value greater than backfillInterval, does that mean the backfillInterval runs before the processingTime interval elapses?&lt;BR /&gt;5.How can you verify the functionality of the "cloudFiles.backfillInterval" to ensure it is working correctly with the provided autoloader code?&lt;/P&gt;</description>
    <pubDate>Wed, 08 Nov 2023 07:18:31 GMT</pubDate>
    <dc:creator>Kiranrathod</dc:creator>
    <dc:date>2023-11-08T07:18:31Z</dc:date>
    <item>
      <title>Auotoloader-"cloudFiles.backfillInterval"</title>
      <link>https://community.databricks.com/t5/get-started-discussions/auotoloader-quot-cloudfiles-backfillinterval-quot/m-p/49843#M1639</link>
      <description>&lt;P&gt;1. How to use&amp;nbsp;cloudFiles.backfillInterval option in a notebook?&lt;BR /&gt;2. Does It need to be any set of the property?&lt;BR /&gt;3. Where is exactly placed readstream portion of the code or writestream portion of the code?&lt;BR /&gt;4. Do you have any sample code?&lt;BR /&gt;5. Where we find cloudFiles.backfillInterval logs?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 25 Oct 2023 10:15:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/auotoloader-quot-cloudfiles-backfillinterval-quot/m-p/49843#M1639</guid>
      <dc:creator>Kiranrathod</dc:creator>
      <dc:date>2023-10-25T10:15:28Z</dc:date>
    </item>
    <item>
      <title>Re: Auotoloader-"cloudFiles.backfillInterval"</title>
      <link>https://community.databricks.com/t5/get-started-discussions/auotoloader-quot-cloudfiles-backfillinterval-quot/m-p/50256#M1677</link>
      <description>&lt;P&gt;1.Is the following code correct for specifying the&amp;nbsp;&amp;nbsp;&lt;STRONG&gt;.option("cloudFiles.backfillInterval", 300)&lt;/STRONG&gt;?&lt;BR /&gt;df = spark.readStream.format("cloudFiles") \&lt;BR /&gt;.option("cloudFiles.format", "csv") \&lt;BR /&gt;.option("cloudFiles.schemaLocation", f"dbfs:/FileStore/xyz/back_fill_option/schema/backfill")\&lt;BR /&gt;.load(f"dbfs:/FileStore/xyz/back_fill_option/source")&lt;/P&gt;&lt;P&gt;df.writeStream \&lt;BR /&gt;.format("delta") \&lt;BR /&gt;.option("cloudFiles.backfillInterval", 300) \&lt;BR /&gt;.trigger(processingTime='3 minutes') \&lt;BR /&gt;.option("checkpointLocation", f"dbfs:/FileStore/xyz/back_fill_option/checkpoint/backfill") \&lt;BR /&gt;.table("back_fill_option")&lt;BR /&gt;&lt;BR /&gt;2.If the autoloader streaming process begins at &lt;STRONG&gt;"2023-11-01T01:00:00"&lt;/STRONG&gt; and you set &lt;STRONG&gt;.option("cloudFiles.backfillInterval", 300),&lt;/STRONG&gt; does this mean that the backfillInterval will trigger at &lt;STRONG&gt;"2023-11-01T01:05:00"&lt;/STRONG&gt;?&lt;BR /&gt;3.When you pass the option &lt;STRONG&gt;.trigger(processingTime='3 minutes')&lt;/STRONG&gt;, it triggers the process every 3 minutes. If you also set &lt;STRONG&gt;backfillInterval&lt;/STRONG&gt; to 2 minutes, does that mean the &lt;STRONG&gt;backfillInterval&lt;/STRONG&gt; triggers every 2 minutes?&lt;BR /&gt;4.When you set the property &lt;STRONG&gt;processingTime&lt;/STRONG&gt; to a value greater than &lt;STRONG&gt;backfillInterval&lt;/STRONG&gt;, does that mean the &lt;STRONG&gt;backfillInterval&lt;/STRONG&gt; runs before the &lt;STRONG&gt;processingTime&lt;/STRONG&gt; interval elapses?&lt;BR /&gt;5.How can you verify the functionality of the &lt;STRONG&gt;"cloudFiles.backfillInterval"&lt;/STRONG&gt; to ensure it is working correctly with the provided autoloader code?&lt;/P&gt;</description>
      <pubDate>Wed, 01 Nov 2023 11:09:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/auotoloader-quot-cloudfiles-backfillinterval-quot/m-p/50256#M1677</guid>
      <dc:creator>Kiranrathod</dc:creator>
      <dc:date>2023-11-01T11:09:59Z</dc:date>
    </item>
    <item>
      <title>Re: Auotoloader-"cloudFiles.backfillInterval"</title>
      <link>https://community.databricks.com/t5/get-started-discussions/auotoloader-quot-cloudfiles-backfillinterval-quot/m-p/50606#M1713</link>
      <description>&lt;P&gt;Hi &lt;A href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9" target="_blank"&gt;@Kaniz&lt;/A&gt; , Can you please answer follows question ,&lt;BR /&gt;1.Is the following code correct for specifying the .option("cloudFiles.backfillInterval", 300)?&lt;BR /&gt;df = spark.readStream.format("cloudFiles") \&lt;BR /&gt;.option("cloudFiles.format", "csv") \&lt;BR /&gt;.option("cloudFiles.schemaLocation", f"dbfs:/FileStore/xyz/back_fill_option/schema/backfill")\&lt;BR /&gt;.load(f"dbfs:/FileStore/xyz/back_fill_option/source")&lt;/P&gt;&lt;P&gt;df.writeStream \&lt;BR /&gt;.format("delta") \&lt;BR /&gt;.option("cloudFiles.backfillInterval", 300) \&lt;BR /&gt;.trigger(processingTime='3 minutes') \&lt;BR /&gt;.option("checkpointLocation", f"dbfs:/FileStore/xyz/back_fill_option/checkpoint/backfill") \&lt;BR /&gt;.table("back_fill_option")&lt;/P&gt;&lt;P&gt;2.If the autoloader streaming process begins at "2023-11-01T01:00:00" and you set .option("cloudFiles.backfillInterval", 300), does this mean that the backfillInterval will trigger at "2023-11-01T01:05:00"?&lt;BR /&gt;3.When you pass the option .trigger(processingTime='3 minutes'), it triggers the process every 3 minutes. If you also set backfillInterval to 2 minutes, does that mean the backfillInterval triggers every 2 minutes?&lt;BR /&gt;4.When you set the property processingTime to a value greater than backfillInterval, does that mean the backfillInterval runs before the processingTime interval elapses?&lt;BR /&gt;5.How can you verify the functionality of the "cloudFiles.backfillInterval" to ensure it is working correctly with the provided autoloader code?&lt;/P&gt;</description>
      <pubDate>Wed, 08 Nov 2023 07:18:31 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/auotoloader-quot-cloudfiles-backfillinterval-quot/m-p/50606#M1713</guid>
      <dc:creator>Kiranrathod</dc:creator>
      <dc:date>2023-11-08T07:18:31Z</dc:date>
    </item>
  </channel>
</rss>

