<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: AutoLoader - Write To Console (Notebook Cell) Long Running Issue in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/autoloader-write-to-console-notebook-cell-long-running-issue/m-p/127489#M10483</link>
    <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/96188"&gt;@ChristianRRL&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;This is expected behavior. Under the hood autoloader uses spark structured streaming. In spark structured streaming you can't use display.&amp;nbsp;&lt;/P&gt;&lt;P&gt;It would be beneficial for you to familiarize yourself with structured streaming concept. It is whole different world than traditional batch approach, so hence your confusion:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://spark.apache.org/docs/latest/streaming/index.html" target="_blank"&gt;https://spark.apache.org/docs/latest/streaming/index.html&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Tue, 05 Aug 2025 15:58:04 GMT</pubDate>
    <dc:creator>szymon_dybczak</dc:creator>
    <dc:date>2025-08-05T15:58:04Z</dc:date>
    <item>
      <title>AutoLoader - Write To Console (Notebook Cell) Long Running Issue</title>
      <link>https://community.databricks.com/t5/get-started-discussions/autoloader-write-to-console-notebook-cell-long-running-issue/m-p/127469#M10479</link>
      <description>&lt;P&gt;Hi there,&lt;/P&gt;&lt;P&gt;I am likely misunderstanding how to use AutoLoader properly while developing/testing. I am trying to write a simple AutoLoader notebook cell to *read* the contents of a path with json files, and *write* them to console (i.e. notebook cell) in order to visualize the results. I kicked this off yesterday before logging off, and when I logged back in the morning, I realized that the cell was running for nearly 16 hours!&lt;/P&gt;&lt;P&gt;Can I get some assistance to understand what I'm doing wrong? I don't want to setup a permanent or long running data stream currently. At this time, I only have a filepath with a very small number of files (less than 10 with some few files being occasionally manually added), and I want to be able to easily view the contents of the files without requiring a permanent or long-running stream.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="ChristianRRL_0-1754403001614.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/18740i1AFA2BD54DBE78AA/image-size/medium?v=v2&amp;amp;px=400" role="button" title="ChristianRRL_0-1754403001614.png" alt="ChristianRRL_0-1754403001614.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 05 Aug 2025 14:18:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/autoloader-write-to-console-notebook-cell-long-running-issue/m-p/127469#M10479</guid>
      <dc:creator>ChristianRRL</dc:creator>
      <dc:date>2025-08-05T14:18:08Z</dc:date>
    </item>
    <item>
      <title>Re: AutoLoader - Write To Console (Notebook Cell) Long Running Issue</title>
      <link>https://community.databricks.com/t5/get-started-discussions/autoloader-write-to-console-notebook-cell-long-running-issue/m-p/127479#M10481</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/96188"&gt;@ChristianRRL&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;It looks like spark.readStream with Auto Loader creates a continuous streaming job by default, which means it keeps running while waiting for new files.&lt;/P&gt;&lt;P&gt;To avoid this, you can control the behaviour using trigger(availableNow=True), which processes all data available at the start, but may break the work into multiple micro-batches.&lt;/P&gt;</description>
      <pubDate>Tue, 05 Aug 2025 15:13:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/autoloader-write-to-console-notebook-cell-long-running-issue/m-p/127479#M10481</guid>
      <dc:creator>SP_6721</dc:creator>
      <dc:date>2025-08-05T15:13:14Z</dc:date>
    </item>
    <item>
      <title>Re: AutoLoader - Write To Console (Notebook Cell) Long Running Issue</title>
      <link>https://community.databricks.com/t5/get-started-discussions/autoloader-write-to-console-notebook-cell-long-running-issue/m-p/127486#M10482</link>
      <description>&lt;P&gt;Fantastic! This is a great step forward, just one more thing. The trigger(availableNow=True) worked as you said, but I'm still not seeing the data displaying in the notebook cell. Is there something else I'm missing?&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="ChristianRRL_0-1754407753844.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/18741iC3879B5B66DC9ACE/image-size/medium?v=v2&amp;amp;px=400" role="button" title="ChristianRRL_0-1754407753844.png" alt="ChristianRRL_0-1754407753844.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 05 Aug 2025 15:32:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/autoloader-write-to-console-notebook-cell-long-running-issue/m-p/127486#M10482</guid>
      <dc:creator>ChristianRRL</dc:creator>
      <dc:date>2025-08-05T15:32:06Z</dc:date>
    </item>
    <item>
      <title>Re: AutoLoader - Write To Console (Notebook Cell) Long Running Issue</title>
      <link>https://community.databricks.com/t5/get-started-discussions/autoloader-write-to-console-notebook-cell-long-running-issue/m-p/127489#M10483</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/96188"&gt;@ChristianRRL&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;This is expected behavior. Under the hood autoloader uses spark structured streaming. In spark structured streaming you can't use display.&amp;nbsp;&lt;/P&gt;&lt;P&gt;It would be beneficial for you to familiarize yourself with structured streaming concept. It is whole different world than traditional batch approach, so hence your confusion:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://spark.apache.org/docs/latest/streaming/index.html" target="_blank"&gt;https://spark.apache.org/docs/latest/streaming/index.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 05 Aug 2025 15:58:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/autoloader-write-to-console-notebook-cell-long-running-issue/m-p/127489#M10483</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-08-05T15:58:04Z</dc:date>
    </item>
  </channel>
</rss>

