
Autoloader Console Output Issue

ChristianRRL
Valued Contributor III

In reference to a prior post: Re: Autoloader Error Loading and Displaying - Databricks Community - 122579

I am attempting to output results to the console (notebook cell), but I'm not seeing anything other than the dataframe schema. Is this expected? I'm just starting to use Auto Loader and I'd like an easy, straightforward way to debug the data, and using .trigger(availableNow=True) seemed like the simplest approach.
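For reference, the kind of query I'm describing looks roughly like this (the paths, schema location, and JSON format are placeholders standing in for my actual setup):

# Minimal sketch of an Auto Loader stream written to the console sink.
# All paths below are placeholders, not the actual values from my notebook.
df = (
    spark.readStream
    .format("cloudFiles")
    .option("cloudFiles.format", "json")
    .option("cloudFiles.schemaLocation", "/tmp/autoloader_demo/_schema")
    .load("/tmp/autoloader_demo/landing/")
)

query = (
    df.writeStream
    .format("console")              # print each micro-batch to stdout
    .outputMode("append")
    .trigger(availableNow=True)     # process all available files, then stop
    .start()
)
query.awaitTermination()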

[screenshot: ChristianRRL_2-1754599656677.png]

 

4 REPLIES

szymon_dybczak
Esteemed Contributor III

Hi @ChristianRRL ,

Did you run this code before? Maybe all your source files have already been recorded in the checkpoint. Try uploading a new JSON file and running it again. Also, you can check the driver logs; sometimes you can find error messages there.

 

ChristianRRL
Valued Contributor III

In the example I shared, there's no checkpoint because I'm simulating a first run with a fresh file. Additionally, the data is not being written to any specific location or managed table. I am able to view the data once it's appended to a raw table (not shown in the picture), but I'm trying to figure out if there's a simple way to simulate a run and display the results without actually writing data out anywhere.

I'm down to check out the driver logs. Where/how can I access them?

szymon_dybczak
Esteemed Contributor III

Oh, I didn't notice that you don't have a checkpoint. I guess that's the reason for your issue. You must specify the checkpointLocation option before you run a streaming query. As I replied in a different topic, Auto Loader is based on Spark Structured Streaming under the hood, and I strongly recommend reading the Structured Streaming overview. It should clarify a lot of concepts for you, such as how a streaming query works, what checkpoints are, and more.
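For example, something along these lines (reusing the df stream from your snippet; the checkpoint path is just a placeholder):

# Same console writer, but with an explicit checkpoint location set.
# The checkpoint path is only a placeholder - use your own storage path.
query = (
    df.writeStream
    .format("console")
    .outputMode("append")
    .option("checkpointLocation", "/tmp/autoloader_demo/_checkpoint")
    .trigger(availableNow=True)
    .start()
)
query.awaitTermination()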

[screenshot: szymon_dybczak_0-1754639384027.png]

PS: To find the driver logs, go to Compute -> click on your cluster -> Driver logs.

ChristianRRL
Valued Contributor III

A couple of quick follow-ups.

Respectfully (no negative tone, I promise), I have browsed through the Structured Streaming Programming Guide - Spark 4.0.0 Documentation and other docs. I'm not an expert and am learning as I go, but at least when using .format("console"), it doesn't seem like a checkpoint is needed.
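The console examples in the guide are roughly this shape (I'm using the built-in rate source here as a stand-in rather than copying the guide's exact snippet), with no checkpointLocation set:

# Rough shape of a console-sink example from the Structured Streaming guide.
# The rate source is a stand-in; note there is no checkpointLocation option.
rate_df = (
    spark.readStream
    .format("rate")
    .option("rowsPerSecond", 1)
    .load()
)

rate_query = (
    rate_df.writeStream
    .format("console")
    .outputMode("append")
    .start()
)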

[screenshot: ChristianRRL_1-1754673768194.png]

I tried running the notebook cell both with and without the checkpoint, and I'm getting the same result (no output in the notebook cell).

[screenshot: ChristianRRL_2-1754674146043.png]

One thing I stumbled into, however: it seems like the console sink maybe doesn't work quite how I would've hoped? For example, the Spark guide shows it being used when executing an actual Python file, whereas I'm trying to run a simple notebook cell. If that's the case, I'm not sure why there wouldn't be a simple way to test this in a notebook.

[screenshot: ChristianRRL_0-1754673726836.png]

The only way I have been able to test this in a way that works is via @lingareddy_Alva's 2nd suggestion, "Use Memory Sink for Testing", here:

I was hoping the first suggestion would work, though, as it's more concise and intuitive. Please let me know if I'm missing anything!
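For reference, the memory-sink approach that worked for me looks roughly like this (df is the Auto Loader stream from above, and the query name is arbitrary):

# Memory sink: results land in an in-memory table that can be queried
# directly from the notebook, so nothing is written out anywhere.
query = (
    df.writeStream
    .format("memory")
    .queryName("autoloader_preview")    # arbitrary name for the in-memory table
    .outputMode("append")
    .trigger(availableNow=True)
    .start()
)
query.awaitTermination()

display(spark.sql("SELECT * FROM autoloader_preview LIMIT 100"))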

 
