Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Streaming- Results not getting updated on arrival of new files

AanchalSoni
New Contributor II

Hi!

I'm trying to stream some files using spark.readStream.format("cloudFiles"). However, when new files arrive, the subsequent SQL query and monitoring graphs are not getting updated. Please suggest.

9 REPLIES

BS_THE_ANALYST
Esteemed Contributor II

Hi @AanchalSoni , 

How have you set up your stream? Could you provide the code? 😊 Perhaps you haven't set up the stream trigger to behave the way you want: https://docs.databricks.com/aws/en/structured-streaming/triggers
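
For reference, a minimal sketch of where a trigger goes, assuming a streaming DataFrame has already been created with spark.readStream (the variable, table name, and checkpoint path below are placeholders, not from the original post):

# Minimal sketch: the trigger is set on the write side of the stream.
# processingTime polls the source on a fixed interval; availableNow=True
# processes everything currently available and then stops.
query = (
    streaming_df.writeStream
    .format("delta")
    .option("checkpointLocation", "/Volumes/<catalog>/<schema>/<volume>/checkpoint/")  # placeholder path
    .trigger(processingTime="1 minute")  # or .trigger(availableNow=True)
    .toTable("my_target_table")  # hypothetical table name; toTable() starts the query
)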

All the best,
BS

AanchalSoni
New Contributor II

Please check the attachments

(Virus scan in progress ...)
(Virus scan in progress ...)

szymon_dybczak
Esteemed Contributor III

It seems there is a problem with attachments on the community in recent days. All of them are stuck with this "Virus scan in progress" message. Could you try copying those images directly into the text box?

Hi, sorry for calling on you directly, @Advika, @Sujitha, but maybe you know if there have been any changes to the attachment-adding system recently?

df_super = (
    spark.readStream.format("cloudFiles")
    .option("cloudFiles.format", "csv")
    .option("cloudFiles.validateOptions", "false")
    .option(
        "cloudFiles.schemaHints",
        "ROW_ID int, Order_Date date, Ship_Date date, Postal_Code int, Sales double, Quantity double, Discount double, Profit double",
    )
    .option("cloudFiles.schemaLocation", "/Volumes/dlt/default/data/Schema2/")
    .load("/Volumes/dlt/default/data/Superstore/")
)
 --------
spark.sql("select * from vw_superstore").display(checkpointLocation = "/Volumes/dlt/default/data/Check_super13/")

saurabh18cs
Honored Contributor II

Hi @AanchalSoni, thanks for sharing your code. Your job isn't scheduled and is running continuously, right?

+

You need to write the streaming DataFrame out to a table or view so that vw_superstore actually exists.

example:

df_super.writeStream \
    .format("delta") \
    .option("checkpointLocation", "/Volumes/dlt/default/data/Check_super13/") \
    .outputMode("append") \
    .toTable("vw_superstore")  # toTable() starts the streaming query, no separate .start() is needed

Thanks for tagging, @szymon_dybczak! We'll check and get back on this.

saurabh18cs
Honored Contributor II

Hi @AanchalSoni, if you are using .readStream, make sure you have set a trigger interval (e.g., .trigger(processingTime='1 minute')).

szymon_dybczak
Esteemed Contributor III

If you don't define a trigger, by default it will run a micro-batch every 0.5 seconds, so I guess this is not the issue here.
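
For what it's worth, a quick way to verify whether the stream is actually producing micro-batches is to inspect the active queries' progress (a generic sketch, assuming the notebook has at least one running streaming query):

# Minimal sketch: list active streaming queries and their last progress.
# numInputRows staying at 0 across batches suggests no new files are being
# picked up from the source directory.
for q in spark.streams.active:
    print(q.name, q.status)
    print(q.lastProgress)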

Hi Saurabh!

If I'm not explicitly specifying the trigger, then by default it should be 500 ms and there should be a quick check for new files. However, even after a few minutes there is no expected activity.