Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

File Trigger Not Triggering Multiple Runs

Sneeze7432
New Contributor III

I have a job with one task, which runs a notebook. The job is set up with a file arrival trigger pointing at my blob storage location. The trigger works and the job starts when a new file arrives in the location, but it does not run once per file when multiple files arrive.

For example, I had three files uploaded at different times: the first at 3:57:03, the second at 3:57:07, and the last at 3:57:10. Three new files arrived, but only one job run was started. Why were three runs not queued?

13 REPLIES

nayan_wylde
Honored Contributor

Did you overwrite a file with the same name? Overwriting an existing file with a file of the same name does not trigger a run.

No, each file had a unique name.

nayan_wylde
Honored Contributor

Check whether you have configured these two options.

[Screenshot: the trigger's advanced settings, "Minimum time between triggers" and "Wait after last change"]

They are both set to 00h 00m.

szymon_dybczak
Esteemed Contributor III

Hi @Sneeze7432 ,

I think it could be caused by the following option, "Wait after last change in seconds". According to the documentation:

"The time to wait to trigger a run after file arrival. Another file arrival in this period resets the timer. This setting can be used when files arrive in batches, and the whole batch needs to be processed after all files have arrived."

An important thing to keep in mind is that "another file arrival in this period resets the timer". Put differently, if files arrive continuously, your workflow will never start, because its execution will keep being delayed. For that reason, this setting should only be used to optimize processing of files that arrive in batches.
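For reference, these two settings can also be managed through the Jobs API. Below is a minimal sketch using the Python databricks-sdk, assuming its jobs surface matches the Jobs 2.1 API fields; the job ID and storage URL are placeholders. Note that even with both values at zero (mirroring the 00h 00m in the UI), the platform still appears to coalesce near-simultaneous arrivals into one run, which is the behavior discussed in this thread.

```python
# Minimal sketch, assuming the databricks-sdk package (pip install databricks-sdk).
from databricks.sdk import WorkspaceClient
from databricks.sdk.service.jobs import (
    FileArrivalTriggerConfiguration,
    JobSettings,
    TriggerSettings,
)

w = WorkspaceClient()  # reads host/token from env vars or ~/.databrickscfg

w.jobs.update(
    job_id=123456789,  # hypothetical job ID
    new_settings=JobSettings(
        trigger=TriggerSettings(
            file_arrival=FileArrivalTriggerConfiguration(
                # Hypothetical landing location.
                url="abfss://landing@myaccount.dfs.core.windows.net/incoming/",
                # Fire as soon as possible after a file lands ...
                min_time_between_triggers_seconds=0,
                # ... and don't wait for the batch to "settle".
                wait_after_last_change_seconds=0,
            )
        )
    ),
)
```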

I have the "Wait after last change" setting set to 00h 00m which I would assume means that immediately after a file drops in the storage location the job run will start.  I would also assume that means if I drop multiple files in the same location multiple jobs should start, and based on my concurrency limits some may have to be queued.

szymon_dybczak
Esteemed Contributor III

I'm just guessing, because unfortunately we don't have insight into how this was implemented, but it seems to me that the Databricks engineers treat files uploaded within a short time interval as a single batch, most likely for optimization purposes. If a trigger were generated every second, it wouldn't be a very efficient approach.
Even the option itself is specified in minutes, as if anything below that granularity is assumed to be a single batch.

[Screenshot: the trigger option, specified in hours and minutes]

What doesn't make sense is that the notification bar tells me "3 new files", but only one job runs. So even though it can display the number of new files between checks, it still starts only one run?

I don't know, it doesn't seem to be set up very well.
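Given that one triggered run can represent several new files, a defensive pattern is to make the notebook task itself batch-aware rather than assuming one run equals one file. A rough sketch, assuming it runs in a Databricks notebook (where spark and dbutils are predefined); the landing path and the ops.processed_files / bronze.incoming tables are hypothetical bookkeeping of my own, not anything the trigger provides:

```python
# Hypothetical landing folder watched by the file arrival trigger.
landing = "abfss://landing@myaccount.dfs.core.windows.net/incoming/"

# Hypothetical bookkeeping table of file paths already processed.
processed = {r.path for r in spark.table("ops.processed_files").collect()}

# Process every file this run hasn't seen, not just "the" new file.
new_files = [f.path for f in dbutils.fs.ls(landing) if f.path not in processed]

for path in new_files:
    df = spark.read.format("csv").option("header", "true").load(path)
    df.write.mode("append").saveAsTable("bronze.incoming")

# Record what this run handled so a later run skips it.
if new_files:
    spark.createDataFrame([(p,) for p in new_files], "path string") \
         .write.mode("append").saveAsTable("ops.processed_files")
```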

szymon_dybczak
Esteemed Contributor III

Maybe a Databricks employee will jump in and shed some light on the implementation details. But to me, treating really short intervals as one batch is quite a reasonable approach to avoid a massive number of triggers.

Same, I would really appreciate more details around this.

MariuszK
Valued Contributor III

It looks like the trigger processes files in batches, which means that each uploaded file doesn't create a new instance of the job.

  • Wait after last change in seconds: The time to wait to trigger a run after file arrival. Another file arrival in this period resets the timer. This setting can be used when files arrive in batches, and the whole batch needs to be processed after all files have arrived.

If you need to process files immediately or separately, you can experiment with the Auto Loader configuration, as sketched below.
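For example, a minimal Auto Loader sketch; the paths, schema location, and table name are placeholders. Auto Loader tracks discovered files in its checkpoint, so each file is processed exactly once regardless of how many trigger events fired:

```python
# Incrementally discover new files in the landing folder with Auto Loader.
df = (
    spark.readStream.format("cloudFiles")
    .option("cloudFiles.format", "csv")
    .option("cloudFiles.schemaLocation",
            "abfss://landing@myaccount.dfs.core.windows.net/_schemas/incoming/")
    .load("abfss://landing@myaccount.dfs.core.windows.net/incoming/")
)

(
    df.writeStream
    .option("checkpointLocation",
            "abfss://landing@myaccount.dfs.core.windows.net/_checkpoints/incoming/")
    .trigger(availableNow=True)  # drain everything that has arrived, then stop
    .toTable("bronze.incoming")
)
```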

 

nayan_wylde
Honored Contributor

[Screenshot: the job's "Maximum concurrent runs" setting]

@Sneeze7432 you can also try editing the maximum concurrent runs on the workflow.

That doesn't solve the problem of runs not queueing. It could actually make things worse: I could have multiple jobs writing to the same location and potentially overwriting each other, creating inaccurate data.
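For completeness, a sketch of what nayan_wylde suggests, again assuming the databricks-sdk surface and a placeholder job ID. Enabling the job queue at least addresses the queueing side: once runs do get created and the concurrency limit is hit, they wait instead of being skipped. It doesn't change how arrivals are coalesced into a single trigger, though.

```python
from databricks.sdk import WorkspaceClient
from databricks.sdk.service.jobs import JobSettings, QueueSettings

w = WorkspaceClient()
w.jobs.update(
    job_id=123456789,  # hypothetical job ID
    new_settings=JobSettings(
        max_concurrent_runs=5,          # allow parallel runs of this job
        queue=QueueSettings(enabled=True),  # queue runs beyond the limit
    ),
)
```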
