
File trigger for workflows

tuckera
New Contributor

Does anyone know what triggers a "new" file in workflows? Is it a checksum etc?

2 REPLIES

anardinelli
Databricks Employee

Hi @tuckera, how are you?

By "file in workflows", do you mean files that are added to the source of a Structured Streaming job?

If you ingest with Auto Loader (the cloudFiles source), there are two ways it can identify new files: directory listing mode, which can rely on the lexical ordering of file names, and file notification mode.

For the first one, suppose you add files like this:

You upload FILE_1.parquet to a bucket and later add FILE_2.parquet. Auto Loader identifies FILE_2.parquet as a new file because its name sorts after FILE_1.parquet in lexical order.
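To make the lexical-ordering idea concrete, here is a toy model in plain Python. It is only an illustration of string comparison on file names, not Auto Loader's actual implementation, and the helper name new_files_by_lexical_order is made up for this sketch.

```python
def new_files_by_lexical_order(listing, last_seen):
    """Toy model: treat a file as new if its name sorts strictly after last_seen.

    `listing` is a list of file names; `last_seen` is the greatest name
    already processed, or None if nothing has been processed yet.
    """
    return sorted(
        name for name in listing
        if last_seen is None or name > last_seen
    )


# FILE_2.parquet sorts after FILE_1.parquet, so it counts as new.
print(new_files_by_lexical_order(
    ["FILE_1.parquet", "FILE_2.parquet"], "FILE_1.parquet"))
# ['FILE_2.parquet']

# Caveat of plain string comparison: "FILE_10" sorts BEFORE "FILE_2",
# because the character '1' is less than '2'.
print(new_files_by_lexical_order(
    ["FILE_1.parquet", "FILE_2.parquet", "FILE_10.parquet"], "FILE_2.parquet"))
# []

# Zero-padded names keep lexical order and numeric order in agreement.
print(new_files_by_lexical_order(
    ["FILE_001.parquet", "FILE_002.parquet", "FILE_010.parquet"],
    "FILE_002.parquet"))
# ['FILE_010.parquet']
```

If you depend on lexical ordering, zero-padded or timestamp-prefixed file names are a safer naming scheme than bare counters.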

If you want a more reliable way to detect new files, enable file notification mode.

For more detail, see the documentation:
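As a rough sketch of how the two modes are selected, the helper below builds the Auto Loader options and passes them to a streaming read. The option names (cloudFiles.format, cloudFiles.schemaLocation, cloudFiles.useNotifications) are Auto Loader options; the function names and the paths in the usage comment are assumptions for illustration, and file notification mode also needs cloud permissions that are not shown here.

```python
def auto_loader_options(file_format, schema_location, use_notifications=False):
    """Build an options dict for the cloudFiles source (hypothetical helper)."""
    opts = {
        "cloudFiles.format": file_format,
        # Where Auto Loader stores the inferred schema between runs.
        "cloudFiles.schemaLocation": schema_location,
    }
    if use_notifications:
        # Switch from directory listing mode to file notification mode.
        opts["cloudFiles.useNotifications"] = "true"
    return opts


def read_new_files(spark, source_path, schema_location, use_notifications=False):
    """Return a streaming DataFrame over files arriving in source_path.

    `spark` is the SparkSession that Databricks provides in a notebook or job.
    """
    opts = auto_loader_options("parquet", schema_location, use_notifications)
    return (
        spark.readStream
        .format("cloudFiles")
        .options(**opts)
        .load(source_path)
    )


# Usage on Databricks (placeholder paths, not taken from this thread):
# df = read_new_files(spark, "s3://example-bucket/landing/",
#                     "s3://example-bucket/_schemas/landing/",
#                     use_notifications=True)
```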

https://docs.databricks.com/en/ingestion/auto-loader/directory-listing-mode.html

https://docs.databricks.com/en/ingestion/auto-loader/file-notification-mode.html

Best,

Alessandro

nakany15
New Contributor II

Are you referring to this one?

If so, it is a Workflows trigger that runs a job when a new file arrives in an external location.

https://docs.databricks.com/en/workflows/jobs/file-arrival-triggers.html
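For reference, a file arrival trigger is part of the job's settings. The sketch below builds that fragment as a Python dict and prints it as JSON. The field names follow my understanding of the Jobs API, so please verify them against the linked documentation; the helper name and the storage URL are placeholders I made up.

```python
import json


def file_arrival_trigger(url, min_time_between_triggers_seconds=None,
                         wait_after_last_change_seconds=None):
    """Build the `trigger` fragment of a job's settings (hypothetical helper)."""
    file_arrival = {"url": url}
    # Both timing fields are optional, so include them only when given.
    if min_time_between_triggers_seconds is not None:
        file_arrival["min_time_between_triggers_seconds"] = (
            min_time_between_triggers_seconds
        )
    if wait_after_last_change_seconds is not None:
        file_arrival["wait_after_last_change_seconds"] = (
            wait_after_last_change_seconds
        )
    return {"trigger": {"pause_status": "UNPAUSED", "file_arrival": file_arrival}}


# Placeholder location; use the external location your job should monitor.
settings = file_arrival_trigger(
    "s3://example-bucket/landing/",
    min_time_between_triggers_seconds=60,
)
print(json.dumps(settings, indent=2))
```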
