05-20-2024 12:15 PM
Hi,
I'm not aware of any built-in Auto Loader functionality that does what you're describing, since Auto Loader is designed to incrementally ingest data as it arrives in cloud storage. Can you say more about "...exclude a specific file until I decided to include it in the load..."? How do you know when to include it in a load?
Perhaps you could consider the Databricks Workflows "File Arrival" trigger (https://docs.databricks.com/en/workflows/jobs/file-arrival-triggers.html#trigger-jobs-when-new-files...). Maybe it could trigger a job run that decides what action to take (i.e. "until I decided to include it in the load") and, once that decision is made, copies the file to an alternate location. You could then have DLT Auto Loader watch that new, copied location instead of the original one.
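To make the "decide, then copy" idea concrete, here is a rough sketch of the job step in plain Python. It is not a built-in Databricks feature: `promote_approved_files`, `landing_dir`, `staging_dir` and the `is_approved` callback are all hypothetical names, and it uses the standard library on local paths so it can be run anywhere.

```python
import shutil
from pathlib import Path
from typing import Callable, List


def promote_approved_files(
    landing_dir: str,
    staging_dir: str,
    is_approved: Callable[[Path], bool],
) -> List[str]:
    """Copy approved files from landing_dir to staging_dir.

    Hypothetical helper: `is_approved` stands in for whatever rule
    decides that a file may now be included in the load.
    Returns the names of the files copied during this call.
    """
    landing = Path(landing_dir)
    staging = Path(staging_dir)
    staging.mkdir(parents=True, exist_ok=True)

    copied = []
    for src in sorted(landing.iterdir()):
        if not src.is_file():
            continue
        dest = staging / src.name
        # Skip files that were already promoted, so reruns are harmless.
        if dest.exists():
            continue
        # Files that are not approved yet simply stay in the landing folder.
        if is_approved(src):
            shutil.copy2(src, dest)
            copied.append(src.name)
    return copied
```

In a real job the two folders would presumably be cloud storage paths and the copy would go through something like `dbutils.fs.cp` rather than `shutil`. The Auto Loader source in the DLT pipeline would then point at the staging location only, so a file held back in the landing folder is never seen until the job promotes it.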
Hope it's helpful.