Hi @Digvijay_11 ,
1. You can use metadata column for that purpose File metadata column - Azure Databricks | Microsoft Learn
2. With the default setting (cloudFiles.allowOverwrites = false), files are processed exactly once. When a file is appended to or overwritten, Auto Loader cannot guarantee which file version will be processed. To allow Auto Loader to process the file again when it is appended to or overwritten, you can set cloudFiles.allowOverwrites to true. In this case, Auto Loader is guaranteed to process the latest version of the file. However, Auto Loader cannot guarantee which intermediate version is processed.
3. With cloudFiles.allowOverwrites = true, Auto Loader will reprocess the entire file even when it is appended or partially updated.
So: