- AddFiles Actions:
  - The AddFiles actions represent the addition of new files to the shared Delta table. These files contain data that has been appended or ingested.
  - To calculate the number of AddFiles actions, you need to consider the total count of files added to the table over time.
  - Each time new data is ingested or appended, it contributes to the AddFiles count.
- RemoveFiles Actions:
  - The RemoveFiles actions represent the removal of files from the shared Delta table. Files are typically removed by operations such as OPTIMIZE or MERGE, which rewrite existing data into new files.
  - To calculate the number of RemoveFiles actions, you need to track the files that have been removed.
  - Keep in mind that even if a file is removed by a maintenance operation like OPTIMIZE, rather than by an explicit delete, it still counts as a RemoveFiles action.
- Total Actions:
  - The total number of actions (both AddFiles and RemoveFiles) in the DeltaLog determines the metadata size of the shared table.
  - You can sum up the AddFiles and RemoveFiles counts to get the total number of actions.
- Monitoring and Tracking:
  - To monitor these actions, you can query the DeltaLog associated with the shared table.
  - Databricks provides APIs and commands, such as DESCRIBE HISTORY, to inspect the DeltaLog and retrieve information about the operations performed on the table.
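
As a rough illustration of the counting described above, here is a minimal Python sketch that tallies actions directly from the JSON commit files in a table's `_delta_log` directory. It relies on the Delta transaction log layout, where each commit file is newline-delimited JSON and each line is a single action keyed by its type (`add`, `remove`, `metaData`, and so on). The function name `count_log_actions` is hypothetical, and the sketch assumes you have direct read access to the log directory. It only sees JSON commits that are still retained and does not read Parquet checkpoints, so treat the result as an estimate rather than the exact figure the sharing service enforces.

```python
import json
import re
from pathlib import Path

# Delta commit files are named with a zero-padded 20-digit version number.
COMMIT_FILE = re.compile(r"\d{20}\.json")


def count_log_actions(delta_log_dir):
    """Count 'add' and 'remove' actions in the JSON commits of a _delta_log directory.

    Hypothetical helper for illustration; checkpoints and expired commits are not counted.
    """
    counts = {"add": 0, "remove": 0}
    for commit_file in sorted(Path(delta_log_dir).iterdir()):
        if not COMMIT_FILE.fullmatch(commit_file.name):
            continue  # skip checkpoints, .crc files, _last_checkpoint, etc.
        with commit_file.open(encoding="utf-8") as handle:
            for line in handle:
                line = line.strip()
                if not line:
                    continue
                action = json.loads(line)
                # Each line holds one action object keyed by its type.
                for key in counts:
                    if key in action:
                        counts[key] += 1
    counts["total"] = counts["add"] + counts["remove"]
    return counts
```

Calling `count_log_actions("/path/to/table/_delta_log")` returns a dictionary with `add`, `remove`, and `total` keys, which you can compare against the limits that apply to your shared table.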
Remember that staying within the specified limits is crucial to ensure the smooth operation of your shared Delta table. If you approach the limits, consider optimizing your data management strategies or archiving older data to maintain a healthy metadata size.
For more detailed information, refer to the official Delta Sharing documentation and explore the spe...12.
Happy sharing! 🚀