Thursday
Hi,
I had a query regarding how to accurately retrieve the last data refresh datetime and last metadata refresh datetime for tables in Databricks. Currently, the only reliable approach I am aware of is using the DESCRIBE HISTORY command and filtering the operation column on values such as WRITE, UPDATE, DELETE, MERGE, or ALTER TABLE.
However, I could not find a comprehensive list of all possible values that can appear in the operation column of DESCRIBE HISTORY, which I need in order to separate data operations from metadata operations. The documentation here (Databricks Delta Table History) only provides examples of operation metric key values, not the full list of supported operations.
Could you please help me?
Thank you!
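A minimal sketch of the approach described above, for illustration only. It assumes the history rows have already been collected into dictionaries with `timestamp` and `operation` keys (on Databricks these could come from something like `[r.asDict() for r in spark.sql("DESCRIBE HISTORY my_table").collect()]`, where `my_table` is a placeholder). The two operation sets contain only names mentioned in this thread and are deliberately incomplete; the helper name `last_refresh_times` is hypothetical.

```python
from datetime import datetime

# Illustrative, incomplete groupings (assumption): only operation names
# mentioned in this thread. Extend them with whatever your tables show.
DATA_OPERATIONS = {"WRITE", "UPDATE", "DELETE", "MERGE"}
METADATA_OPERATIONS = {"ADD COLUMNS", "DROP COLUMNS"}


def last_refresh_times(history_rows):
    """Return (last_data_refresh, last_metadata_refresh).

    Each row is a dict with a 'timestamp' (datetime) and an 'operation' (str).
    An element is None when no operation from that group is present.
    """
    last_data = None
    last_metadata = None
    for row in history_rows:
        operation = row["operation"].upper()
        timestamp = row["timestamp"]
        if operation in DATA_OPERATIONS:
            if last_data is None or timestamp > last_data:
                last_data = timestamp
        elif operation in METADATA_OPERATIONS:
            if last_metadata is None or timestamp > last_metadata:
                last_metadata = timestamp
    return last_data, last_metadata


# Hypothetical sample rows standing in for DESCRIBE HISTORY output.
sample_history = [
    {"timestamp": datetime(2024, 1, 1, 9, 0), "operation": "WRITE"},
    {"timestamp": datetime(2024, 1, 2, 10, 30), "operation": "ADD COLUMNS"},
    {"timestamp": datetime(2024, 1, 3, 8, 15), "operation": "MERGE"},
]
last_data, last_metadata = last_refresh_times(sample_history)
print("last data refresh:", last_data)
print("last metadata refresh:", last_metadata)
```

Operations that fall into neither set are ignored here, which is exactly why a complete list of operation names matters for this approach.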
Thursday
Hi @gayatrikhatale,
I think you have already found the correct place in the documentation. It lists all supported operations in the first column.
https://docs.databricks.com/aws/en/delta/history?#operation-metrics-key
Thursday
Hi @szymon_dybczak,
Thank you for your response.
If this is the correct documentation, then why am I not able to see operations like ADD COLUMNS or DROP COLUMNS listed there? I have observed these operation types in the output of DESCRIBE HISTORY, but they don't seem to be documented in the link.
Thank you!
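One way to surface such gaps is to compare the operation values actually observed in the history against a list you maintain yourself. This is only a rough sketch under stated assumptions: the rows are plain dictionaries with an `operation` key, the `known_operations` set below is a placeholder rather than the documented list, and the helper name `undocumented_operations` is hypothetical.

```python
def undocumented_operations(history_rows, known_operations):
    """Return the sorted operation names seen in history but absent from known_operations."""
    observed = {row["operation"] for row in history_rows}
    return sorted(observed - set(known_operations))


# Placeholder list (assumption): not the actual documented set of operations.
known_operations = {"WRITE", "UPDATE", "DELETE", "MERGE"}

# Hypothetical rows standing in for the 'operation' column of DESCRIBE HISTORY.
sample_history = [
    {"operation": "WRITE"},
    {"operation": "ADD COLUMNS"},
    {"operation": "DROP COLUMNS"},
    {"operation": "ADD COLUMNS"},
]

missing = undocumented_operations(sample_history, known_operations)
print(missing)
```

Running a check like this across many tables gives an empirical list of operation names to classify, without relying on the documentation being complete.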
Thursday
Thanks for the information. To be honest, I had always thought that this table contained all supported operations. I learn something new every day.
Thursday
Hi,
I had a similar need once and referred to the delta-io Git repository for the full list of metrics. It is not the best way, but it gives an idea of everything that is available in the metrics. Hope it helps.
Regards,
Siva
Thursday
Hi @siva-anantha,
This is super useful, thanks for sharing. I've prepared a list of all distinct operations based on the source code, for reference:
yesterday
Thank you, @szymon_dybczak and @siva-anantha!