Monitor and Alert on Databricks Resource Utilization and Cost Consumption
02-19-2024 09:07 AM
We want to build a monitoring and alerting solution for Azure Databricks that captures resource utilization (such as aggregated CPU % and memory %) and cost consumption at the account level.
We have Unity Catalog enabled, with multiple workspaces attached.
We prefer to use diagnostic settings with an Azure Log Analytics workspace, as we do not want additional cost overhead. Diagnostic settings are enabled for clusters, jobs, Unity Catalog, notebooks, accounts, and so on.
Which KPIs and metrics should we consider, and how should each KPI be evaluated?
How can alerting be implemented for Databricks using Log Analytics?
02-20-2024 12:05 AM
@smehta_0908 Greetings!
You can use Datadog to monitor the CPU and memory of your clusters:
https://docs.datadoghq.com/integrations/databricks/?tab=driveronly
For cost consumption at the account level, you can use the billable usage logs, which are available through the Account API:
https://docs.databricks.com/en/administration-guide/account-settings/usage.html
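As a rough illustration of the billable usage route, here is a minimal Python sketch that builds the download URL and sums DBUs per workspace from the returned CSV. Several things here are assumptions rather than facts from this thread: the accounts host name, the `usage/download` path with its `start_month` and `end_month` parameters, and the `workspaceId` and `dbus` column names. Please check them against the Account API reference for your cloud, and confirm that the endpoint is available for an Azure account at all. The helper names and the DBU limit are hypothetical.

```python
import csv
import io
from collections import defaultdict
from urllib.parse import urlencode

# Assumption: accounts console host; it differs per cloud, so verify it.
ACCOUNTS_HOST = "https://accounts.cloud.databricks.com"


def usage_download_url(account_id, start_month, end_month):
    """Build the billable usage download URL (months given as YYYY-MM)."""
    query = urlencode({"start_month": start_month, "end_month": end_month})
    return f"{ACCOUNTS_HOST}/api/2.0/accounts/{account_id}/usage/download?{query}"


def dbus_by_workspace(csv_text):
    """Sum the 'dbus' column per 'workspaceId' (column names are assumed)."""
    totals = defaultdict(float)
    for row in csv.DictReader(io.StringIO(csv_text)):
        totals[row["workspaceId"]] += float(row["dbus"])
    return dict(totals)


def over_budget(totals, dbu_limit):
    """Return the workspace IDs whose total DBUs exceed a hypothetical limit."""
    return sorted(ws for ws, dbus in totals.items() if dbus > dbu_limit)
```

You would fetch the URL with an authenticated HTTP client of your choice, pass the response text to `dbus_by_workspace`, and feed the output of `over_budget` into whatever alerting channel you already use.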

