cancel
Showing results for 
Search instead for 
Did you mean: 
Warehousing & Analytics
cancel
Showing results for 
Search instead for 
Did you mean: 

How to get cluster metrics by Power BI?

Mado
Valued Contributor II

Assume that I have a few clusters in Databricks and I want to make a report in Power BI that shows cluster metrics. For example:

  • How many hours clusters were running categorized by username
  • Cost
  • CPU load

Is there any way that I can send this information from Databricks to Power BI?

Also, I want it to work continuously (not only one time).

1 ACCEPTED SOLUTION

Accepted Solutions

Anonymous
Not applicable

@Mohammad Saber​ :

Here's an overview of how you can set up a pipeline to send cluster metrics from Databricks to Power BI:

Configure the Databricks cluster to send logs to an Azure Event Hub or Azure Log Analytics workspace. You can do this by following the instructions in the Databricks documentation:

Create an Azure Stream Analytics job that reads the logs from the Event Hub or Log Analytics workspace, transforms the data as needed, and sends it to Power BI. You can use the Azure Stream Analytics portal to create the job and define the input, query, and output. You can find more information on how to create a Stream Analytics job in the Azure documentation:

In Power BI, create a new report that connects to the Stream Analytics output and shows the desired metrics. You can use the Power BI Desktop or the Power BI service to create the report. You can find more information on how to create a report in the Power BI documentation:

Once you have set up the pipeline, the data will flow continuously from Databricks to Power BI, and you can monitor the cluster metrics in real-time.

View solution in original post

2 REPLIES 2

Anonymous
Not applicable

@Mohammad Saber​ :

Here's an overview of how you can set up a pipeline to send cluster metrics from Databricks to Power BI:

Configure the Databricks cluster to send logs to an Azure Event Hub or Azure Log Analytics workspace. You can do this by following the instructions in the Databricks documentation:

Create an Azure Stream Analytics job that reads the logs from the Event Hub or Log Analytics workspace, transforms the data as needed, and sends it to Power BI. You can use the Azure Stream Analytics portal to create the job and define the input, query, and output. You can find more information on how to create a Stream Analytics job in the Azure documentation:

In Power BI, create a new report that connects to the Stream Analytics output and shows the desired metrics. You can use the Power BI Desktop or the Power BI service to create the report. You can find more information on how to create a report in the Power BI documentation:

Once you have set up the pipeline, the data will flow continuously from Databricks to Power BI, and you can monitor the cluster metrics in real-time.

Anonymous
Not applicable

Hi @Mohammad Saber​ 

Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. 

We'd love to hear from you.

Thanks!

Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.