cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Warehousing & Analytics
Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Community. Share insights, tips, and best practices for leveraging data for informed decision-making.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

How to get cluster metrics by Power BI?

Mado
Valued Contributor II

Assume that I have a few clusters in Databricks and I want to make a report in Power BI that shows cluster metrics. For example:

  • How many hours clusters were running categorized by username
  • Cost
  • CPU load

Is there any way that I can send this information from Databricks to Power BI?

Also, I want it to work continuously (not only one time).

1 ACCEPTED SOLUTION

Accepted Solutions

Anonymous
Not applicable

@Mohammad Saberโ€‹ :

Here's an overview of how you can set up a pipeline to send cluster metrics from Databricks to Power BI:

Configure the Databricks cluster to send logs to an Azure Event Hub or Azure Log Analytics workspace. You can do this by following the instructions in the Databricks documentation:

Create an Azure Stream Analytics job that reads the logs from the Event Hub or Log Analytics workspace, transforms the data as needed, and sends it to Power BI. You can use the Azure Stream Analytics portal to create the job and define the input, query, and output. You can find more information on how to create a Stream Analytics job in the Azure documentation:

In Power BI, create a new report that connects to the Stream Analytics output and shows the desired metrics. You can use the Power BI Desktop or the Power BI service to create the report. You can find more information on how to create a report in the Power BI documentation:

Once you have set up the pipeline, the data will flow continuously from Databricks to Power BI, and you can monitor the cluster metrics in real-time.

View solution in original post

2 REPLIES 2

Anonymous
Not applicable

@Mohammad Saberโ€‹ :

Here's an overview of how you can set up a pipeline to send cluster metrics from Databricks to Power BI:

Configure the Databricks cluster to send logs to an Azure Event Hub or Azure Log Analytics workspace. You can do this by following the instructions in the Databricks documentation:

Create an Azure Stream Analytics job that reads the logs from the Event Hub or Log Analytics workspace, transforms the data as needed, and sends it to Power BI. You can use the Azure Stream Analytics portal to create the job and define the input, query, and output. You can find more information on how to create a Stream Analytics job in the Azure documentation:

In Power BI, create a new report that connects to the Stream Analytics output and shows the desired metrics. You can use the Power BI Desktop or the Power BI service to create the report. You can find more information on how to create a report in the Power BI documentation:

Once you have set up the pipeline, the data will flow continuously from Databricks to Power BI, and you can monitor the cluster metrics in real-time.

Anonymous
Not applicable

Hi @Mohammad Saberโ€‹ 

Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. 

We'd love to hear from you.

Thanks!

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโ€™t want to miss the chance to attend and share knowledge.

If there isnโ€™t a group near you, start one and help create a community that brings people together.

Request a New Group