cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
cancel
Showing results for 
Search instead for 
Did you mean: 

Help trying to calculate a percentage

DanVartanian
New Contributor II

The image below shows what my source data is (HAVE) and what I'm trying to get to (WANT).

I want to be able to calculate the percentage of bad messages (where formattedMessage = false) by source and date.

I'm not sure how to achieve this in DatabricksSql. Any help appreciated.

havewant 

1 ACCEPTED SOLUTION

Accepted Solutions

-werners-
Esteemed Contributor III

you could use a windows function over source and date with a sum of messagecount. This gives you the total per source/date repeated on every line.

Then apply a filter on formattedmessage == false and divide messagecount by the sum above.

View solution in original post

4 REPLIES 4

Kaniz
Community Manager
Community Manager

Hi @ DanVartanian! My name is Kaniz, and I'm the technical moderator here. Great to meet you, and thanks for your question! Let's see if your peers in the community have an answer to your question first. Or else I will get back to you soon. Thanks.

DanVartanian
New Contributor II

Thank you so much

-werners-
Esteemed Contributor III

you could use a windows function over source and date with a sum of messagecount. This gives you the total per source/date repeated on every line.

Then apply a filter on formattedmessage == false and divide messagecount by the sum above.

Thank you, I was able to get it following your instructions😀