Warehousing & Analytics
Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Community. Share insights, tips, and best practices for leveraging data for informed decision-making.

Unable to obtain count of dataframe

qwerty3
Contributor

I am unable to obtain a count of a dataframe; the job always gets stuck at one stage. I have tried reducing the size of the data. What could be the issue? How can I read the cluster logs to identify it?



 

3 REPLIES

qwerty3
Contributor

[Screenshot attachment: qwerty3_0-1727365156978.png]

 

szymon_dybczak
Esteemed Contributor III

You should look at the Spark Web UI and investigate the information there. There could be several causes, such as:

- skewed data

- memory issues

- shuffling data
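To check the first cause on the list, one rough approach is to compare per-partition row counts. The sketch below is only an illustration: `partition_row_counts` and `skew_ratio` are hypothetical helper names, and it assumes `df` is a PySpark DataFrame. Collecting the counts runs a Spark job of its own, so it may be more practical on an intermediate dataframe that does complete than on the one that hangs.

```python
from statistics import median


def partition_row_counts(df):
    """Collect the number of rows in each Spark partition of `df`.

    Assumes `df` is a PySpark DataFrame. The import is local so the rest of
    this sketch can be used without PySpark installed. Empty partitions do
    not appear in the result.
    """
    from pyspark.sql.functions import spark_partition_id

    rows = df.groupBy(spark_partition_id().alias("pid")).count().collect()
    return [row["count"] for row in rows]


def skew_ratio(partition_counts):
    """Return largest partition size divided by the median partition size.

    A value near 1.0 means the rows are spread evenly; a large value means
    a few partitions hold most of the rows.
    """
    if not partition_counts:
        raise ValueError("no partition counts given")
    mid = median(partition_counts)
    largest = max(partition_counts)
    if mid == 0:
        # Most partitions are empty: any non-empty one is extreme skew.
        return float("inf") if largest > 0 else 1.0
    return largest / mid
```

In a notebook this could be called as `skew_ratio(partition_row_counts(df))`. A ratio far above 1 would be consistent with one long-running task in the stuck stage, which the Spark Web UI's task list for that stage should also show.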

 

qwerty3
Contributor

Driver memory is sufficient: it can handle 90 lakh (9 million) records, and what I am giving it is definitely less than that. What can I do about skewed data and shuffling?
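One commonly suggested starting point for skew and shuffle problems is Spark's Adaptive Query Execution (AQE). The sketch below is not a confirmed fix for this thread: `apply_settings` and `SKEW_AND_SHUFFLE_SETTINGS` are hypothetical names, `spark` is assumed to be the session object that Databricks notebooks provide, and recent runtimes may already have some of these options enabled.

```python
# Spark SQL options related to skew and shuffle handling (Spark 3.x).
SKEW_AND_SHUFFLE_SETTINGS = {
    # Re-optimize the query plan at runtime using shuffle statistics.
    "spark.sql.adaptive.enabled": "true",
    # Split oversized partitions on the skewed side of a sort-merge join.
    "spark.sql.adaptive.skewJoin.enabled": "true",
    # Merge many small shuffle partitions into fewer, larger ones.
    "spark.sql.adaptive.coalescePartitions.enabled": "true",
}


def apply_settings(spark, settings=None):
    """Set each option on the session's runtime config and return what was set.

    `spark` only needs a `.conf.set(key, value)` method, as a SparkSession has.
    """
    applied = dict(SKEW_AND_SHUFFLE_SETTINGS if settings is None else settings)
    for key, value in applied.items():
        spark.conf.set(key, value)
    return applied
```

Calling `apply_settings(spark)` before re-running the count would apply the options for the current session. If a single key still dominates after that, repartitioning on a more evenly distributed column is another option to try.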
