cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

ConnectException: Connection refused (Connection refused) This is often caused by an OOM error

Rnmj
New Contributor III

I am trying to run a python code where a json file is flattened to pipe separated file . The code works with smaller files but for huge files of 2.4 GB I get below error:

ConnectException: Connection refused (Connection refused)

Error while obtaining a new communication channel

ConnectException error: This is often caused by an OOM error that causes the connection to the Python REPL to be closed. Check your query's memory usage.

Databricks version 9.1 LTS

The cluster is 5 node Standard_DS4_V2

3 REPLIES 3

-werners-
Esteemed Contributor III

jose_gonzalez
Databricks Employee
Databricks Employee

hi @RN mj​ ,

Could you provide more details? how do you read your JSON file? are you using an autoscaling cluster? what is the full error stack-trace?

Rnmj
New Contributor III

Hi @Jose Gonzalez​ , @Werner Stinckens​  @Kaniz Fatma​ ,

Thanks for your response .Appreciate a lot.

The issue was in the code, it was a python /panda code running on Spark. Due to this only driver node was being used. i did validate this by increasing the driver configuration. The next steps is to revisit the code and use RDD/dataframes so code has some parallel processing

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group