cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

SocketTimeout exception when running a display command on spark dataframe

Manjusha
New Contributor II

I am using runtime 9.1LTS

I have a R notebook that reads a csv into a R dataframe and does some transformations and finally is converted to spark dataframe using the createDataFrame function.

after that when I call the display function on this spark dataframe, I get the following error

org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 0.0 failed 4 times, most recent failure: Lost task 0.3 in stage 0.0 (TID 3) (10.139.64.4 executor 0): java.net.SocketTimeoutException: Accept timed out

In the spark UI, in stderr, I can see this message

BufferedStream Thread: Error in file(filename, "r", encoding = encoding)

But when I run this same code in the uat environment that has the same runtime(9.1LTS) this is not happening.

Also, till yesterday we were using 8.2 runtime and the error was not happening.

Any help is much appreciated

1 REPLY 1

Anonymous
Not applicable

Hi @Manjusha Unnikrishnan​ 

Great to meet you, and thanks for your question! 

Let's see if your peers in the community have an answer to your question first. Or else bricksters will get back to you soon.

Thanks.

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group