SocketTimeout exception when running a display command on a Spark DataFrame
10-13-2022 05:16 AM
I am using Databricks Runtime 9.1 LTS.
I have an R notebook that reads a CSV into an R data.frame, does some transformations, and finally converts it to a Spark DataFrame using the createDataFrame function.
After that, when I call the display function on this Spark DataFrame, I get the following error:
org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 0.0 failed 4 times, most recent failure: Lost task 0.3 in stage 0.0 (TID 3) (10.139.64.4 executor 0): java.net.SocketTimeoutException: Accept timed out
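For context, here is a minimal sketch of the code path described above. The file path and column names are placeholders; the real notebook does more transformation work than this.

```r
library(SparkR)

# Read the CSV into a plain R data.frame
# ("/dbfs/tmp/input.csv" and the columns qty/price are hypothetical)
r_df <- read.csv("/dbfs/tmp/input.csv", stringsAsFactors = FALSE)

# ... transformations on the local R data.frame, e.g.
r_df$total <- r_df$qty * r_df$price

# Convert the local R data.frame into a Spark DataFrame
spark_df <- createDataFrame(r_df)

# display() forces a Spark job to run; on 9.1 LTS this is where the
# java.net.SocketTimeoutException ("Accept timed out") surfaces
display(spark_df)
```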
In the Spark UI, under stderr, I can see this message:
BufferedStream Thread: Error in file(filename, "r", encoding = encoding)
But when I run the same code in the UAT environment, which has the same runtime (9.1 LTS), this does not happen.
Also, until yesterday we were on the 8.2 runtime, and the error did not occur there either.
Any help is much appreciated.
- Labels:
  - Dataframe
  - Display Command
  - Stage failure
11-24-2022 10:36 PM
Hi @Manjusha Unnikrishnan,
Great to meet you, and thanks for your question!
Let's first see whether your peers in the community have an answer to your question; otherwise, bricksters will get back to you soon.
Thanks.

