Data Engineering

Forum Posts


by Tahseen0354 • Valued Contributor

10-14-2021 10:45:35 AM

28458 Views
9 replies
5 kudos

Resolved! Getting "Job aborted due to stage failure" SparkException when trying to download full result

I generated a result using SQL, but whenever I try to download the full result (1 million rows), it throws a SparkException. I can download the preview result but not the full result. Why? What happens under the hood when I try to download ...

Latest Reply

ac567
New Contributor III

12-13-2024 1:56:15 PM

5 kudos

Job aborted due to stage failure: Task 6506 in stage 46.0 failed 4 times, most recent failure: Lost task 6506.3 in stage 46.0 (TID 12896) (10.**.***.*** executor 12): java.lang.OutOfMemoryError: Cannot reserve 4194304 bytes of direct buffer memory (a...

8 More Replies

by nadia • New Contributor II

06-12-2022 2:19:33 PM

30539 Views
4 replies
2 kudos

Resolved! Executor heartbeat timed out

Hello, I'm trying to read a table that is located in PostgreSQL and contains 28 million rows. I get the following result: "SparkException: Job aborted due to stage failure: Task 0 in stage 0.0 failed 4 times, most recent failure: Lost task 0.3 in sta...
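
For readers hitting the same error: without partitioning options, Spark reads a JDBC table through a single connection in one task, which can be slow for a table of this size. Whether that is the cause here is an assumption, since the excerpt is truncated. The sketch below builds the standard Spark JDBC partitioning options (`partitionColumn`, `lowerBound`, `upperBound`, `numPartitions`, `fetchsize`). The helper name `jdbc_read_options`, the URL, the table name, the `id` column, and the bounds are all hypothetical and not taken from the post.

```python
def jdbc_read_options(url, table, partition_column, lower_bound, upper_bound,
                      num_partitions=16, fetch_size=10000):
    """Build options for a partitioned Spark JDBC read (hypothetical helper)."""
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")
    if lower_bound > upper_bound:
        raise ValueError("lower_bound must not exceed upper_bound")
    return {
        "url": url,
        "dbtable": table,
        "driver": "org.postgresql.Driver",
        # Numeric, date, or timestamp column used to split the read into ranges.
        "partitionColumn": partition_column,
        # The bounds only decide the partition stride; they do not filter rows.
        "lowerBound": str(lower_bound),
        "upperBound": str(upper_bound),
        "numPartitions": str(num_partitions),
        # Number of rows fetched per round trip to the database.
        "fetchsize": str(fetch_size),
    }


# All values below are placeholders chosen for illustration.
options = jdbc_read_options(
    url="jdbc:postgresql://db.example.com:5432/mydb",
    table="public.events",
    partition_column="id",
    lower_bound=1,
    upper_bound=28_000_000,
)

# On a cluster with a SparkSession named `spark` (credentials omitted here):
# df = spark.read.format("jdbc").options(**options).load()
```

With these options Spark issues one query per partition range instead of one query for the whole table, so no single task has to pull all 28 million rows.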

Latest Reply

SparkJun
Databricks Employee

06-18-2024 1:52:44 PM

2 kudos

Please also review the Spark UI to see the failed Spark job and Spark stage. Please check on the GC time and data spill to memory and disk. See if there is any error in the failed task in the Spark stage view. This will confirm data skew or GC/memory...
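
As a rough complement to inspecting the Spark UI, data skew can also be estimated from per-partition row counts. The sketch below is a minimal illustration, not something from the reply itself: `skew_ratio` is a hypothetical helper and the sample counts are made up. It compares the largest partition to the median partition; a ratio far above 1 suggests that one task is doing most of the work.

```python
from statistics import median


def skew_ratio(rows_per_partition):
    """Return largest partition size divided by the median partition size."""
    counts = list(rows_per_partition)
    if not counts:
        raise ValueError("need at least one partition count")
    mid = median(counts)
    if mid == 0:
        # Avoid division by zero when at least half the partitions are empty.
        return float("inf") if max(counts) > 0 else 1.0
    return max(counts) / mid


# Made-up counts: four similar partitions and one much larger one.
sample_counts = [1000, 1100, 950, 1050, 9000]
print(f"skew ratio: {skew_ratio(sample_counts):.2f}")

# On a cluster, real counts could be collected with, for example:
# from pyspark.sql.functions import spark_partition_id
# rows = df.groupBy(spark_partition_id()).count().collect()
```

The median is used instead of the mean so that a single oversized partition does not inflate the baseline it is compared against.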

3 More Replies

by Data_Analytics1 • Contributor III

02-03-2023 12:07:42 AM

20741 Views
8 replies
2 kudos

TimeoutException: Futures timed out after [5 seconds]. I am getting this error while running a few parallel jobs at an interval of 5 minutes.

java.util.concurrent.TimeoutException: Futures timed out after [5 seconds] at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:259) at scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:263) at scala.concurrent.Await$.$...

Latest Reply

Anonymous
Not applicable

04-08-2023 12:38:19 AM

2 kudos

Hi @Mahesh Chahare, hope everything is going great. Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us s...

7 More Replies

by arul_parthiban • New Contributor

09-23-2022 8:42:43 AM

1168 Views
0 replies
0 kudos

How to customize the result of COPY INTO command?

The COPY INTO statement produces results in the following format, which resembles the results of an INSERT INTO statement and is a summary of all the files loaded. Is there a way to customize it so that it produces detailed results at the file level?

by data_boy_2022 • New Contributor III

08-19-2022 1:51:44 PM

3783 Views
2 replies
0 kudos

Resolved! Writing transformed DataFrame to a persistent table is unbearably slow

I want to transform a DF with a simple UDF. Afterwards, I want to store the resulting DF in a new table (see code below):

key = "test_key"
schema = StructType([ StructField("***", StringType(), True), StructField("yyy", StringType(), True), StructF...

Latest Reply

Vidula
Honored Contributor

09-11-2022 11:48:29 PM

0 kudos

Hello @Jan R, hope all is well! Just wanted to check in if you were able to resolve your issue, and whether you would be happy to share the solution or mark an answer as best. Otherwise, please let us know if you need more help. We'd love to hear from you. Thanks!

1 More Replies
