Data Engineering

Forum Posts

Sorted by:

by him • New Contributor III

08-25-2022 12:08:00 AM

19210 Views
13 replies
9 kudos

i am getting the below error while making a GET request to job in databrick after successfully running it

"error_code": "INVALID_PARAMETER_VALUE", "message": "Retrieving the output of runs with multiple tasks is not supported. Please retrieve the output of each individual task run instead."}

Data Engineering

19210 Views
13 replies
9 kudos

08-25-2022 12:08:00 AM

View Replies

Latest Reply

Octavian1
Contributor

yesterday

9 kudos

Hi @Debayan I'd suggest to also mention this explicitly in the documentation of the workspace client for get_run_outputOne has to pay extra attention to the examplerun_id=run.tasks[0].run_id otherwise it can be easily missed.

9 kudos

yesterday

12 More Replies

by MrJava • New Contributor III

02-07-2023 8:59:29 AM

11114 Views
16 replies
12 kudos

How to know, who started a job run?

Hi there!We have different jobs/workflows configured in our Databricks workspace running on AWS and would like to know who actually started the job run? Are they started by a user or a service principle using curl?Currently one can only see, who is t...

Data Engineering

11114 Views
16 replies
12 kudos

02-07-2023 8:59:29 AM

View Replies

Latest Reply

Ayush_Arora
New Contributor II

01-09-2025 4:40:27 AM

12 kudos

The system table solution works only when the job is manually triggered each time. I have a job which is triggered using the job scheduler on databricks. So once someone resumes the trigger, the job goes into execution. After this, the audit tables d...

12 kudos

01-09-2025 4:40:27 AM

15 More Replies

by Tahseen0354 • Valued Contributor

10-14-2021 10:45:35 AM

24240 Views
9 replies
5 kudos

Resolved! Getting "Job aborted due to stage failure" SparkException when trying to download full result

I have generated a result using SQL. But whenever I try to download the full result (1 million rows), it is throwing SparkException. I can download the preview result but not the full result. Why ? What happens under the hood when I try to download ...

Data Engineering

24240 Views
9 replies
5 kudos

10-14-2021 10:45:35 AM

View Replies

Latest Reply

ac567
New Contributor III

12-13-2024 1:56:15 PM

5 kudos

Job aborted due to stage failure: Task 6506 in stage 46.0 failed 4 times, most recent failure: Lost task 6506.3 in stage 46.0 (TID 12896) (10.**.***.*** executor 12): java.lang.OutOfMemoryError: Cannot reserve 4194304 bytes of direct buffer memory (a...

5 kudos

12-13-2024 1:56:15 PM

8 More Replies

by cmilligan • Contributor II

11-11-2022 9:09:24 AM

4317 Views
4 replies
4 kudos

Dropdown for parameters in a job

I want to be able to denote the type of run from a predetermined list of values that a user can choose from when kicking off a run using different parameters. Our team does standardized job runs on a weekly cadence but can have timeframes that change...

Data Engineering

4317 Views
4 replies
4 kudos

11-11-2022 9:09:24 AM

View Replies

Latest Reply

Leon_K
New Contributor II

12-11-2024 10:23:42 AM

4 kudos

I'm looking to this too. Wonder if there a way to make as a drop down for a job parameter

4 kudos

12-11-2024 10:23:42 AM

3 More Replies

by ImAbhishekTomar • New Contributor III

10-07-2022 6:45:43 AM

10331 Views
7 replies
4 kudos

kafkashaded.org.apache.kafka.common.errors.TimeoutException: topic-downstream-data-nonprod not present in metadata after 60000 ms.

I am facing an error when trying to write data on Kafka using spark stream.#Extract source_stream_df= (spark.readStream .format("cosmos.oltp.changeFeed") .option("spark.cosmos.container", PARM_CONTAINER_NAME) .option("spark.cosmos.read.inferSchema.en...

Data Engineering

10331 Views
7 replies
4 kudos

10-07-2022 6:45:43 AM

View Replies

Latest Reply

devmehta
New Contributor III

09-10-2024 2:54:28 AM

4 kudos

What event hub namespace you were using?I had same problem and resolved by changing pricing plan from basic to standard as Kafka apps is not supporting in basic planLet me know if you had anything else. Thanks

4 kudos

09-10-2024 2:54:28 AM

6 More Replies

by karolinalbinsso • New Contributor II

05-03-2022 2:18:01 AM

3288 Views
2 replies
3 kudos

Resolved! How to access the job-Scheduling Date from within the notebook?

I have created a job that contains a notebook that reads a file from Azure Storage. The file-name contains the date of when the file was transferred to the storage. A new file arrives every Monday, and the read-job is scheduled to run every Monday. I...

Data Engineering

3288 Views
2 replies
3 kudos

05-03-2022 2:18:01 AM

View Replies

Latest Reply

Hubert-Dudek
Esteemed Contributor III

05-03-2022 4:58:42 AM

3 kudos

Hi, I guess the files are in the same directory structure so that you can use cloud files autoloader. It will incrementally read only new files https://docs.microsoft.com/en-us/azure/databricks/spark/latest/structured-streaming/auto-loaderSo it will ...

3 kudos

05-03-2022 4:58:42 AM

1 More Replies

by kjoth • Contributor II

02-01-2022 11:38:45 PM

20751 Views
9 replies
7 kudos

How to make the job fail via code after handling exception

Hi , We are capturing the exception if an error occurs using try except. But we want the job status to be failed once we got the exception. Whats the best way to do that. We are using pyspark.

Data Engineering

20751 Views
9 replies
7 kudos

02-01-2022 11:38:45 PM

View Replies

Latest Reply

kumar_ravi
New Contributor III

09-04-2024 4:19:28 AM

7 kudos

you can do some hack arround dbutils = get_dbutils(spark) tables_with_exceptions = [] for table_config in table_configs: try: process(spark, table_config) except Exception as e: exception_detail = f"Error p...

7 kudos

09-04-2024 4:19:28 AM

8 More Replies

by hanish • New Contributor II

01-25-2023 1:22:35 AM

3606 Views
5 replies
2 kudos

Job cluster support in jobs/runs/submit API

We are using jobs/runs/submit API of databricks to create and trigger a one-time run with new_cluster and existing_cluster configuration. We would like to check if there is provision to pass "job_clusters" in this API to reuse the same cluster across...

Data Engineering

3606 Views
5 replies
2 kudos

01-25-2023 1:22:35 AM

View Replies

Latest Reply

Nagrjuna
New Contributor II

09-03-2024 1:15:37 PM

2 kudos

Hi, Any update on the above mentioned issue? Unable to submit a one time new job run (api/2.0 or 21/jobs/runs/submit) with shared job cluster or one new cluster has to be used for all TASKs in the job

2 kudos

09-03-2024 1:15:37 PM

4 More Replies

by Mohit_m • Valued Contributor II

06-15-2022 5:23:13 AM

28661 Views
3 replies
4 kudos

Resolved! How to get the Job ID and Run ID and save into a database

We are having Databricks Job running with main class and JAR file in it. Our JAR file code base is in Scala. Now, when our job starts running, we need to log Job ID and Run ID into the database for future purpose. How can we achieve this?

Data Engineering

28661 Views
3 replies
4 kudos

06-15-2022 5:23:13 AM

View Replies

Latest Reply

Bruno-Castro
New Contributor II

05-08-2024 1:05:13 AM

4 kudos

That article is for members only, can we also specify here how to do it (for those that are not medium members?). Thanks!

4 kudos

05-08-2024 1:05:13 AM

2 More Replies

by lstk • New Contributor

04-26-2022 8:47:18 AM

2863 Views
2 replies
1 kudos

Resolved! Job ID value out of range - Azure Logic App Connector

Hello everybody,i tried to build a Logic App Custom Connector following this one explanation. (https://medium.com/@poojaanilshinde/create-azure-logic-apps-custom-connector-for-azure-databricks-e51f4524ab27)Now i run in the following Problem and wante...

Data Engineering

2863 Views
2 replies
1 kudos

04-26-2022 8:47:18 AM

View Replies

Latest Reply

stefnhuy
New Contributor III

10-20-2023 1:43:41 AM

1 kudos

Hey Lukas,I can totally relate to the frustration of encountering those confounding errors when building custom connectors in Azure Logic Apps. The "Job ID value out of range" issue can be quite perplexing, but fear not, for there's a solution on the...

1 kudos

10-20-2023 1:43:41 AM

1 More Replies

by brickster_2018 • Databricks Employee

06-25-2021 11:43:48 AM

3112 Views
2 replies
0 kudos

Resolved! The driver is temporarily unavailable

My job fails with Driver is temporarily unavailable. Apparently, it's permanently unavailable, because the job is not pausing but failing.

Data Engineering

3112 Views
2 replies
0 kudos

06-25-2021 11:43:48 AM

View Replies

Latest Reply

Chalki
New Contributor III

08-14-2023 1:10:17 PM

0 kudos

I am facing the same issues . I am writing in batches using a simple for loop. I don't have any collect statements inside the loop. I am rewriting the partitions with partition overwrite dynamic mode in a huge wide delta table - several tb. The incr...

0 kudos

08-14-2023 1:10:17 PM

1 More Replies

by ravi28 • New Contributor III

06-07-2023 5:40:47 PM

18874 Views
7 replies
8 kudos

How to setup Job notifications using Microsoft Teams webhook ?

Couple of things I tried:1. I created a webhook connector in msft teams and copied it Notifications destinations via Admin page -> New destination -> from dropdown I selected Microsoft teams -> added webhook url and saved it.outcome: I don't get the ...

Data Engineering

18874 Views
7 replies
8 kudos

06-07-2023 5:40:47 PM

View Replies

Latest Reply

youssefmrini
Databricks Employee

08-08-2023 8:49:56 AM

8 kudos

You can set up job notifications for Databricks jobs using Microsoft Teams webhooks by following these steps:Set up a Microsoft Teams webhook:Go to the channel where you want to receive notifications in Microsoft Teams.Click on the "..." icon next to...

8 kudos

08-08-2023 8:49:56 AM

6 More Replies

by dave_hiltbrand • New Contributor II

06-22-2023 7:47:26 PM

5964 Views
3 replies
0 kudos

I have a job with multiple tasks running asynchronously and I don't think its leveraging all the nodes on the cluster based on runtime.

I have a job with multiple tasks running asynchronously and I don't think its leveraging all the nodes on the cluster based on runtime. I open the Spark UI for the cluster and checkout the executors and don't see any tasks for my worker nodes. How ca...

Data Engineering

5964 Views
3 replies
0 kudos

06-22-2023 7:47:26 PM

View Replies

Latest Reply

Anonymous
Not applicable

06-23-2023 12:18:56 AM

0 kudos

Hi @Dave Hiltbrand Great to meet you, and thanks for your question! Let's see if your peers in the community have an answer to your question. Thanks.

0 kudos

06-23-2023 12:18:56 AM

2 More Replies

by Data_Analytics1 • Contributor III

06-20-2023 3:33:52 AM

2582 Views
1 replies
0 kudos

Getting JsonParseException: Unexpected character ('<' (code 60))

I have a scheduled job that is executed using a notebook. Within one of the notebook cells, there is a check to determine if a table exists. However, even when the table does exist, it incorrectly identifies it as non-existent and proceeds to execut...

Data Engineering

2582 Views
1 replies
0 kudos

06-20-2023 3:33:52 AM

View Replies

Latest Reply

Anonymous
Not applicable

06-20-2023 8:20:39 PM

0 kudos

Hi @Mahesh Chahare Great to meet you, and thanks for your question! Let's see if your peers in the community have an answer to your question. Thanks.

0 kudos

06-20-2023 8:20:39 PM

by Pras1 • New Contributor II

06-08-2023 10:34:03 AM

9412 Views
2 replies
2 kudos

Resolved! AZURE_QUOTA_EXCEEDED_EXCEPTION - even with more than vCPUs than Databricks recommends

I am running this Delta Live Tables PoC from databricks-industry-solutions/industry-solutions-blueprintshttps://github.com/databricks-industry-solutions/pos-dltI have Standard_DS4_v2 with 28GB and 8 cores x 2 workers - so a total of 16 cores. This is...

Data Engineering

9412 Views
2 replies
2 kudos

06-08-2023 10:34:03 AM

View Replies

Latest Reply

Anonymous
Not applicable

06-15-2023 12:05:36 AM

2 kudos

Hi @Prasenjit Biswas We haven't heard from you since the last response from @Jose Gonzalez . Kindly share the information with us, and in return, we will provide you with the necessary solution.Thanks and Regards

2 kudos

06-15-2023 12:05:36 AM

1 More Replies