Data Engineering

Forum Posts

Sorted by:

by swzzzsw • New Contributor III

01-24-2022 11:17:24 AM

9382 Views
4 replies
9 kudos

"Run now with different parameters" - different parameters not recognized by jobs involving multiple tasks

I'm running a databricks job involving multiple tasks and would like to run the job with different set of task parameters. I can achieve that by edit each task and and change the parameter values. However, it gets very manual when I have a lot of tas...

Data Engineering

9382 Views
4 replies
9 kudos

01-24-2022 11:17:24 AM

View Replies

Latest Reply

VijayNakkonda
New Contributor II

3 weeks ago

9 kudos

Dear Team, For now, I found a solution. Disconnect the bundle source on Databricks, edit the parameters that you want to run. After execution, redeploy your code again from repository.

9 kudos

3 weeks ago

3 More Replies

by Tahseen0354 • Valued Contributor

07-01-2022 1:59:18 AM

5737 Views
5 replies
3 kudos

Resolved! Why I am not receiving any mail sent to the Azure AD Group mailbox when databricks job fails ?

I have created an Azure AD Group in "Microsoft 365" type with its own email address, which being added to the Notification of a Databricks Job (on failure). But there is no mail sent to the Azure Group mailbox when the job fails.I am able to send a d...

Data Engineering

5737 Views
5 replies
3 kudos

07-01-2022 1:59:18 AM

View Replies

Latest Reply

Lanky
New Contributor II

01-06-2025 5:43:42 AM

3 kudos

Hello Guys, I have setup ses receive email for databricks notification. When i send email message from google mail or yahoo mail, it gets to the SES email receiving rule. However, notification from databricks doesn't get to the same SES email receivi...

3 kudos

01-06-2025 5:43:42 AM

4 More Replies

by Mohit_m • Valued Contributor II

06-15-2022 5:23:13 AM

28666 Views
3 replies
4 kudos

Resolved! How to get the Job ID and Run ID and save into a database

We are having Databricks Job running with main class and JAR file in it. Our JAR file code base is in Scala. Now, when our job starts running, we need to log Job ID and Run ID into the database for future purpose. How can we achieve this?

Data Engineering

28666 Views
3 replies
4 kudos

06-15-2022 5:23:13 AM

View Replies

Latest Reply

Bruno-Castro
New Contributor II

05-08-2024 1:05:13 AM

4 kudos

That article is for members only, can we also specify here how to do it (for those that are not medium members?). Thanks!

4 kudos

05-08-2024 1:05:13 AM

2 More Replies

by sandeep91 • New Contributor III

03-01-2022 8:53:30 AM

8120 Views
5 replies
2 kudos

Resolved! Databricks Job: Package Name and EntryPoint parameters for the Python Wheel file

I have created Python wheel file with simple file structure and uploaded into cluster library and was able to run the packages in Notebook but, when I am trying to create a Job using python wheel and provide the package name and run the task it fails...

Data Engineering

8120 Views
5 replies
2 kudos

03-01-2022 8:53:30 AM

View Replies

Latest Reply

AndréSalvati
New Contributor III

03-06-2024 9:00:50 AM

2 kudos

There you can see a complete template project with (the new!!!) Databricks Asset Bundles tool and a python wheel task. Please, follow the instructions for deployment.https://github.com/andre-salvati/databricks-template

2 kudos

03-06-2024 9:00:50 AM

4 More Replies

by User16790091296 • Contributor II

06-24-2021 8:52:27 AM

3921 Views
1 replies
0 kudos

How to create a databricks job with parameters via CLI?

I'm creating a new job in databricks using the databricks-cli:databricks jobs create --json-file ./deploy/databricks/config/job.config.jsonWith the following json:{ "name": "Job Name", "new_cluster": { "spark_version": "4.1.x-scala2.1...

Data Engineering

3921 Views
1 replies
0 kudos

06-24-2021 8:52:27 AM

View Replies

Latest Reply

matthew_m
Databricks Employee

10-12-2023 9:37:24 AM

0 kudos

This is an old post but still relevant for future readers, so will answer how it is done. You need to add base_parameters flag in the notebook_task config, like the following. "notebook_task": { "notebook_path": "...", "base_parameters": { ...

0 kudos

10-12-2023 9:37:24 AM

by LidorAbo • New Contributor II

05-15-2023 9:13:15 AM

7260 Views
1 replies
1 kudos

bucket ownership of s3 bucket in databricks

We had a databricks job that has strange behavior,when we passing 'output_path' to function saveAsTextFile and not output_path variable the data saved to the following path: s3://dev-databricks-hy1-rootbucket/nvirginiaprod/3219117805926709/output_pa...

Data Engineering

7260 Views
1 replies
1 kudos

05-15-2023 9:13:15 AM

View Replies

Latest Reply

User16752239289
Databricks Employee

06-05-2023 5:27:40 PM

1 kudos

I suspect you provided a dbfs path to save the data hence the data saved under your workspace root bucket.For the workspace root bucket, databricks workspace will interact with databricks credential to make sure databricks has access to it and able t...

1 kudos

06-05-2023 5:27:40 PM

by Divya_Bhadauria • New Contributor II

05-01-2023 12:09:31 PM

6389 Views
2 replies
2 kudos

Running databricks job with different parameter automatically

I have a python script running as databricks job. Is there a way I can run this job with different set of parameters automatically or programmatically without using run with different parameter option available in UI ?

Data Engineering

6389 Views
2 replies
2 kudos

05-01-2023 12:09:31 PM

View Replies

Latest Reply

Anonymous
Not applicable

05-18-2023 10:51:28 PM

2 kudos

Hi @Divya Bhadauria We haven't heard from you since the last response from @Lakshay Goel , and I was checking back to see if her suggestions helped you.Or else, If you have any solution, please share it with the community, as it can be helpful to ...

2 kudos

05-18-2023 10:51:28 PM

1 More Replies

by source2sea • Contributor

05-12-2023 7:21:54 AM

5656 Views
4 replies
2 kudos

Resolved! how to make databricks job to fail when the application has already given "exit code 1"?

object OurMainObject extends LazyLogging with IOApp { def run(args: List[String]): IO[ExitCode] = { logger.info("Started the application") val conf = defaultOverrides.withFallback(defaultApplication).withFallback(defaultReference) val...

Data Engineering

5656 Views
4 replies
2 kudos

05-12-2023 7:21:54 AM

View Replies

Latest Reply

source2sea
Contributor

05-16-2023 7:59:39 AM

2 kudos

my workaround now is to make the code like below, so the databricks jobs becomes failure. case Left(ex) => { IO(logger.error("Glue failure", ex)).map(_ => ExitCode.Error) IO.raiseError(ex) }

2 kudos

05-16-2023 7:59:39 AM

3 More Replies

by psps • New Contributor III

05-04-2023 2:11:43 AM

5143 Views
3 replies
5 kudos

Databricks Job run logs only shows prints/logs from driver and not executors

Hi,In Databricks Job run output, only logs from driver are displayed. We have a function parallelized to run on executor nodes. The logs/prints from that function are not displayed in job run output. Is there a way to configure and show those logs i...

Data Engineering

5143 Views
3 replies
5 kudos

05-04-2023 2:11:43 AM

View Replies

Latest Reply

psps
New Contributor III

05-09-2023 9:29:48 AM

5 kudos

Thanks @Debayan Mukherjee . This is to enable executor logging. However, the executor logs do not appear in Databricks Job run output. Only driver logs are displayed.

5 kudos

05-09-2023 9:29:48 AM

2 More Replies

by Divya_Bhadauria • New Contributor II

04-26-2023 2:25:15 PM

10164 Views
3 replies
2 kudos

Unable to run python script from git repo in Databricks job

I'm getting cannot read python file on running this job which is configured to run a python script from git repo. Run result unavailable: run failed with error message Cannot read the python file /Repos/.internal/7c39d645692_commits/ff669d089cd8f93e9...

Data Engineering

10164 Views
3 replies
2 kudos

04-26-2023 2:25:15 PM

View Replies

Latest Reply

Divya_Bhadauria
New Contributor II

05-01-2023 10:24:04 AM

2 kudos

Hi Vidula,Yes, the above solution worked out for me. Tried debugging using all of the above steps and it turned out the path I was using in the job config was incorrect.

2 kudos

05-01-2023 10:24:04 AM

2 More Replies

by MarsSu • New Contributor II

04-20-2023 7:36:38 PM

9410 Views
5 replies
1 kudos

Resolved! Databricks job about spark structured streaming zero downtime deployment in terraform.

I would like to ask how to implement zero downtime deployment of spark structured streaming in databricks job compute with terraform. Because we will upgrade spark application code version. But currently we found every deployment will cancel original...

Data Engineering

9410 Views
5 replies
1 kudos

04-20-2023 7:36:38 PM

View Replies

Latest Reply

Anonymous
Not applicable

04-25-2023 10:22:00 PM

1 kudos

@Mars Su :Yes, you can implement zero downtime deployment of Spark Structured Streaming in Databricks job compute using Terraform. One way to achieve this is by using Databricks' "job clusters" feature, which allows you to create a cluster specifica...

1 kudos

04-25-2023 10:22:00 PM

4 More Replies

by Michael_Papadop • New Contributor II

04-07-2023 6:35:05 AM

11553 Views
3 replies
0 kudos

How can I set the status of a databricks job as skipped via python?

I have a basic 2 task job. The 1st notebook (task) checks whether the source file has changes and if so then refreshes a corresponding materialized view. In case we have no changes then I use dbutils.jobs.taskValues.set(key = "skip_job", value = 1) &...

Data Engineering

11553 Views
3 replies
0 kudos

04-07-2023 6:35:05 AM

View Replies

Latest Reply

karthik_p
Esteemed Contributor

04-07-2023 1:32:12 PM

0 kudos

@Michael Papadopoulos usually that should not be the case i think, as for task level we have 3 level notifications ( success, failure,start), where as whole job level skip option is available to discard notification . will see if some one from commu...

0 kudos

04-07-2023 1:32:12 PM

2 More Replies

by youssefmrini • Databricks Employee

02-28-2023 3:17:57 AM

1283 Views
1 replies
1 kudos

Resolved! Does Databricks workflows support continuous jobs ?

Data Engineering

1283 Views
1 replies
1 kudos

02-28-2023 3:17:57 AM

View Replies

Latest Reply

youssefmrini
Databricks Employee

02-28-2023 3:18:02 AM

1 kudos

You can ensure there is always an active run of your Databricks job with the new continuous trigger type. https://docs.databricks.com/workflows/jobs/jobs.html#continuous-jobs

1 kudos

02-28-2023 3:18:02 AM

by vinaykumar • New Contributor III

02-13-2023 10:06:47 PM

4877 Views
3 replies
1 kudos

Resolved! Run databricks job instantly without waiting job cluster get active

when we run databricks job it take some time to get job cluster active . I created pool also and attached with job cluster but still it take time to attached the cluster and job cluster get active to start the job run. is there any way - we can run d...

Data Engineering

4877 Views
3 replies
1 kudos

02-13-2023 10:06:47 PM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

02-14-2023 5:06:49 AM

1 kudos

If you want instant processing, you will have to have a cluster running all the time.As mentioned above, Databricks is testing serverless compute for data engineering workloads (comparable to serverless SQL). This fires up a cluster in a few seconds...

1 kudos

02-14-2023 5:06:49 AM

2 More Replies

by joakon • New Contributor III

01-25-2023 12:25:39 PM

3003 Views
4 replies
4 kudos

Resolved! Databricks - Workflow- Jobs- Script to automate

Hi - I have created a Databricks job - under Workflow - its running fine without any issues . I would like to promote this job to other workspaces using a script.Is there a way to script the job definition and deploy it across multiple workspaces .I ...

Data Engineering

3003 Views
4 replies
4 kudos

01-25-2023 12:25:39 PM

View Replies

Latest Reply

joakon
New Contributor III

01-27-2023 10:04:57 AM

4 kudos

thank you @Landan George

4 kudos

01-27-2023 10:04:57 AM

3 More Replies