Topics with Label: Jobs & Workflows

Forum Posts

Sorted by:

by User16826990884 • New Contributor III

06-25-2021 12:01:34 PM

2034 Views
3 replies
0 kudos

Version control jobs

How do engineering teams out there version control their jobs? If there is a production issue, can I revert to an older version of the job?

Data Engineering

2034 Views
3 replies
0 kudos

06-25-2021 12:01:34 PM

View Replies

Latest Reply

Rom
New Contributor III

10-29-2023 9:49:42 AM

0 kudos

You can use version controlled source code for you databricks job and each time you need to rollback to older version of your job you need just to move to older version code. For version controlled source code you have multiple choises:- Use a noteb...

0 kudos

10-29-2023 9:49:42 AM

2 More Replies

by Serhii • Contributor

08-18-2022 1:40:05 AM

953 Views
3 replies
1 kudos

Could not launch jobs due to node_type_id (instance) unavailability

I am running hourly job on a cluster using p3.2xlarge GPU instance, but sometimes cluster couldn't start due to instance unavailability. I wander is there is any fallback mechanism to, for example, try a different instance type if one is not availabl...

Data Engineering

953 Views
3 replies
1 kudos

08-18-2022 1:40:05 AM

View Replies

Latest Reply

abagshaw
New Contributor III

06-27-2023 11:57:30 AM

1 kudos

(AWS only) For anyone experiencing capacity related cluster launch failures on non-GPU instance types, AWS Fleet instance types are now GA and available for clusters and instance pools. They help improve chance of successful cluster launch by allowi...

1 kudos

06-27-2023 11:57:30 AM

2 More Replies

by thib • New Contributor III

06-14-2022 10:52:04 AM

2907 Views
3 replies
2 kudos

Can we use multiple git repos for a job running multiple tasks?

I have a job running multiple tasks :Task 1 runs a machine learning pipeline from git repo 1Task 2 runs an ETL pipeline from git repo 1Task 2 is actually a generic pipeline and should not be checked in repo 1, and will be made available in another re...

Data Engineering

2907 Views
3 replies
2 kudos

06-14-2022 10:52:04 AM

View Replies

Latest Reply

trijit
New Contributor II

05-11-2023 1:15:35 AM

2 kudos

The way to go about this would be to create Databricks repos in the workspace and then use that in the task formation. This way we can refer multiple repos in different tasks.

2 kudos

05-11-2023 1:15:35 AM

2 More Replies

by RJB • New Contributor II

03-03-2022 1:16:27 PM

7157 Views
6 replies
0 kudos

Resolved! How to pass outputs from a python task to a notebook task

I am trying to create a job which has 2 tasks as follows:A python task which accepts a date and an integer from the user and outputs a list of dates (say, a list of 5 dates in string format).A notebook which runs once for each of the dates from the d...

Data Engineering

7157 Views
6 replies
0 kudos

03-03-2022 1:16:27 PM

View Replies

Latest Reply

BilalAslamDbrx
Honored Contributor II

10-22-2022 1:14:35 AM

0 kudos

Just a note that this feature, Task Values, has been generally available for a while.

0 kudos

10-22-2022 1:14:35 AM

5 More Replies

by swetha • New Contributor III

09-08-2022 5:05:05 PM

1394 Views
3 replies
4 kudos

Resolved! Retrieving the job-id's of a notebook running inside tasks

I have created a job, Inside a job I have created tasks which are independent, I have used the concept of concurrent futures to exhibit parallelism and in each task there are couple of notebooks running(which are independent) Each notebook running ha...

Data Engineering

1394 Views
3 replies
4 kudos

09-08-2022 5:05:05 PM

View Replies

Latest Reply

Anonymous
Not applicable

09-23-2022 11:20:58 PM

4 kudos

Hi @swetha kadiyala Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Th...

4 kudos

09-23-2022 11:20:58 PM

2 More Replies

by sawya • New Contributor II

08-19-2022 12:56:21 AM

1250 Views
3 replies
0 kudos

Migrate workspaces to another AWS account

Hi everyone,I have a Databricks workspace in an AWS account that I have to migrate to a new AWS accountDo you know how I can do it ? Or it's better to recreate a new one and move all the workbooks and if I choose to create one new how can you export ...

Data Engineering

1250 Views
3 replies
0 kudos

08-19-2022 12:56:21 AM

View Replies

Latest Reply

Abishek
Valued Contributor

08-25-2022 2:00:24 AM

0 kudos

@AMADOU THIOUNE Can you check the below link to export the run jobs? https://docs.databricks.com/jobs.html#export-job-runs. Try to reuse the same job_id with the /update and /reset endpoints, it should allow you much better access to previous run re...

0 kudos

08-25-2022 2:00:24 AM

2 More Replies

by Sunny • New Contributor III

06-17-2022 5:04:18 AM

3841 Views
6 replies
1 kudos

Using Thread.sleep in Scala

We need to hit REST web service every 5 mins until success message is received. The Scala object is inside a Jar file and gets invoked by Databricks task within a workflow.Thread.sleep(5000) is working fine but not sure if it is safe practice or is t...

Data Engineering

3841 Views
6 replies
1 kudos

06-17-2022 5:04:18 AM

View Replies

Latest Reply

Vartika
Moderator

08-24-2022 9:18:18 AM

1 kudos

Hey there @Sundeep P Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.C...

1 kudos

08-24-2022 9:18:18 AM

5 More Replies

by Serhii • Contributor

08-18-2022 9:23:59 AM

1926 Views
1 replies
1 kudos

Resolved! Behaviour of cluster launches in multi-task jobs

We are adapting the multi-tasks workflow example from dbx documentation for our pipelines https://dbx.readthedocs.io/en/latest/examples/python_multitask_deployment_example.html. As a part of configuration we specify cluster configuration and provide ...

Data Engineering

1926 Views
1 replies
1 kudos

08-18-2022 9:23:59 AM

View Replies

Latest Reply

User16873043099
Contributor

08-18-2022 10:22:33 AM

1 kudos

Tasks within the same multi task job can reuse the clusters. A shared job cluster allows multiple tasks in the same job to use the cluster. The cluster is created and started when the first task using the cluster starts and terminates after the last ...

1 kudos

08-18-2022 10:22:33 AM

by MadelynM • New Contributor III

07-05-2022 10:32:35 AM

3117 Views
3 replies
3 kudos

How do I move existing workflows and jobs running on an all-purpose cluster to a shared jobs cluster?

A Databricks cluster is a set of computation resources that performs the heavy lifting of all of the data workloads you run in Databricks. Databricks provides a number of options when you create and configure clusters to help you get the best perform...

Left navigation bar selecting Data Science & Engineering

Data Engineering

3117 Views
3 replies
3 kudos

07-05-2022 10:32:35 AM

View Replies

Latest Reply

Kaniz
Community Manager

07-07-2022 11:54:48 PM

3 kudos

Hi @Madelyn Mullen , Thank you for sharing such an excellent and informative post. We hope to see these very often.

3 kudos

07-07-2022 11:54:48 PM

2 More Replies

by Sunny • New Contributor III

06-08-2022 10:55:55 AM

4653 Views
8 replies
4 kudos

Resolved! Retrieve job id and run id from scala

I need to retrieve job id and run id of the job from a jar file in Scala.When I try to compile below code in IntelliJ, below error is shown.import com.databricks.dbutils_v1.DBUtilsHolder.dbutils object MainSNL { @throws(classOf[Exception]) de...

Data Engineering

4653 Views
8 replies
4 kudos

06-08-2022 10:55:55 AM

View Replies

Latest Reply

Mohit_m
Valued Contributor II

07-05-2022 3:08:22 AM

4 kudos

Maybe its worth going through the Task Parameter variables section of the below dochttps://docs.databricks.com/data-engineering/jobs/jobs.html#task-parameter-variables

4 kudos

07-05-2022 3:08:22 AM

7 More Replies

by Mohit_m • Valued Contributor II

07-05-2022 3:03:27 AM

2934 Views
1 replies
2 kudos

Resolved! Databricks jobs create API throws unexpected error

Databricks jobs create API throws unexpected errorError response :{"error_code": "INVALID_PARAMETER_VALUE","message": "Cluster validation error: Missing required field: settings.cluster_spec.new_cluster.size"}Any idea on this?

Data Engineering

2934 Views
1 replies
2 kudos

07-05-2022 3:03:27 AM

View Replies

Latest Reply

Mohit_m
Valued Contributor II

07-05-2022 3:04:20 AM

2 kudos

Could you please specify num_workers in the json body and try API again.Also, another recommendation can be configuring what you want in UI, and then pressing “JSON” button that should show corresponding JSON which you can use for API

2 kudos

07-05-2022 3:04:20 AM

by Maverick1 • Valued Contributor II

06-16-2022 8:16:09 AM

3410 Views
4 replies
8 kudos

Resolved! How to get the list of all jobs available for a particular user?

As of now, if I try to list the jobs via "list job" API then there is a limit of 25 jobs only.Is there a way to list all the available/visible jobs to a user?

Data Engineering

3410 Views
4 replies
8 kudos

06-16-2022 8:16:09 AM

View Replies

Latest Reply

Kaniz
Community Manager

06-23-2022 7:27:09 AM

8 kudos

Hi @Saurabh Verma, We haven’t heard from you on the last response from @Arvind Ravish, and I was checking back to see if his suggestions helped you. Or else, If you have any solution, please share it with the community as it can be helpful to other...

8 kudos

06-23-2022 7:27:09 AM

3 More Replies

by Robbie • New Contributor III

06-09-2022 9:38:58 AM

1634 Views
4 replies
5 kudos

Resolved! Why can't I create new jobs? ("You are not entitled to run this type of task...")

This morning I encountered an issue when trying to create a new job using the Workflows UI (in browser). Never had this issue before.The error message that appears is:"You are not entitled to run this type of task, please contact your Databricks admi...

Data Engineering

1634 Views
4 replies
5 kudos

06-09-2022 9:38:58 AM

View Replies

Latest Reply

Kaniz
Community Manager

06-13-2022 8:32:40 PM

5 kudos

Hi @Robbie Capps, I'm glad we could help you. Thank you for marking the best answer for us.

5 kudos

06-13-2022 8:32:40 PM

3 More Replies

by Sunny • New Contributor III

05-26-2022 7:57:05 PM

591 Views
1 replies
0 kudos

Update task status from external application

I am having a workflow with a task that is dependant on external application execution (not residing in Databricks). After external application finishes, how to update the status of a task to complete. Currently, Jobs API doesn't support status updat...

Data Engineering

591 Views
1 replies
0 kudos

05-26-2022 7:57:05 PM

View Replies

Latest Reply

Sunny
New Contributor III

05-30-2022 10:52:08 AM

0 kudos

Any inputs on this one please

0 kudos

05-30-2022 10:52:08 AM

by User16783855534 • New Contributor III

06-07-2021 10:47:16 AM

5308 Views
6 replies
5 kudos

Resolved! How can I get the json spec of my Databricks Job?

Data Engineering

5308 Views
6 replies
5 kudos

06-07-2021 10:47:16 AM

View Replies

Latest Reply

Kaniz
Community Manager

05-18-2022 2:04:14 PM

5 kudos

Hi @Neil Patel , Just a friendly follow-up. Do you still need help, or do the above responses help you find the solution? Please let us know.

5 kudos

05-18-2022 2:04:14 PM

5 More Replies