03-03-2022 01:16 PM
I am trying to create a job which has 2 tasks as follows:
While this is relatively easy to do using azure pipelines (current implementation) I am not able to do this from within DataBricks Jobs. Basically, I don't know how to do the following:
Please let me know if anything is not clear.
Edit:
I tried the date_tests=dbutils.jobs.taskValues.set() and get() to pass the values. However, it seems to be disabled for my workspace. I get the error "com.databricks.common.client.DatabricksServiceHttpClientException: FEATURE_DISABLED: The task values API is disabled for this workspace". And, I do not know how to enable it. I do have access to the admin console but I have no clue where to look for this feature to enable. Please let me know if you have some idea where to find it.
Thanks,
03-04-2022 04:24 AM
@Rahul Bahadur there a few ways to pass values between tasks in a job:
So it appears you are trying to use No. 1. Please email me at bilal dot aslam at databricks dot com and I will get you enrolled in the preview.
03-03-2022 10:56 PM
I think that option was disabled when the new job functionality was introduced.
There is only one jobs setting in the admin panel: "Task orchestration in Jobs"
To pass parameters into a job you can use the jobs API
(https://docs.microsoft.com/en-us/azure/databricks/jobs) or the CLI.
Or use notebook workflows, where you can run notebooks in parallel.
For the latter you are not using jobs, but jobs are just a way of scheduling notebooks (or jars).
This will also be the most transparent imo.
03-04-2022 02:48 AM
"task which accepts a date and an integer from the user " how user enter this dates?, through web on external web server? or inside databricks through widget? or some other way?
03-04-2022 08:27 AM
The input is through the UI or can also be through a .txt file. See attached pic for details
03-04-2022 04:24 AM
@Rahul Bahadur there a few ways to pass values between tasks in a job:
So it appears you are trying to use No. 1. Please email me at bilal dot aslam at databricks dot com and I will get you enrolled in the preview.
03-04-2022 06:13 AM
Thanks @Bilal Aslam . This is exactly what I was looking for. Also, is there a way to create parallel loops for running a notebook concurrently . For example, running a notebook 5 times with 5 different values from a list which was entered as a parameter? I know I can put a for loop within a notebook, but, that would mean running the code serially for the values in the list. I was wondering if it is possible to run the notebooks concurrently - Is there a way to architect a parallel for loop from within the Jobs API or the Jobs UI?. Thanks
10-22-2022 01:14 AM
Just a note that this feature, Task Values, has been generally available for a while.
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.
Request a New Group