03-03-2022 01:16 PM
I am trying to create a job which has 2 tasks as follows:
While this is relatively easy to do using azure pipelines (current implementation) I am not able to do this from within DataBricks Jobs. Basically, I don't know how to do the following:
Please let me know if anything is not clear.
Edit:
I tried the date_tests=dbutils.jobs.taskValues.set() and get() to pass the values. However, it seems to be disabled for my workspace. I get the error "com.databricks.common.client.DatabricksServiceHttpClientException: FEATURE_DISABLED: The task values API is disabled for this workspace". And, I do not know how to enable it. I do have access to the admin console but I have no clue where to look for this feature to enable. Please let me know if you have some idea where to find it.
Thanks,
03-04-2022 04:24 AM
@Rahul Bahadur there a few ways to pass values between tasks in a job:
So it appears you are trying to use No. 1. Please email me at bilal dot aslam at databricks dot com and I will get you enrolled in the preview.
03-03-2022 10:56 PM
I think that option was disabled when the new job functionality was introduced.
There is only one jobs setting in the admin panel: "Task orchestration in Jobs"
To pass parameters into a job you can use the jobs API
(https://docs.microsoft.com/en-us/azure/databricks/jobs) or the CLI.
Or use notebook workflows, where you can run notebooks in parallel.
For the latter you are not using jobs, but jobs are just a way of scheduling notebooks (or jars).
This will also be the most transparent imo.
03-04-2022 02:48 AM
"task which accepts a date and an integer from the user " how user enter this dates?, through web on external web server? or inside databricks through widget? or some other way?
03-04-2022 08:27 AM
The input is through the UI or can also be through a .txt file. See attached pic for details
03-04-2022 04:24 AM
@Rahul Bahadur there a few ways to pass values between tasks in a job:
So it appears you are trying to use No. 1. Please email me at bilal dot aslam at databricks dot com and I will get you enrolled in the preview.
03-04-2022 06:13 AM
Thanks @Bilal Aslam . This is exactly what I was looking for. Also, is there a way to create parallel loops for running a notebook concurrently . For example, running a notebook 5 times with 5 different values from a list which was entered as a parameter? I know I can put a for loop within a notebook, but, that would mean running the code serially for the values in the list. I was wondering if it is possible to run the notebooks concurrently - Is there a way to architect a parallel for loop from within the Jobs API or the Jobs UI?. Thanks
10-22-2022 01:14 AM
Just a note that this feature, Task Values, has been generally available for a while.
Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections.
Click here to register and join today!
Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.