<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: API for Restarting Individual Failed Tasks within a Job? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115662#M45148</link>
    <description>&lt;P&gt;Hey &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/135091"&gt;@minhhung0507&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Keep me posted on how this works for you.&lt;/P&gt;&lt;P&gt;Looking forward&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cheers!&lt;/P&gt;</description>
    <pubDate>Wed, 16 Apr 2025 14:40:26 GMT</pubDate>
    <dc:creator>RiyazAliM</dc:creator>
    <dc:date>2025-04-16T14:40:26Z</dc:date>
    <item>
      <title>API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115596#M45114</link>
      <description>&lt;P&gt;Hi everyone,&lt;/P&gt;&lt;P&gt;I'm exploring ways to streamline my workflow in Databricks and could really use some expert advice. In my current setup, I have a job (named job_silver) with multiple tasks (e.g., task 1, task 2, task 3). When one of these tasks fails—say task 2—I want the ability to restart just that specific task without rerunning the entire job.&lt;/P&gt;&lt;P&gt;I did some research and came across the “Repair and Rerun” feature (&lt;A href="https://www.databricks.com/blog/2022/05/06/save-time-and-money-on-data-and-ml-workflows-with-repair-and-rerun.html" target="_blank" rel="noopener"&gt;Databricks Blog&lt;/A&gt;). While that's a great tool for saving time and money in data and ML workflows, my use case requires more flexibility. Specifically, I'm looking for an API-based solution that I can integrate into my code, allowing dynamic control over which task to restart based on custom logic.&lt;/P&gt;&lt;P&gt;Some points I’m particularly interested in:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;Is there an existing API (or a combination of APIs) that allows for restarting individual tasks within a job?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Could this be done via the REST API, and if so, what endpoints or methods should I look at?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Are there any workarounds or best practices for implementing this functionality if a dedicated API is not available?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;How might this approach scale in environments with a large number of jobs and complex dependency graphs?&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;I’d love to hear about your experiences and any code snippets or documentation pointers that could help me get started. Thanks in advance for your insights!&lt;/P&gt;</description>
      <pubDate>Wed, 16 Apr 2025 04:13:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115596#M45114</guid>
      <dc:creator>minhhung0507</dc:creator>
      <dc:date>2025-04-16T04:13:27Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115599#M45115</link>
      <description>&lt;P&gt;use retry job or task option it is available bottom right side of the job or task plane&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 16 Apr 2025 04:27:26 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115599#M45115</guid>
      <dc:creator>Aviral-Bhardwaj</dc:creator>
      <dc:date>2025-04-16T04:27:26Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115603#M45118</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/53460"&gt;@Aviral-Bhardwaj&lt;/a&gt;&amp;nbsp;,&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for your suggestion. But&amp;nbsp;I really need is a code/API-driven approach rather than having to click around in the UI to manage a specific task in a job retry.&lt;/P&gt;</description>
      <pubDate>Wed, 16 Apr 2025 04:39:40 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115603#M45118</guid>
      <dc:creator>minhhung0507</dc:creator>
      <dc:date>2025-04-16T04:39:40Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115604#M45119</link>
      <description>&lt;P&gt;&lt;A href="https://docs.databricks.com/api/workspace/jobs/reset" target="_blank"&gt;https://docs.databricks.com/api/workspace/jobs/reset&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 16 Apr 2025 04:41:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115604#M45119</guid>
      <dc:creator>Aviral-Bhardwaj</dc:creator>
      <dc:date>2025-04-16T04:41:41Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115606#M45120</link>
      <description>&lt;P&gt;Thank you so much for sharing that information. After reviewing the /api/2.2/jobs/reset API documentation, it looks like this endpoint is designed for updating the entire job configuration for future runs and doesn’t support restarting only the failed tasks of an active job run.&lt;/P&gt;&lt;P&gt;I appreciate your help, and if you have any other suggestions or ideas on achieving that level of granularity, I'd be happy to hear them.&lt;/P&gt;</description>
      <pubDate>Wed, 16 Apr 2025 04:53:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115606#M45120</guid>
      <dc:creator>minhhung0507</dc:creator>
      <dc:date>2025-04-16T04:53:28Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115614#M45127</link>
      <description>&lt;P&gt;hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/135091"&gt;@minhhung0507&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please check if the below code using rest api is able to cater to your needs.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;list_runs_url &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;DATABRICKS_INSTANCE&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;/api/2.1/jobs/runs/list"&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;params &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; {&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"job_id"&lt;/SPAN&gt;&lt;SPAN&gt;: JOB_ID,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"limit"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;1&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"active_only"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;False&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;response &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; requests.&lt;/SPAN&gt;&lt;SPAN&gt;get&lt;/SPAN&gt;&lt;SPAN&gt;(list_runs_url, &lt;/SPAN&gt;&lt;SPAN&gt;headers&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;headers, &lt;/SPAN&gt;&lt;SPAN&gt;params&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;params)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;latest_run_id &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; response.&lt;/SPAN&gt;&lt;SPAN&gt;json&lt;/SPAN&gt;&lt;SPAN&gt;()[&lt;/SPAN&gt;&lt;SPAN&gt;"runs"&lt;/SPAN&gt;&lt;SPAN&gt;][&lt;/SPAN&gt;&lt;SPAN&gt;0&lt;/SPAN&gt;&lt;SPAN&gt;][&lt;/SPAN&gt;&lt;SPAN&gt;"run_id"&lt;/SPAN&gt;&lt;SPAN&gt;]&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;run_details_url &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;DATABRICKS_INSTANCE&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;/api/2.1/jobs/runs/get"&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;response &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; requests.&lt;/SPAN&gt;&lt;SPAN&gt;get&lt;/SPAN&gt;&lt;SPAN&gt;(run_details_url, &lt;/SPAN&gt;&lt;SPAN&gt;headers&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;headers, &lt;/SPAN&gt;&lt;SPAN&gt;params&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;"run_id"&lt;/SPAN&gt;&lt;SPAN&gt;: latest_run_id})&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;tasks &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; response.&lt;/SPAN&gt;&lt;SPAN&gt;json&lt;/SPAN&gt;&lt;SPAN&gt;().&lt;/SPAN&gt;&lt;SPAN&gt;get&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;"tasks"&lt;/SPAN&gt;&lt;SPAN&gt;, [])&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;failed_tasks &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; [task[&lt;/SPAN&gt;&lt;SPAN&gt;"task_key"&lt;/SPAN&gt;&lt;SPAN&gt;] &lt;/SPAN&gt;&lt;SPAN&gt;for&lt;/SPAN&gt;&lt;SPAN&gt; task &lt;/SPAN&gt;&lt;SPAN&gt;in&lt;/SPAN&gt;&lt;SPAN&gt; tasks &lt;/SPAN&gt;&lt;SPAN&gt;if&lt;/SPAN&gt;&lt;SPAN&gt; task[&lt;/SPAN&gt;&lt;SPAN&gt;"state"&lt;/SPAN&gt;&lt;SPAN&gt;][&lt;/SPAN&gt;&lt;SPAN&gt;"result_state"&lt;/SPAN&gt;&lt;SPAN&gt;] &lt;/SPAN&gt;&lt;SPAN&gt;==&lt;/SPAN&gt; &lt;SPAN&gt;"FAILED"&lt;/SPAN&gt;&lt;SPAN&gt;]&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;&lt;SPAN&gt;(failed_tasks)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;if&lt;/SPAN&gt; &lt;SPAN&gt;not&lt;/SPAN&gt;&lt;SPAN&gt; failed_tasks:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;"No failed tasks found."&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;else&lt;/SPAN&gt;&lt;SPAN&gt;:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"Retrying failed tasks: &lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;failed_tasks&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;# Submit new run for each failed task (assuming same notebook and cluster setup)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;for&lt;/SPAN&gt;&lt;SPAN&gt; task &lt;/SPAN&gt;&lt;SPAN&gt;in&lt;/SPAN&gt;&lt;SPAN&gt; tasks:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;if&lt;/SPAN&gt;&lt;SPAN&gt; task[&lt;/SPAN&gt;&lt;SPAN&gt;"task_key"&lt;/SPAN&gt;&lt;SPAN&gt;] &lt;/SPAN&gt;&lt;SPAN&gt;in&lt;/SPAN&gt;&lt;SPAN&gt; failed_tasks:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;submit_url &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;DATABRICKS_INSTANCE&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;/api/2.1/jobs/runs/submit"&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;payload &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; {&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"run_name"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"Rerun failed task: &lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;task[&lt;/SPAN&gt;&lt;SPAN&gt;'task_key'&lt;/SPAN&gt;&lt;SPAN&gt;]&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"tasks"&lt;/SPAN&gt;&lt;SPAN&gt;: [&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"task_key"&lt;/SPAN&gt;&lt;SPAN&gt;: task[&lt;/SPAN&gt;&lt;SPAN&gt;"task_key"&lt;/SPAN&gt;&lt;SPAN&gt;],&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"notebook_task"&lt;/SPAN&gt;&lt;SPAN&gt;: task[&lt;/SPAN&gt;&lt;SPAN&gt;"notebook_task"&lt;/SPAN&gt;&lt;SPAN&gt;],&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"existing_cluster_id"&lt;/SPAN&gt;&lt;SPAN&gt;: task[&lt;/SPAN&gt;&lt;SPAN&gt;"existing_cluster_id"&lt;/SPAN&gt;&lt;SPAN&gt;]&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;]&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;submit_resp &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; requests.&lt;/SPAN&gt;&lt;SPAN&gt;post&lt;/SPAN&gt;&lt;SPAN&gt;(submit_url, &lt;/SPAN&gt;&lt;SPAN&gt;headers&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;headers, &lt;/SPAN&gt;&lt;SPAN&gt;data&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;json.&lt;/SPAN&gt;&lt;SPAN&gt;dumps&lt;/SPAN&gt;&lt;SPAN&gt;(payload))&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"Submitted rerun for task &lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;task[&lt;/SPAN&gt;&lt;SPAN&gt;'task_key'&lt;/SPAN&gt;&lt;SPAN&gt;]&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;submit_resp.status_code&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt; - &lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;submit_resp.text&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Wed, 16 Apr 2025 08:20:58 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115614#M45127</guid>
      <dc:creator>RiyazAliM</dc:creator>
      <dc:date>2025-04-16T08:20:58Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115662#M45148</link>
      <description>&lt;P&gt;Hey &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/135091"&gt;@minhhung0507&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Keep me posted on how this works for you.&lt;/P&gt;&lt;P&gt;Looking forward&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cheers!&lt;/P&gt;</description>
      <pubDate>Wed, 16 Apr 2025 14:40:26 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115662#M45148</guid>
      <dc:creator>RiyazAliM</dc:creator>
      <dc:date>2025-04-16T14:40:26Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115701#M45160</link>
      <description>&lt;P&gt;Sure, we will try again based on the solution you provided and will feedback to you whether it works or not. Thank you very much.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Apr 2025 02:44:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115701#M45160</guid>
      <dc:creator>minhhung0507</dc:creator>
      <dc:date>2025-04-17T02:44:50Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115707#M45161</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/15469"&gt;@RiyazAliM&lt;/a&gt;&amp;nbsp;,&amp;nbsp;after applying your code to the pipeline it seems that it can't work because of the following error:&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;400 --- {"error_code":"INVALID_PARAMETER_VALUE","message":"One of job_cluster_key, new_cluster, or existing_cluster_id must be specified. Serverless compute for workflows is not enabled in the workspace.","details":[{"@type":"type.googleapis.com/google.rpc.RequestInfo","request_id":"74ef6feb-9cff-4d3d-8cb6-3f45f3525768","serving_data":""}]}&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;Could you plese help us investigate this error?&lt;/P&gt;</description>
      <pubDate>Thu, 17 Apr 2025 03:45:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115707#M45161</guid>
      <dc:creator>minhhung0507</dc:creator>
      <dc:date>2025-04-17T03:45:05Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115711#M45163</link>
      <description>&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/135091"&gt;@minhhung0507&lt;/a&gt;&amp;nbsp;- quick question - what is the cluster type you're using to run your workflow?&lt;/P&gt;&lt;P&gt;I'm using a shared, interactive cluster, so I'm passing the parameter&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;{'existing_cluster_id' : task['existing_cluster_id']}&lt;/LI-CODE&gt;&lt;P&gt;in the payload. This parameter will change if you're using a job_cluster, I'd have to check what it's gonna be for serveless though.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope it helps.&lt;/P&gt;&lt;P&gt;Cheers!&lt;/P&gt;</description>
      <pubDate>Thu, 17 Apr 2025 05:43:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115711#M45163</guid>
      <dc:creator>RiyazAliM</dc:creator>
      <dc:date>2025-04-17T05:43:07Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115730#M45170</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/15469"&gt;@RiyazAliM&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Thanks for showing how to pass the existing_cluster_id in the payload. One thing I’m worried about from a production standpoint is what happens if the ID we supply doesn’t correspond to an active cluster—will Databricks automatically spin up a new cluster under the hood, or will the run fail immediately?&lt;/P&gt;&lt;P&gt;Auto‑provisioning clusters without explicit control could lead to unwanted resource sprawl and cost overruns. Could you please clarify the behavior of the Submit Run API when existing_cluster_id can’t be resolved, and recommend best practices to ensure we only target known, managed clusters?&lt;/P&gt;</description>
      <pubDate>Thu, 17 Apr 2025 09:25:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115730#M45170</guid>
      <dc:creator>minhhung0507</dc:creator>
      <dc:date>2025-04-17T09:25:55Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115746#M45174</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/15469"&gt;@RiyazAliM&lt;/a&gt;&amp;nbsp;FYI&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/15469"&gt;@RiyazAliM&lt;/a&gt;&amp;nbsp; FYI:&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;400 --- {"error_code":"INVALID_PARAMETER_VALUE","message":"The cluster 0417-102632-cuhsyfm0 is not an all-purpose cluster. existing_cluster_id only supports all-purpose cluster IDs.","details":[{"@type":"type.googleapis.com/google.rpc.RequestInfo","request_id":"3379c0ea-0fc0-405a-808c-7f06759aad83","serving_data":""}]}&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 17 Apr 2025 11:09:44 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/115746#M45174</guid>
      <dc:creator>minhhung0507</dc:creator>
      <dc:date>2025-04-17T11:09:44Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/116042#M45231</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/135091"&gt;@minhhung0507&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The trigger won't be successful if the cluster-id is not provided. I tried to replicate it and found that it throws an error as below:&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;{"error_code":"INVALID_PARAMETER_VALUE","message":"Missing required fields: settings.job_clusters.job_cluster_key, settings.job_clusters.new_cluster","details":[{"@type":"type.googleapis.com/google.rpc.RequestInfo","request_id":"b9631652-3101-4b69-95a6-45be477c85a5","serving_data":""}]}&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Apr 2025 08:07:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/116042#M45231</guid>
      <dc:creator>RiyazAliM</dc:creator>
      <dc:date>2025-04-21T08:07:36Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/116044#M45232</link>
      <description>&lt;P&gt;The error depicts that you've passed job_cluster id with the key as existing_cluster_id which is used for interactive clusters. Please pass job_cluster_details if you're using the key `&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"job_cluster_key"`. Thank you&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Mon, 21 Apr 2025 08:10:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/116044#M45232</guid>
      <dc:creator>RiyazAliM</dc:creator>
      <dc:date>2025-04-21T08:10:53Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/116048#M45233</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/15469"&gt;@RiyazAliM&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;I have two quick questions I’d appreciate your help with:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;Why do we need to use an interactive (all‑purpose) cluster when submitting a job, rather than a job cluster?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;I can’t find any reference to a parameter called job_cluster_details in the official docs—could you point me to where it’s documented or share a link?&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Thanks in advance for any pointers!&lt;/P&gt;</description>
      <pubDate>Mon, 21 Apr 2025 08:50:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/116048#M45233</guid>
      <dc:creator>minhhung0507</dc:creator>
      <dc:date>2025-04-21T08:50:27Z</dc:date>
    </item>
    <item>
      <title>Re: API for Restarting Individual Failed Tasks within a Job?</title>
      <link>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/116458#M45321</link>
      <description>&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/135091"&gt;@minhhung0507&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;Great questions, let me answer as per my understanding:&lt;/P&gt;&lt;P&gt;&lt;FONT color="#FF0000"&gt;&lt;EM&gt;Q1:&amp;nbsp;Why do we need to use an interactive (all‑purpose) cluster when submitting a job, rather than a job cluster?&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000"&gt;As I'm testing, it's a matter of convenience, nothing more, I couldn't wait to run the job till a job cluster spins up, hence using an interactive cluster. As I figured out you're using a job cluster to pass the request, I suggested to use a different parameter in the JSON payload.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#FF0000"&gt;Q2: I can’t find any reference to a parameter called job_cluster_details in the official docs—could you point me to where it’s documented or share a link?&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000"&gt;I was checking the RestAPI docs and from the link below, saw how the job_cluster is being referenced. Suggested the same to you.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="RiyazAli_0-1745493412644.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/16215iA2369584E4E652DE/image-size/medium?v=v2&amp;amp;px=400" role="button" title="RiyazAli_0-1745493412644.png" alt="RiyazAli_0-1745493412644.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;In the example above, "auto_scaling_cluster" is the name of the job cluster.&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000"&gt;Link to the doc -&amp;nbsp;&lt;A href="https://docs.databricks.com/api/workspace/jobs/get" target="_blank" rel="noopener"&gt;https://docs.databricks.com/api/workspace/jobs/get&lt;/A&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#000000"&gt;Also, check this API for repair run, I believe this would be the right URL to hit for your usecase.&lt;BR /&gt;&lt;A href="https://docs.databricks.com/api/workspace/jobs/repairrun" target="_self"&gt;https://docs.databricks.com/api/workspace/jobs/repairrun&lt;/A&gt;&lt;BR /&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;Let me know your thoughts.&lt;/P&gt;</description>
      <pubDate>Thu, 24 Apr 2025 11:19:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/api-for-restarting-individual-failed-tasks-within-a-job/m-p/116458#M45321</guid>
      <dc:creator>RiyazAliM</dc:creator>
      <dc:date>2025-04-24T11:19:45Z</dc:date>
    </item>
  </channel>
</rss>

