cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Repairing running workflow with few failed child jobs

holychs
New Contributor III

I have a parent job that calls multiple child jobs in workflow, Out of 10 child jobs, one has failed and rest 9 are still running, I want to repair the failed child tasks. can I do that while the other child jobs are running?

1 REPLY 1

Brahmareddy
Honored Contributor III

Hi holychs,

How are you doing today?, As per my understanding, yes, in Databricks Workflows, if you're running a multi-task job (like your parent job triggering multiple child tasks), you can repair only the failed task without restarting the entire job. However, to do this, you need to wait until the rest of the tasks finish running, because Databricks doesn't currently allow you to repair individual tasks while others are still in progress. Once all the other child jobs have completed (successfully or failed), you can go to the job run in the UI and click “Repair run”, then select just the failed task to retry. It’s a helpful feature for large workflows, and it avoids re-running everything from scratch. Let me know if you want help setting up repair-friendly dependencies or alerts!

Regards,

Brahma

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now