05-01-2023 12:21 AM
I have a workflow in Azure Databricks that trains a model every Sunday. Sometimes the workflow fails because the max wait time is exceeded (currently set to 1200 seconds). To solve the problem, I was thinking of either increasing the wait time or increasing the size of the cluster.
This made me wonder which of the two options below is better (and cheaper):
- Train the model on a bigger and more expensive cluster which will hopefully reduce the time used
- Train the model on a smaller and cheaper cluster and then simply increase the wait time
Or is there a third and better solution?
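To make the cost side of the question concrete, here is a rough sketch of the comparison, assuming the cluster is billed purely by time, so that the cost of a run is the hourly rate multiplied by the runtime. The rates and runtimes below are made-up placeholders, not real Azure Databricks prices or measurements, and the function names are hypothetical.

```python
def run_cost(hourly_rate: float, runtime_seconds: float) -> float:
    """Cost of one training run, assuming billing is hourly rate x runtime."""
    return hourly_rate * runtime_seconds / 3600.0


def break_even_speedup(small_rate: float, big_rate: float) -> float:
    """Speedup the bigger cluster needs so both runs cost the same."""
    return big_rate / small_rate


# Hypothetical numbers for illustration only (not real prices or timings).
small_cluster = run_cost(hourly_rate=2.0, runtime_seconds=1500)
big_cluster = run_cost(hourly_rate=4.0, runtime_seconds=600)

print(f"small cluster: {small_cluster:.2f} per run")
print(f"big cluster:   {big_cluster:.2f} per run")
print(f"break-even speedup: {break_even_speedup(2.0, 4.0):.1f}x")
```

Under this assumption, the bigger cluster is only cheaper if it speeds training up by more than the ratio of the two hourly rates. Otherwise the smaller cluster with a longer wait time costs less per run.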
Labels:
- Azure databricks
- Models