cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Adding tags without Cluster Restart

ryomen_sukuna
New Contributor II

Hi Team, I want to add tags to cluster that is shared for multiple job runs.
To isolate cost we want to leverage tagging.
If a job run is already in place on a running cluster, I want to trigger one more run and add tags.
I'm not able to add tags without cluster restart which I don't want, as already a job is using that cluster.

How to Isolate cost per run in such cases.

4 REPLIES 4

szymon_dybczak
Contributor III

Hi @ryomen_sukuna ,

Unfortunately, I believe it's not possible. Any change to the cluster configuration requires a restart for the changes to take effect. Below snippet from documenatation:

"

Updates the configuration of a cluster to match the provided attributes and size. A cluster can be updated if it is in a RUNNING or TERMINATED state.
If a cluster is updated while in a RUNNING state, it will be restarted so that the new attributes can take effect.
If a cluster is updated while in a TERMINATED state, it will remain TERMINATED. The next time it is started using the clusters/start API, the new attributes will take effect. Any attempt to update a cluster in any other state will be rejected with an INVALID_STATE error code."
 

ryomen_sukuna
New Contributor II

@szymon_dybczak do we have some other way we can isolate cost per run?

Can't you just add additional tags on a job level? According to documenatation tags defined on job level also propagate to job clusters created when a job is run, allowing you to use tags with your existing cluster monitoring.

Slash_0-1725433445489.png

 

ryomen_sukuna
New Contributor II

Correct, we can leverage job level tagging, but again it comes with a cost of handling concurrent runs or repair runs and some special cases as the job definition will change.
We generally don't trigger the runs using UI as we have an automated process to trigger the runs.

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโ€™t want to miss the chance to attend and share knowledge.

If there isnโ€™t a group near you, start one and help create a community that brings people together.

Request a New Group