cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

How can I customize the Notebook Job # while using dbutils.notebook.run method?

soumyaPattnaik
New Contributor III

When running multiple notebooks parallelly using dbutils.notebook.run from a parent notebook, an url to that running notebook is printed, like below

  • Notebook job #211371132480519

Is there a way I can print the notebook name or some customized string instead of this job id. I am running multiple notebooks (more than 100 at a time) - so with just the job id it is difficult to identify which notebook run is mapped with which job id

Thanks in advance!

3 REPLIES 3

Debayan
Esteemed Contributor III

Hi @Soumya Pattnaikโ€‹ , Configurations for the job can be updated using the reset or update endpoint in jobs API 2.1 (https://docs.databricks.com/dev-tools/api/latest/jobs.html).

Also, you can use cluster tags, which allow you to easily monitor the cost of cloud resources used by various groups in your organization. You can specify tags as key-value pairs when you create a cluster, and Azure Databricks applies these tags to cloud resources like VMs and disk volumes etc.

You can monitor usage using cluster or pool tags. (https://docs.databricks.com/administration-guide/account-settings/usage-detail-tags-aws.html).

For convenience, Azure Databricks applies four default tags to each cluster:  

Vendor

Creator

ClusterName

and  

ClusterId.

soumyaPattnaik
New Contributor III

Hi @Debayan Thank you for your reply.
However, the answer I am looking for is : how to print/get a more meaningful name of the jobs when running multiple notebooks parallelly using dbutils.notebook.run from a parent notebook.

Now in the parent notebook console something like this appears: (in case I am starting 2 notebook run from the parent NB)

  • Notebook job #211371132480519
  • Notebook job #2113711324805120

I am expecting something like below:

  • <path_to_nb1> or name of notebook1
  • <path_to_nb2> or name of notebook2

+1, I'd like to know this as well.

Guess we'll just have to play around and see what we can do...

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโ€™t want to miss the chance to attend and share knowledge.

If there isnโ€™t a group near you, start one and help create a community that brings people together.

Request a New Group