cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Add custom tags to DLT cluster

BradSheridan
Valued Contributor

We have a policy in our AWS account that whenever an EC2 instance is created, there are 5 mandatory tags that need to be added. When we create an All-Purpose cluster in the Databricks console, we can easily add these under Advance Options > Tags. However, this isn't available when creating a DLT job under Workflows > Delta Live Tables. Now the EC2 instances that are supposed to spin up for the DLT job never get created b/c they don't have tags. When we go to Compute and Terminate the cluster creation to edit it and add tags, we get a message "Error: dlt prefixed spark images cannot be used outside of the Delta Live Tables service". Any help is greatly appreciated! Brad

1 ACCEPTED SOLUTION

Accepted Solutions

tomasz
Databricks Employee
Databricks Employee

Great!

When it comes to your question, autoloader and DLT can be used together so they are not mutually exclusive, with autoloader being the mechanism to load data into your DLT pipeline. It is common for both of them to be used when implementing a CDC pipeline.

View solution in original post

4 REPLIES 4

tomasz
Databricks Employee
Databricks Employee

You can add "custom_tags" in the DLT configuration under "clusters" (in JSON view):

Screen Shot 2022-07-11 at 9.44.20 AM 

The full range of attributes allowed in this section are listed here.

BradSheridan
Valued Contributor

@Tomasz Bacewiczโ€‹ Perfect!! worked like a charm...thank you!!

do you think auto loader or DLT is a better approach to implementing a CDC pipeline in Databricks?

tomasz
Databricks Employee
Databricks Employee

Great!

When it comes to your question, autoloader and DLT can be used together so they are not mutually exclusive, with autoloader being the mechanism to load data into your DLT pipeline. It is common for both of them to be used when implementing a CDC pipeline.

BradSheridan
Valued Contributor

Thanks again!

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโ€™t want to miss the chance to attend and share knowledge.

If there isnโ€™t a group near you, start one and help create a community that brings people together.

Request a New Group