Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
Hi all! I've started learning DLT pipelines, but I am struggling with the development of a pipeline. As far as I understand it, once I click on "Validate" a cluster will spin up and stay up (by default for 2 hours) if the pipeline is in "Development" mode....
Hi @T0M,
It is expected that the cluster restarts with each validation to ensure that your changes are accurately reflected.
In development mode, once you click "Validate," a cluster will spin up, and it stays active for up to two hours by default. ...
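For what it's worth, that two-hour retention window in development mode is reportedly configurable in the pipeline settings JSON; a minimal sketch, assuming the pipelines.clusterShutdown.delay configuration key (the pipeline name below is a placeholder, and the key should be checked against the current DLT docs):

{
  "name": "my_dlt_pipeline",
  "development": true,
  "configuration": {
    "pipelines.clusterShutdown.delay": "60m"
  }
}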
I'm curious, is DLT *required* in order to use Materialized Views in Databricks? Is it not possible to create and refresh a Materialized View via a standard Databricks Workflow?
Hi @ChristianRRL, When creating a materialized view in Databricks, the data is stored in DBFS, cloud storage, or a Unity Catalog volume. You can still create a materialized view by overwriting the same table each time, instead of using Append, Update, ...
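To illustrate the overwrite approach mentioned above, here is a minimal sketch of a notebook task in a standard Workflow that recomputes an aggregate and overwrites the target table on every run (the catalog, schema, table, and column names are hypothetical):

# Recompute the result from scratch and overwrite the target table,
# approximating a materialized-view refresh without DLT.
df = spark.sql("""
    SELECT region, SUM(amount) AS total_amount
    FROM main.sales.orders
    GROUP BY region
""")
df.write.mode("overwrite").saveAsTable("main.analytics.sales_by_region")

The trade-off is that every refresh is a full recompute, whereas DLT materialized views can refresh incrementally where the query supports it.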
Hi there, silly question here, but can anyone help me understand what practical purpose labelling the table_properties with "quality":"<specific_medallion>" serves? For example: @dlt.table(
comment="Bronze live streaming table for Test data",
name="...
I have the same doubt, @ChristianRRL, did you figure out anything related to it? My question is whether it's possible to apply any kind of access control based on this property.
Hi all. I am no longer able to install my custom wheel in my DLT pipeline. No matter what configuration I try I cannot get it to work: parameterized or just hard-coding the path to the wheel. If I run the hard-coded cell with an all-purpose cluster t...
I managed to fix the issue. The problem was that my wheel was built for Databricks Runtime 14.3 LTS and I was using the PREVIEW channel rather than the CURRENT channel. At time of writing: CURRENT (default): Databricks Runtime 14.1 --> Python: 3.10.12 P...
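For anyone hitting the same thing: in a DLT pipeline, a custom wheel is typically installed with a %pip magic at the top of the source notebook, and the wheel has to be compatible with the Python/DBR version of the selected channel. A minimal sketch (the wheel path and package name are hypothetical):

# First cell of the DLT source notebook: install the custom wheel.
# The wheel must be built for the Python version of the channel's runtime.
%pip install /dbfs/FileStore/wheels/my_package-0.1.0-py3-none-any.whl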
Hi there, this is a follow-up from a discussion I started last month: Solved: Re: DLT Compute: "Ephemeral" Job Compute vs. All-p... - Databricks Community - 71661. Based on what was discussed, I understand that it's not possible to use "All Purpose Clust...
@ChristianRRL regarding why DLT doesn't allow you to use all-purpose clusters:
1. The DLT runtime is derived from the shared compute DBR; it's not the same runtime and has different features than the common all-purpose runtime. A DLT pipeline is n...
The title is self-explanatory, but I'm curious if anyone has any indication as to when Materialized Views will enter General Availability? I'm surprised it's not already in GA considering (A) it's been a year or more since Databricks originally anno...
Hi there, I'm curious if anyone is able to definitively help me answer how DLT Job Clusters operate/run. For example, the following is my baseline understanding of DLT Job Clusters. If I run a Triggered DLT Pipeline (e.g. daily), the job cluster takes m...
Ideally one would expect the clusters used for a DLT pipeline to terminate after the pipeline execution has finished. However, while running in `development` mode, you'll notice the cluster doesn't terminate on its own, whereas in `production` it terminates ...
I'm trying to figure out what's the best way to "de-duplicate" data via DLT. Currently, my only leads are:
- Manage data quality with Delta Live Tables | Databricks on AWS (via "Drop invalid records")
- Constraints on Databricks | Databricks on AWS (via "pre-de...
Hey @ChristianRRL, Based on my understanding you want to de-duplicate your data during your DLT pipeline processing. Unfortunately, I was not able to find a solution when I ran into the same problem, due to the native feature limitations. Limitations...
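In the absence of a native de-dup feature, one common pattern is to drop duplicates explicitly while building the silver dataset. A minimal sketch, assuming a batch silver table and a hypothetical event_id key column:

import dlt

@dlt.table(comment="Silver events with duplicate rows removed")
def events_silver():
    # events_bronze is assumed to be defined elsewhere in the same pipeline
    df = dlt.read("events_bronze")
    # keep one row per business key; adjust the key columns to your data
    return df.dropDuplicates(["event_id"])

For streaming/CDC sources, dlt.apply_changes (APPLY CHANGES INTO) can serve a similar purpose when the feed carries keys and a sequencing column.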
Super basic question. For DLT pipelines I see there's an option to add multiple "Paths". Is it generally best practice to completely separate `bronze` from `silver` notebooks? Or is it more recommended to bundle both raw `bronze` and clean `silver` d...
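Either layout works within a single pipeline, because datasets reference each other by name rather than by notebook, so bronze and silver definitions can live in one notebook or in separate notebooks added as separate Paths. A minimal sketch with hypothetical names:

import dlt

@dlt.table(name="orders_bronze", comment="Raw ingested orders")
def orders_bronze():
    # hypothetical landing location read with Auto Loader
    return (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "json")
        .load("/Volumes/main/raw/orders_landing/")
    )

@dlt.table(name="orders_silver", comment="Cleaned orders")
def orders_silver():
    # works whether orders_bronze is defined in this notebook or another Path
    return dlt.read_stream("orders_bronze").dropna(subset=["order_id"])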
Hi there, I'm wondering if someone can help me understand what compute resources DLT uses? It's not clear to me at all if it uses the last compute cluster I had been working on, or something else entirely. Can someone please help clarify this?
Well, one thing they emphasize in the 'Advanced Data Engineer' training is that job clusters will terminate within 5 minutes after a job is completed. So this could support your theory about lowering costs. I think job clusters are actually design...