Topics with Label: Delta Live Table Pipelines

by ChristianRRL • Contributor

02-01-2024 1:42:07 PM

1756 Views
5 replies
3 kudos

Resolved! DLT Job Clusters: Continuous vs Triggered Cluster Start Times

Hi there,I'm curious if anyone is able to definitively help me answer how DLT Job Clusters operate/run.For example, the following is my baseline understanding of DLT Job Clusters. If I run a Triggered DLT Pipeline (e.g. daily) the job cluster takes m...

Get Started Discussions

Reply

1756 Views
5 replies
3 kudos

02-01-2024 1:42:07 PM

View Replies

Latest Reply

Kaniz
Community Manager

02-11-2024 11:01:04 PM

3 kudos

Hey there! Thanks a bunch for being part of our awesome community! We love having you around and appreciate all your questions. Take a moment to check out the responses – you'll find some great info. Your input is valuable, so pick the best solution...

3 kudos

02-11-2024 11:01:04 PM

4 More Replies

by ChristianRRL • Contributor

01-23-2024 12:48:35 PM

1286 Views
2 replies
1 kudos

DLT Primary Key Deduplication: Expectations vs. Constraints vs. Other?

I'm trying to figure out what's the best way to "de-duplicate" data via DLT. Currently, my only leads are:Manage data quality with Delta Live Tables | Databricks on AWSVia "Drop invalid records"Constraints on Databricks | Databricks on AWSVia "pre-de...

Get Started Discussions

Auto Loader

autoloader

Delta Live Table

Delta Live Table Pipeline

dlt

Reply

1286 Views
2 replies
1 kudos

01-23-2024 12:48:35 PM

View Replies

Latest Reply

Palash01
Contributor III

01-23-2024 10:22:29 PM

1 kudos

Hey @ChristianRRL ,Based on my understanding you want to de-duplicate your data during your DLT pipeline processing unfortunately I was not able to find a solution to this when I ran into this problem due to the native feature limitations.Limitations...

1 kudos

01-23-2024 10:22:29 PM

1 More Replies

by ChristianRRL • Contributor

01-18-2024 9:00:53 AM

1951 Views
2 replies
1 kudos

Resolved! DLT Notebook and Pipeline Separation vs Consolidation

Super basic question. For DLT pipelines I see there's an option to add multiple "Paths". Is it generally best practice to completely separate `bronze` from `silver` notebooks? Or is it more recommended to bundle both raw `bronze` and clean `silver` d...

Get Started Discussions

Reply

1951 Views
2 replies
1 kudos

01-18-2024 9:00:53 AM

View Replies

Latest Reply

ChristianRRL
Contributor

01-18-2024 3:54:28 PM

1 kudos

This is great! I completely missed the list view before.

1 kudos

01-18-2024 3:54:28 PM

1 More Replies

by ChristianRRL • Contributor

12-19-2023 2:47:00 PM

1172 Views
5 replies
2 kudos

DLT Compute Resources - What Compute Is It???

Hi there, I'm wondering if someone can help me understand what compute resources DLT uses? It's not clear to me at all if it uses the last compute cluster I had been working on, or something else entirely.Can someone please help clarify this?

Get Started Discussions

Reply

1172 Views
5 replies
2 kudos

12-19-2023 2:47:00 PM

View Replies

Latest Reply

quakenbush
Contributor

12-22-2023 12:04:22 AM

2 kudos

Well, one thing they emphasize in the 'Adavanced Data Engineer' Training is that job-clusters will terminate within 5 minutes after a job is completed. So this could be in support of your theory to lower costs. I think job-cluster are actually design...

2 kudos

12-22-2023 12:04:22 AM

4 More Replies