Topics with Label: Delta Live Tables

by ChristianRRL • Valued Contributor

02-01-2024 1:42:07 PM

6534 Views
3 replies
3 kudos

Resolved! DLT Job Clusters: Continuous vs Triggered Cluster Start Times

Hi there,I'm curious if anyone is able to definitively help me answer how DLT Job Clusters operate/run.For example, the following is my baseline understanding of DLT Job Clusters. If I run a Triggered DLT Pipeline (e.g. daily) the job cluster takes m...

Get Started Discussions

Reply

6534 Views
3 replies
3 kudos

02-01-2024 1:42:07 PM

View Replies

Latest Reply

melbourne
Contributor

02-02-2024 3:50:43 AM

3 kudos

Ideally one would expect clusters used for DLT pipeline to terminate after the pipeline execution has finished. However, while running in `development` environment, you'll notice it doesn't terminate on its own, whereas in `production` it terminates ...

3 kudos

02-02-2024 3:50:43 AM

2 More Replies

by ChristianRRL • Valued Contributor

01-23-2024 12:48:35 PM

5104 Views
2 replies
1 kudos

DLT Primary Key Deduplication: Expectations vs. Constraints vs. Other?

I'm trying to figure out what's the best way to "de-duplicate" data via DLT. Currently, my only leads are:Manage data quality with Delta Live Tables | Databricks on AWSVia "Drop invalid records"Constraints on Databricks | Databricks on AWSVia "pre-de...

Get Started Discussions

Auto Loader

autoloader

Delta Live Table

Delta Live Table Pipeline

dlt

Reply

5104 Views
2 replies
1 kudos

01-23-2024 12:48:35 PM

View Replies

Latest Reply

Palash01
Valued Contributor

01-23-2024 10:22:29 PM

1 kudos

Hey @ChristianRRL ,Based on my understanding you want to de-duplicate your data during your DLT pipeline processing unfortunately I was not able to find a solution to this when I ran into this problem due to the native feature limitations.Limitations...

1 kudos

01-23-2024 10:22:29 PM

1 More Replies

by ChristianRRL • Valued Contributor

01-18-2024 9:00:53 AM

4932 Views
2 replies
1 kudos

Resolved! DLT Notebook and Pipeline Separation vs Consolidation

Super basic question. For DLT pipelines I see there's an option to add multiple "Paths". Is it generally best practice to completely separate `bronze` from `silver` notebooks? Or is it more recommended to bundle both raw `bronze` and clean `silver` d...

Get Started Discussions

Reply

4932 Views
2 replies
1 kudos

01-18-2024 9:00:53 AM

View Replies

Latest Reply

ChristianRRL
Valued Contributor

01-18-2024 3:54:28 PM

1 kudos

This is great! I completely missed the list view before.

1 kudos

01-18-2024 3:54:28 PM

1 More Replies

by ChristianRRL • Valued Contributor

12-19-2023 2:47:00 PM

7241 Views
5 replies
2 kudos

DLT Compute Resources - What Compute Is It???

Hi there, I'm wondering if someone can help me understand what compute resources DLT uses? It's not clear to me at all if it uses the last compute cluster I had been working on, or something else entirely.Can someone please help clarify this?

Get Started Discussions

Reply

7241 Views
5 replies
2 kudos

12-19-2023 2:47:00 PM

View Replies

Latest Reply

quakenbush
Contributor

12-22-2023 12:04:22 AM

2 kudos

Well, one thing they emphasize in the 'Adavanced Data Engineer' Training is that job-clusters will terminate within 5 minutes after a job is completed. So this could be in support of your theory to lower costs. I think job-cluster are actually design...

2 kudos

12-22-2023 12:04:22 AM

4 More Replies

by ChristianRRL • Valued Contributor

12-13-2023 9:48:55 AM

2030 Views
2 replies
0 kudos

Auto Loader Use Case Question - Centralized Dropzone to Bronze?

Good day,I am trying to use Auto Loader (potentially extending into DLT in the future) to easily pull data coming from an external system (currently located in a single location) and organize it and load it respectively. I am struggling quite a bit a...

Get Started Discussions

Reply

2030 Views
2 replies
0 kudos

12-13-2023 9:48:55 AM

View Replies

Latest Reply

ChristianRRL
Valued Contributor

12-18-2023 9:37:27 AM

0 kudos

Quick follow-up on this @Retired_mod (or to anyone else in the Databricks multi-verse who is able to help clarify this case).I understand that the proposed solution would work for a "one-to-one" case where many files are landing in a specific dbfs pa...

0 kudos

12-18-2023 9:37:27 AM

1 More Replies

by vkuznetsov • New Contributor III

07-13-2023 7:58:39 AM

1171 Views
1 replies
0 kudos

Problem sharing a streaming table created in Delta Live Table via Delta Sharing

Hi all,I hope you could help me to figure out what I am missing.I'm trying to do a simple thing. To read the data from the data ingestion zone (csv files saved to Azure Storage Account) using the Delta Live Tables pipeline and share the resulting tab...

Get Started Discussions

Reply

1171 Views
1 replies
0 kudos

07-13-2023 7:58:39 AM

View Replies

Latest Reply

vkuznetsov
New Contributor III

07-13-2023 8:09:33 AM

0 kudos

Sorry, I think I've created the post in the wrong thread. Created the same post in the Community Cove.

0 kudos

07-13-2023 8:09:33 AM

Databricks Community

Forum Posts

Resolved! DLT Job Clusters: Continuous vs Triggered Cluster Start Times

DLT Primary Key Deduplication: Expectations vs. Constraints vs. Other?

Resolved! DLT Notebook and Pipeline Separation vs Consolidation

DLT Compute Resources - What Compute Is It???

Auto Loader Use Case Question - Centralized Dropzone to Bronze?

Problem sharing a streaming table created in Delta Live Table via Delta Sharing