Hello @kazinahian, Azure Databricks offers several options for building ETL (Extract, Transform, Load) data pipelines, ranging from low-code to more code-centric approaches:
Delta Live Tables
Delta Live Tables (DLT) is a declarative framework for bu...
Hi @NathanSundarara, regarding your current approach, here are potential solutions and considerations. Deduplication: implement deduplication strategies within your DLT pipeline. For example:
clicksDedupDf = (
spark.readStream.table("LIVE.rawCl...
Hi @ChristianRRL, as a first quick check, could you please create a PySpark DataFrame with the _metadata and _rescued_data columns, query the DataFrame to confirm you can see those columns, and then create a view from that DataFrame?
Hello @guangyi, I am getting back to you with some insights.
Regarding your first question about checkpointing:
You can manually check the checkpointing location of your stream table. The checkpoints of your Delta Live Tables are under Storage locatio...