Data Engineering

Forum Posts

Sorted by:

by Graham • New Contributor III

09-16-2022 10:41:51 AM

8595 Views
5 replies
3 kudos

"MERGE" always slower than "CREATE OR REPLACE"

OverviewTo update our Data Warehouse tables, we have tried two methods: "CREATE OR REPLACE" and "MERGE". With every query we've tried, "MERGE" is slower.My question is this: Has anyone successfully gotten a "MERGE" to perform faster than a "CREATE OR...

Data Engineering

8595 Views
5 replies
3 kudos

09-16-2022 10:41:51 AM

View Replies

Latest Reply

Manisha_Jena
Databricks Employee

11-02-2023 2:18:28 AM

3 kudos

Hi @Graham Can you please try Low Shuffle Merge [LSM] and see if it helps? LSM is a new MERGE algorithm that aims to maintain the existing data organization (including z-order clustering) for unmodified data, while simultaneously improving performan...

3 kudos

11-02-2023 2:18:28 AM

4 More Replies

by Rishabh-Pandey • Esteemed Contributor

03-11-2023 12:13:52 AM

1166 Views
0 replies
3 kudos

Hey there! I've noticed that many people seem to be confused about the differences between databases, data warehouses, and data lakes. It's un...

Hey there! I've noticed that many people seem to be confused about the differences between databases, data warehouses, and data lakes. It's understandable, as these terms can be easily misunderstood or used interchangeablyHere is the summary for all ...

Data Engineering

1166 Views
0 replies
3 kudos

03-11-2023 12:13:52 AM

by StephanieAlba • Databricks Employee

11-24-2021 12:26:25 PM

6840 Views
2 replies
3 kudos

Resolved! Best Data Model for moving from DW to Delta lake

I’m curious what Databricks recommends how we model the data. Do they recommend that the data be in 3rd normal form (3NF). Or should be it be dimensionally modeled (facts and dimensions)

Data Engineering

6840 Views
2 replies
3 kudos

11-24-2021 12:26:25 PM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

11-25-2021 12:13:53 AM

3 kudos

It all depends on the use case.3NF is ideal for transactional systems. So for a data warehouse/lakehouse that might not be ideal.However there certainly are cases where it is interesting.Star schema's are def still relevant, BUT with the processing p...

3 kudos

11-25-2021 12:13:53 AM

1 More Replies

by User16790091296 • Contributor II

06-24-2021 8:37:03 AM

1043 Views
1 replies
0 kudos

Databricks- How to move data from Databricks temp to DataWarehouse or move from Databricks Table to Data warehouse directly?

Data Engineering

1043 Views
1 replies
0 kudos

06-24-2021 8:37:03 AM

View Replies

Latest Reply

Ryan_Chynoweth
Esteemed Contributor

07-30-2021 9:40:14 AM

0 kudos

You have a couple options to write data into a Data Warehouse. Some DWs have special connectors that allow for high performance between Databricks and the DW (for example there is a Spark connector for Snowflake and for Azure Synapse DW). Some data w...

0 kudos

07-30-2021 9:40:14 AM

by AbhishekBreeks • New Contributor II

07-28-2021 7:21:57 AM

941 Views
0 replies
0 kudos

Host a Star Schema Data Warehouse on Azure Databricks

Hello, Is it a good idea to Host a Schema Data Warehouse on Azure Databricks database itself. Usually we use Azure Databricks to Prep the data and then Host it on Azure Sql Database. However question is can we not Host the data on Azure Databricks i...

Data Engineering

941 Views
0 replies
0 kudos

07-28-2021 7:21:57 AM

Databricks Community

"MERGE" always slower than "CREATE OR REPLACE"

Hey there! I&#39;ve noticed that many people seem to be confused about the differences between databases, data warehouses, and data lakes. It&#39;s un...

Resolved! Best Data Model for moving from DW to Delta lake

Databricks- How to move data from Databricks temp to DataWarehouse or move from Databricks Table to Data warehouse directly?

Host a Star Schema Data Warehouse on Azure Databricks

Hey there! I've noticed that many people seem to be confused about the differences between databases, data warehouses, and data lakes. It's un...