by
Graham
• New Contributor III
- 8595 Views
- 5 replies
- 3 kudos
Overview: To update our Data Warehouse tables, we have tried two methods: "CREATE OR REPLACE" and "MERGE". With every query we've tried, "MERGE" is slower. My question is this: Has anyone successfully gotten a "MERGE" to perform faster than a "CREATE OR...
Latest Reply
Hi @Graham, can you please try Low Shuffle Merge (LSM) and see if it helps? LSM is a new MERGE algorithm that aims to maintain the existing data organization (including z-order clustering) for unmodified data, while simultaneously improving performan...
- 1166 Views
- 0 replies
- 3 kudos
Hey there! I've noticed that many people seem to be confused about the differences between databases, data warehouses, and data lakes. It's understandable, as these terms can be easily misunderstood or used interchangeably. Here is the summary for all ...
- 6840 Views
- 2 replies
- 3 kudos
I’m curious how Databricks recommends we model the data. Do they recommend that the data be in third normal form (3NF), or should it be dimensionally modeled (facts and dimensions)?
Latest Reply
It all depends on the use case. 3NF is ideal for transactional systems, so for a data warehouse/lakehouse it might not be ideal. However, there certainly are cases where it is interesting. Star schemas are definitely still relevant, BUT with the processing p...
- 941 Views
- 0 replies
- 0 kudos
Hello,
Is it a good idea to host a schema data warehouse on the Azure Databricks database itself?
Usually we use Azure Databricks to prep the data and then host it on Azure SQL Database. However, the question is: can we not host the data on Azure Databricks i...