Palash01
Valued Contributor

Hey @param_sen 

Given your concerns about expensive joins and prioritizing analytics with flat raw data, here are some suggestions:

  1. Analyze the most common queries and reports you anticipate. Do they heavily rely on joins across dimensions? If not, the performance impact of normalization might be minimal.
  2. Prioritize denormalization for key joins: Strategically denormalize specific dimensions frequently joined with fact tables in the Gold layer. This can balance flexibility with query performance.
  3. Materialized views: Explore materialized views in the Gold layer to pre-join frequently accessed data, potentially mitigating some join costs.
  4. Begin with a less normalized Silver layer, focusing on cleansing and enriching the data.You can always introduce normalization later if needed.
Leave a like if this helps! Kudos,
Palash