Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-06-2024 11:37 PM
Hey @param_sen
Given your concerns about expensive joins and prioritizing analytics with flat raw data, here are some suggestions:
- Analyze the most common queries and reports you anticipate. Do they heavily rely on joins across dimensions? If not, the performance impact of normalization might be minimal.
- Prioritize denormalization for key joins: Strategically denormalize specific dimensions frequently joined with fact tables in the Gold layer. This can balance flexibility with query performance.
- Materialized views: Explore materialized views in the Gold layer to pre-join frequently accessed data, potentially mitigating some join costs.
- Begin with a less normalized Silver layer, focusing on cleansing and enriching the data.You can always introduce normalization later if needed.
Leave a like if this helps! Kudos,
Palash
Palash