User16826994223
Databricks Employee
Databricks Employee

Lakehouse is a concept defined with the following Parameter-

  1. Data is stored in an open standard format.
  2. Data is stored in a way which support Data Science,ML and BI loads.
  3. Delta is just a way or engine on cloud storage that provides control on data and prevent it from becoming data swamp and also add performance and provide sql like query support
  4. for lake house it is always recommended to have 3 layers,
  • Bronze - Raw data as it is from OTP
  • Silver -data in a curated format and with a filter that does not allow any junk data to silver, this layer is best suited for Data science and ML
  • gold layer-Purely aggregated data that helps in BI and can be used in Machine learning too.