cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

What are best practices for designing a large-scale data engineering pipeline on Databricks for real

Suheb
New Contributor II

How do you design a scalable, reliable pipeline that handles both fast/continuous data and slower bulk data in the same system?

1 REPLY 1

Coffee77
Contributor III

Very generic question ๐Ÿ™‚ Here are general rules and best practices related to Databricks well-architected framework: https://docs.databricks.com/aws/en/lakehouse-architecture/well-architected Take a deeper look on operational excellence, reliability and performance efficiency. On the other hand, try to adopt a mediallion architecture to logically organize data https://www.databricks.com/glossary/medallion-architecture and usage of Unity catalog to centrally control and governance data.


Lifelong Learner Cloud & Data Solution Architect | https://www.youtube.com/@CafeConData