โ10-12-2022 08:50 PM
Our use case is simple - to store our PB scale data and transform and use for BI, reporting and analytics. As my title says am trying to eliminate expenditure on Redshift as we are starting as a green field.
I know I have designed/used just Delta lake as an analytics data warehouse though we still used Redshift as a legacy warehouse by other teams in my previous org. But just trying to get some consensus here to gain more confidence.
Thanks,
Swetha
โ10-12-2022 11:01 PM
Yeah I believe so delta lake is just getting better and better I think its a great way forward to support usecases similar to yours
โ10-13-2022 01:36 AM
I think it can. If we were a few years earlier I would say: no. But now with databricks sql (or other software like Trino etc) and delta lake, I don't see why it would not work.
As we speak I am looking into getting rid of your Azure Synapse database, and I think we can pull that of without issues.
The main attention point however is to not forget to look at what tools are being used to consume data.
Because from a technical point of view, this is quickly overlooked. f.e. if your company uses some paginated reporting software, you better make sure that will still work.
โ10-18-2022 03:57 AM
Yes yes yes ๐
โ10-18-2022 04:01 AM
Yes that data lake/lakehouse is excellent for replacing Redshift but I would not write more details as it is a long topic.
โ10-18-2022 04:03 AM
That one wasn't so great, hehe Thank you!
โ11-27-2022 04:51 AM
Hi @Swetha Marakaniโ
Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help.
We'd love to hear from you.
Thanks!
Passionate about hosting events and connecting people? Help us grow a vibrant local communityโsign up today to get started!
Sign Up Now