03-01-2022 07:00 AM
Is there a difference between cluster modes in this case? Could GraphX work better on a single node cluster than on a standard cluster or a high concurrency cluster (for multiple users)? Would a less concurrent cluster be more efficient for graph modelling?
03-01-2022 11:34 AM
Single node - for development purposes,
high concurrency - when multiple users run notebooks at the same time in parallel,
so standard is usually the best option.
Regarding the VM type, I would bet on compute optimized.
I also recommend reading the ebook "Spark GraphX in Action": https://livebook.manning.com/book/spark-graphx-in-action/table-of-contents/
03-02-2022 12:19 AM
I'd say start as cheap as possible and check the runtime.
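To compare cluster configurations this way, it helps to time the same job on each one. Below is a minimal sketch of a timing helper for a notebook cell. The names `timed` and `count_edges` are hypothetical, and `count_edges` is only a stand-in for whatever graph job you actually run.

```python
import time


def timed(job, *args, **kwargs):
    """Run job(*args, **kwargs) and return (result, elapsed_seconds)."""
    start = time.perf_counter()
    result = job(*args, **kwargs)
    return result, time.perf_counter() - start


def count_edges(edges):
    # Hypothetical placeholder workload; replace with the real graph job.
    return sum(1 for _ in edges)


result, seconds = timed(count_edges, [(1, 2), (2, 3), (3, 1)])
print(f"result={result}, elapsed={seconds:.4f}s")
```

Run the same job with the same input on each cluster type and compare the elapsed time against the cost of the cluster.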
03-06-2022 04:16 PM
@Direo Direo - What do you think of these answers? If either of them stands out as best, would you please mark it that way? If you have more questions, please bring them on!