cancel
Showing results for 
Search instead for 
Did you mean: 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results for 
Search instead for 
Did you mean: 

Optimal Cluster Configuration for Training on Billion-Row Datasets

moh3th1
New Contributor

Hello Databricks Community,

I am currently facing a challenge in configuring a cluster for training machine learning models on a dataset consisting of approximately a billion rows and 40 features. Given the volume of data, I want to ensure that the cluster is optimally configured to handle such a workload efficiently.

I would greatly appreciate insights from the community on the following:

  1. Machine Selection: What are the key considerations when selecting machine types for the cluster? Should I prioritize memory, CPU, or GPU for specific models?

  2. Cluster Configuration: What are the best practices for setting up the cluster configuration regarding node types and quantity? How do you decide on the balance between driver and worker nodes?

  3. Performance Optimization: Are there specific settings or tips for optimizing Spark configurations or Databricks-specific features that you have found effective for handling large-scale data?

  4. Cost Efficiency: How do you manage the trade-off between performance and cost? Are there specific configurations that provide a good balance?

Any examples, experiences, or resources you could share would be incredibly helpful. I am particularly interested in case studies or benchmarks that might guide the configuration process.

0 REPLIES 0

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group