cancel
Showing results for 
Search instead for 
Did you mean: 
Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.
cancel
Showing results for 
Search instead for 
Did you mean: 

Auto ML training - Early Stopping (training time) / Data Split

spearitchmeta
New Contributor III

Greetings dear community,

I am using AutoML for the first time ands was wondering whether it is possible to have early stopping or incorporate any approach in my code to make the training of a model stop when the performance plateaus. Early stopping is something one can implement in the traditional way of training models (without auto ML). Additionally tracking loss function, performance evolution, etc...
I would be interested to have you thoughts on this since I  am doing a client demo in the coming days.

# Run AutoML with manual's data split (0.8/0.2)
automl_result_manual_split = automl.classify(
dataset=train_df,
target_col="cae_type",
primary_metric="f1",
timeout_minutes=30,
experiment_dir=f"{group_workspace_base}/manual_split",
experiment_name=experiment_auto_ml_manual_split
)
 
2) My second question is regarding data split. As you can see here, I did a manual split (0.8 training data/ 0.2 testing data) but I am aware that data splitting can be done automatically by AutoML. Are there any resources that recommend the one or the other? (I also have class imbalance but I did not consider this in this first demo trial)

Best regards

1 ACCEPTED SOLUTION

Accepted Solutions

BigRoux
Databricks Employee
Databricks Employee

First question: See here for what is possible. https://docs.databricks.com/aws/en/machine-learning/automl/classification

Second question: See here for what is possible. https://docs.databricks.com/gcp/en/machine-learning/automl/classification-data-prep

 

Hope this helps, Louis.

View solution in original post

1 REPLY 1

BigRoux
Databricks Employee
Databricks Employee

First question: See here for what is possible. https://docs.databricks.com/aws/en/machine-learning/automl/classification

Second question: See here for what is possible. https://docs.databricks.com/gcp/en/machine-learning/automl/classification-data-prep

 

Hope this helps, Louis.

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now