Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.

AutoML split with dt column not working properly

Noura_azza
New Contributor II

I am using AutoML and want to split my data into train, validation, and test sets using a dt column (one date for train, a different date for validation, and a third date for test). The problem is that AutoML fails: there are only training metrics (no validation or test ones), and when I check the data exploration notebook, it seems that all samples are treated as training samples even though their dt values differ. When I look at the model artifacts, I see that the dt column was used as a feature by the model.

2 REPLIES

Noura_azza
New Contributor II

This is what I see in my data exploration notebook. All dates are considered part of the training split:

[Screenshot: Noura_azza_1-1706102702277.png]

maggiewang
Databricks Employee

Hello! Did you try specifying a column name as the manual split column?

Then you can fully control which rows are in train / validate / test splits: https://docs.databricks.com/en/machine-learning/automl/automl-data-preparation.html#split-data-for-r...
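As a rough sketch of the manual-split approach, one way to prepare the data is to derive a string column from dt whose values are "train", "validate", or "test". The helper name `add_split_column`, the column name `split`, and the sample dates and values below are made up for illustration, and the sketch assumes the dt values are stored as strings. The commented-out AutoML call at the end assumes a runtime where `split_col` and `exclude_cols` are available in the Python API, so please check the linked docs for your Databricks Runtime version.

```python
import pandas as pd


def add_split_column(df, dt_col, train_dt, validate_dt, test_dt, split_col="split"):
    """Return a copy of df with a string column marking each row's split.

    Hypothetical helper: maps each value of dt_col to "train", "validate",
    or "test". Assumes the dt values have the same type as the dates passed in.
    """
    mapping = {train_dt: "train", validate_dt: "validate", test_dt: "test"}
    out = df.copy()
    out[split_col] = out[dt_col].map(mapping)

    # Fail loudly if a date is not assigned to any split.
    unmapped = out[split_col].isna()
    if unmapped.any():
        bad = sorted(out.loc[unmapped, dt_col].unique())
        raise ValueError(f"No split assigned for {dt_col} values: {bad}")
    return out


# Made-up sample data: one date per split.
example = pd.DataFrame(
    {
        "dt": ["2024-01-01", "2024-01-01", "2024-01-02", "2024-01-03"],
        "feature": [0.1, 0.4, 0.7, 0.9],
        "label": [0, 1, 0, 1],
    }
)
prepared = add_split_column(example, "dt", "2024-01-01", "2024-01-02", "2024-01-03")

# On Databricks (assumption: split_col and exclude_cols are supported by your
# runtime's AutoML Python API), the call might look like this:
# from databricks import automl
# summary = automl.classify(
#     dataset=prepared,
#     target_col="label",
#     split_col="split",      # manual train / validate / test assignment
#     exclude_cols=["dt"],    # keep the raw date out of the features
# )
```

Excluding dt explicitly is meant to address the part of the question where the date column ended up as a model feature.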
