Random vs Stratified Splits

Photo by Max van den Oetelaar on Unsplash

In machine learning and deep learning, we often split our full dataset into train, validation and test sets.

The training set is used for training (fitting) the model. The validation set is used for hyperparameter tuning. Finally, the test set is used for the final evaluation of the model. If we don’t want…

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Rukshan Pramoditha

Rukshan Pramoditha

1M+ Total Views | 65K+ Monthly Views | Top 50 Data Science/AI/ML Writer on Medium | Sign up: https://rukshanpramoditha.medium.com/membership