MACHINE LEARNING

APPLICATION OF SUPERVISED LEARNING

MACHINE LEARNINGHARD QUESTIONS

Question [CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]
Suppose you have picked the parameter for a model using 10-fold cross validation (CV). Which of the following is the best way to pick a final model to use and estimate its error?
A
Pick any of the 10 models you built for your model; use its error estimate on the held-out data
B
Train a new model on the full data set, using the parameter you found; use the average CV error as its error estimate
C
Average all of the 10 models you got; use the average CV error as its error estimate
D
Average all of the 10 models you got; use the error the combined model gives on the full training set The best way to pick a final model is to train a new machine learning model on the full data set using the parameter learnt and to use the average cross-validation error as its error estimate.
Explanation: 

Detailed explanation-1: -Similarly, 10-fold=100/10=10. So, 90% of data will be used for training and 10% will be used for validation. You should divide your set to 5 parts, and in each part 80 % for training and 20% for testing. But in each subset different 20 % is for testing.

Detailed explanation-2: -model selection module’s cross val score helper class can be used for applying K-fold cross-validation in a simple manner. Use the LOOCV method for very small data sets. For very large data sets, one can use the value of K as 5. The value of K = 10 is standard value of K.

Detailed explanation-3: -Pick k – number of times the model will be trained. Pick a number of samples which will be the test set. Split the dataset. Train on the training set. Validate on the test set. Save the result of the validation. Repeat steps 3-6 k times. More items

There is 1 question to complete.