MCQ IN COMPUTER SCIENCE & ENGINEERING

COMPUTER SCIENCE AND ENGINEERING

MACHINE LEARNING

Question [CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]
What is limitation of over sampling?
A
It could make model underfit
B
It could make model overfit
C
Either A or B
D
None of the above
Explanation: 

Detailed explanation-1: -“the random oversampling may increase the likelihood of overfitting occurring, since it makes exact copies of the minority class examples.

Detailed explanation-2: -Overfitting occurs when the model cannot generalize and fits too closely to the training dataset instead. Overfitting happens due to several reasons, such as: The training data size is too small and does not contain enough data samples to accurately represent all possible input data values.

Detailed explanation-3: -The main disadvantage with oversampling, from our perspective, is that by making exact copies of existing examples, it makes overfitting likely. In fact, with oversampling it is quite common for a learner to generate a classification rule to cover a single, replicated, example.

Detailed explanation-4: -Typically, if there are not enough samples in the training data set, especially if the number of samples is less than the number of model parameters (count by element), overfitting is more likely to occur. Additionally, as we increase the amount of training data, the generalization error typically decreases.

There is 1 question to complete.