DATABASE FUNDAMENTALS
BASICS OF BIG DATA
Question
[CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]
|
|
1 and 3
|
|
1 and 4
|
|
2 and 4
|
|
2 and 3
|
Detailed explanation-1: -Machine Learning algorithm to be used purely depends on the type of data in a given dataset.
Detailed explanation-2: -The smaller the training data set, the lower the test accuracy, while the training accuracy remains at about the same level.
Detailed explanation-3: -1 Answer. When the training set is small, a model that has a right bias and low variance seems to work better because they are less likely to overfit. For example, Naives bayes works best when the training set is large.
Detailed explanation-4: -Now, to use which algorithm depends on the objective of the business problem. If inference is the goal, then restrictive models are better as they are much more interpretable. Flexible models are better if higher accuracy is the goal. In general, as the flexibility of a method increases, its interpretability decreases.