COMPUTER SCIENCE AND ENGINEERING
MACHINE LEARNING
Question
[CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]
|
|
Impute ‘A’ for 50% of the missing values and ‘B’ for the rest.
|
|
Impute ‘B’ for all missing values
|
|
Impute ‘B’ for 50% of the missing values and ‘C’ for the rest.
|
|
Impute randomly one of the three values (A, B or C)
|
Detailed explanation-1: -Step 2: Data Cleaning Next, this data flows to the cleaning step. To make sure the data paints a consistent picture that your pipeline can learn from, Cortex automatically detects and scrubs away outliers, missing values, duplicates, and other errors.
Detailed explanation-2: -Feature engineering can help data scientists by accelerating the time it takes to extract variables from data, allowing for the extraction of more variables. Automating feature engineering will help organizations and data scientists create models with better accuracy.
Detailed explanation-3: -What Amazon SageMaker option should the company use to train their ML models that reduces the management and automates the pipeline for future retraining? Create and train your XGBoost algorithm on your local laptop and then use an Amazon SageMaker endpoint to host the ML model.
Detailed explanation-4: -3. How do we perform Bayesian classification when some features are missing? Explanation: When some features are missing, while performing Bayesian classification we don’t use general methods of handling missing values but we integrate the posteriors probabilities over the missing features for better predictions.