A Data Scientist at a retail company is using Amazon SageMaker to classify social media posts that mention the company into one of two categories:Posts that require a response from the company, and posts that do not. The Data Scientist is using a training dataset of 10, 000 posts, each of which contain the timestamp, author, and full text of each post. However, the Data Scientist is missing the target labels that are required for training.Which approach can the Data Scientist take to create valid target label data? (Select TWO.)

MACHINE LEARNING PIPELINE

Please wait while the activity loads.
If this activity does not load, try refreshing your browser. Also, this page requires javascript. Please visit using a browser with javascript enabled.

If loading fails, click here to try again

Question [CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]

A	Ask the social media handling team to review each post using Amazon SageMaker GroundTruth and provide the label
B	Use the sentiment analysis natural language processing library to determine whether a post requires a response
C	Use Amazon Mechanical Turk to publish Human Intelligence Tasks that ask Turk workers to label the posts
D	Use the a priori probability distribution of the two classes. Then, use Monte-Carlo simulation to generate the labels
E	Use K-Means to cluster posts into various groups, and pick the most frequent word in each group as its label

Explanation:

Detailed explanation-1: -A machine learning pipeline is the end-to-end construct that orchestrates the flow of data into, and output from, a machine learning model (or set of multiple models). It includes raw data input, features, outputs, the machine learning model and model parameters, and prediction outputs.

There is 1 question to complete.

APPLICATION OF SUPERVISED LEARNING

MACHINE LEARNING PIPELINE