APPLICATION OF SUPERVISED LEARNING
MACHINE LEARNING PIPELINE
Question
[CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]
|
A tech startup is building an image classification model. During the process, they copied over some of their validation data into their training examples, creating duplicate values in the training and validation subsets.Which is a possible result of taking this approach? (Select TWO.)
|
The model may perform worse with the test dataset than with the validation dataset
|
|
This is a common practice in machine learning and will improve the overall performance of the model
|
|
This could lead to overfitting the model
|
|
This is a good way to increase the training dataset size and therefore strengthen the model’s ability to generalize
|
Explanation:
Detailed explanation-1: -Sentiment analysis (or opinion mining) is a natural language processing (NLP) technique used to determine whether data is positive, negative or neutral.
Detailed explanation-2: -Data Ingestion Each ML pipeline starts with the Data ingestion step. In this step, the data is processed into a well-organized format, which could be suitable to apply for further steps.
Detailed explanation-3: -A data science pipeline is the set of processes that convert raw data into actionable answers to business questions. Data science pipelines automate the flow of data from source to destination, ultimately providing you insights for making business decisions.
There is 1 question to complete.