MCQ IN COMPUTER SCIENCE & ENGINEERING

COMPUTER SCIENCE AND ENGINEERING

MACHINE LEARNING

Question [CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]
What are the common approaches for encoding categorical data
A
One hot encoding
B
Standardization
C
Ordinal encoding
D
None of the above
Explanation: 

Detailed explanation-1: -Label Encoding or Ordinal Encoding This type of encoding is used when the variables in the data are ordinal, ordinal encoding converts each label into integer values and the encoded data represents the sequence of labels.

Detailed explanation-2: -Hash Encoding Hash Encoding represents the categorical data into numerical value by the hashing function. Hashing is often used in data encryption or data comparison, but the main part is still similar-transform one feature to another using hashing function.

Detailed explanation-3: -Thermometer Encoder is used to represent categorical variables as numerical values, specifically for ordinal variables where the categories have an inherent order. The encoding works by creating a binary representation of each category and concatenating the binary values to form a new numerical variable.

Detailed explanation-4: -Ordinal encoding should be used for ordinal variables (where order matters, like cold, warm, hot ); vs Label encoding should be used for non-ordinal (aka nominal) variables (where order doesn’t matter, like blonde, brunette )

There is 1 question to complete.