DATA REPRESENTATION AND NUMBER SYSTEMS
ASCII AND UNICODE CHARACTER ENCODING
Question
[CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]
|
|
2 bytes per character.
|
|
3 bytes per character.
|
|
4 bytes per character.
|
|
None of the above
|
Detailed explanation-1: -One byte character sets can contain 256 characters. The current standard, though, is Unicode which uses two bytes to represent all characters in all writing systems in the world in a single set. The original ASCII was a 7 bit character set (128 possible characters) with no accented letters.
Detailed explanation-2: -The standard ASCII character set is only 7 bits, and characters are represented as 8-bit bytes with the most significant bit set to 0. Modern computers almost universally use 8-bit bytes, and the extended ASCII character set includes 127 more 8-bit characters, where the most significant bit is set to 1.
Detailed explanation-3: -Here are the original ASCII characters from 0-127. These are the same in UTF-8. There are 2048 possible 2-byte characters, but not all of them are valid and not all of the valid characters are used.