Unicode System | infoStud blogs

Unicode is a universal international standard character encoding that is equipped for addressing the majority of the world's written languages or dialects. 

Why java utilizes Unicode System? 

Prior to Unicode, there were numerous language principles: 

  • ASCII (American Standard Code for Information Interchange) for the United States. 
  • ISO 8859-1 for Western European Language. 
  • KOI-8 for Russian. 
  • GB18030 and BIG-5 for Chinese, etc. 

Issue 

This messed two up: 

  • A specific code value compares to various letters in the different language norms or languages. 
  • The encodings for dialects or languages with enormous character sets have variable lengths. Some normal characters are encoded as single bytes, others require at least two bytes. 

Solution or Arrangement 

To tackle these issues, another language standard was created for example Unicode System. 

In Unicode, a character holds 2 byte, so java likewise utilizes 2 byte for characters. 

most reduced value:\u0000

Post a Comment

0 Comments