This content has been marked as final. Show 2 replies
kaja.mohideen wrote:Strings in Java are always UNICODE values encoded a UTF-16 code points. You don't have UTF-8 strings or ISO-8859-1 strings or ASCII strings - you only have UNICODE strings. If Strings are derived from utf-8 bytes then the conversion must be made explicit in the conversion. For example String x = new String(bytes,"utf-8") will convert the utf-8 bytes to a String containing the UNICODE characters encoded as UTF-16 code points. An important point is that once you have a String you lose all knowledge of how that String was created and what it was created from since nothing is stored within String to indicate this.
Most of the string carry UTF-8 values (Japanese).