Database Administration (MOSC)

MOSC Banner

DMU codepoint conversion in UTF8 is Different

edited Apr 30, 2013 8:52AM in Database Administration (MOSC) 6 commentsAnswered
Hi Gurus,
We are in the process of converting our production 11.2.0.3 from WE8ISO8859P15 characterset to AL32UTF8. During trial runs and testing we have found certain WIN1252 characters are not being converted to the correct utf8 character encoding. We have used DMU with assumed characterset as WE8MSWIN1252 which should have been aware of these characters during characterset conversion. We are seeing particularly characters in the range 128-159 to be lost during conversion.

For example

WIN1252 (aka CP1252) 0x9E is ‘LATIN SMALL LETTER Z WITH CARON’, which in UTF8 (when looked up by description) is 0xC5BE. But after conversion, 0x9E is actually becoming 0xC29E instead of 0xC5BE.

Howdy, Stranger!

Log In

To view full details, sign in to My Oracle Support Community.

Register

Don't have a My Oracle Support Community account? Click here to get started.

Category Leaderboard

Top contributors this month

New to My Oracle Support Community? Visit our Welcome Center

MOSC Help Center