Skip to Main Content

APEX

Announcement

For appeals, questions and feedback about Oracle Forums, please email oracle-forums-moderators_us@oracle.com. Technical questions should be asked in the appropriate category. Thank you!

Interested in getting your voice heard by members of the Developer Marketing team at Oracle? Check out this post for AppDev or this post for AI focus group information.

Extended characters rendered from the db

Scott WesleyJan 23 2019 — edited Jan 30 2019

We have a record being received from social media where people have made their name a little fancy, using some extended character set, no doubt using some font generator, eg:

B R O O K E

Some people are just using their name, such as

Edit - These names where filtered out of the post, so here is a screenshot

pastedImage_0.png

I found this because WLS was delivering an APEX page missing some characters, causing some syntax error.

Uncaught SyntaxError: Unexpected string

pastedImage_0.png

I hypothesise it’s because of these special characters, there more bytes than WLS expects, just like the problem we've had after updating files on the server, and the browser is delivered incomplete files, causing a JS error. I would suggest this could not be recreated on Tomcat, where we've had no such problems with updating files. That being said, it was only a problem on one particular page, unless there were just other non-load bearing characters lost from the other pages. Missing HTML is tolerated, unlike missing JS punctuation...

Update - I can replicate this part of the problem on Tomcat, and DB/APEX 18c, and the rendering problem moves as data on the page changes.

But I think the bigger problem is how to treat these records during searches.

pastedImage_1.png

But I can't search on 'emily'

pastedImage_2.png

like I do 'john'

pastedImage_7.png

ASCII() on the first character of the name returns 4036857988, not 69 as ASCII('E') would.

Yet a text search in the browser finds the text.

pastedImage_9.png

I know little about extended character sets, and I'm after some ideas. Should/can we be treating data with special fonts within the SQL?

Our character set is AL32UTF8, and DB 12.1.0.2.0 (soon to be an 18c flavour).

This was from APEX 5.1.3

Scott

Comments

Post Details

Added on Jan 23 2019
3 comments
585 views