Discussions
Categories
- 385.5K All Categories
- 4.9K Data
- 2.5K Big Data Appliance
- 2.4K Data Science
- 453.4K Databases
- 223.2K General Database Discussions
- 3.8K Java and JavaScript in the Database
- 47 Multilingual Engine
- 606 MySQL Community Space
- 486 NoSQL Database
- 7.9K Oracle Database Express Edition (XE)
- 3.2K ORDS, SODA & JSON in the Database
- 585 SQLcl
- 4K SQL Developer Data Modeler
- 188K SQL & PL/SQL
- 21.5K SQL Developer
- 46 Data Integration
- 46 GoldenGate
- 298.4K Development
- 4 Application Development
- 20 Developer Projects
- 166 Programming Languages
- 295K Development Tools
- 150 DevOps
- 3.1K QA/Testing
- 646.7K Java
- 37 Java Learning Subscription
- 37.1K Database Connectivity
- 201 Java Community Process
- 108 Java 25
- 22.2K Java APIs
- 138.3K Java Development Tools
- 165.4K Java EE (Java Enterprise Edition)
- 22 Java Essentials
- 176 Java 8 Questions
- 86K Java Programming
- 82 Java Puzzle Ball
- 65.1K New To Java
- 1.7K Training / Learning / Certification
- 13.8K Java HotSpot Virtual Machine
- 94.3K Java SE
- 13.8K Java Security
- 208 Java User Groups
- 25 JavaScript - Nashorn
- Programs
- 667 LiveLabs
- 41 Workshops
- 10.3K Software
- 6.7K Berkeley DB Family
- 3.6K JHeadstart
- 6K Other Languages
- 2.3K Chinese
- 207 Deutsche Oracle Community
- 1.1K Español
- 1.9K Japanese
- 474 Portuguese
Chinese Character Detection

Hi,
I have a user registration textbox in which the user will enter locale specific characters like Chinese, Japanese, Korean, etc. How I can detect the enter character in the textbox is Chinese , Japanese or Korean? Whether we have any API in java to handle it? I am using JDK1.6. Please let me know the best way in detecting these characters.
Thanks
Answers
-
You can't really do that unless it is a very specific character that only exists in one language but not the other. The three languages share the CJK code pages since there are many common characters. Java itself does not care so it needs to be something custom.
For example, the following code snippet checks whether the character is CJK or not:
public static boolean containsHanScript(String s) {<br/> for (int i = 0; i < s.length(); ) {<br/> int codepoint = s.codePointAt(i);<br/> i += Character.charCount(codepoint);<br/> if (Character.UnicodeScript.of(codepoint) == Character.UnicodeScript.HAN) {<br/> return true;<br/> }<br/> }<br/> return false;<br/>}
For reference see the following for a list of unicode character sets: https://docs.oracle.com/javase/7/docs/api/java/lang/Character.UnicodeScript.html
-
I have a user registration textbox in which the user will enter locale specific characters like Chinese, Japanese, Korean, etc. How I can detect the enter character in the textbox is Chinese , Japanese or Korean?
They can ONLY enter characters supported by the character set you are using for the textbox. You should already know what character set that is.