I use HTMLEditorKit to parse strings with HTML tags. The library has successfully detected and remove embedded tags. However, first thing I discovered was that it parsed <br> into spaces and spaces are compiled into spaces whose ASCII codes are 60s, not the normal one (#32).
So I just want to ask what can HTMLEditorKit do, what are potential bugs and is there any better alternative for a HTML parse in Java?
Thanks very much!