Oracle Text (MOSC)

MOSC Banner

Customizing Lexer for XML constructs

edited Sep 7, 2009 10:58AM in Oracle Text (MOSC) 3 commentsAnswered
I was wondering if it is feasible, or if anyone has any experience with, customizing the Oracle Text lexer to recognize certain XML constructs and affect the identification of tokens.  Specifically, I have some XML entities that I would like to be significant within tokens (e.g., "doesn't", where "'" is the XML entity).  I would also like to have Oracle Text ignore XML tag names, attribute names and attribute values when identifying tokens (basically, ignore everything between "<" and ">").  I using Oracle Text against VARCHAR2 data -- No use of the available XML datatypes.

Any help and insights is greatly appreciated!

Howdy, Stranger!

Log In

To view full details, sign in to My Oracle Support Community.

Register

Don't have a My Oracle Support Community account? Click here to get started.

Category Leaderboard

Top contributors this month

New to My Oracle Support Community? Visit our Welcome Center

MOSC Help Center