Go Directly To
Oracle Technology Network Community
My Oracle Support Community
OPN Cloud Connection
Oracle Employee Community
Oracle User Group Community
OTN Speaker Bureau
Get Started Guide
Join the world’s largest interactive community dedicated to Oracle technologies.
Learn from thousands of community experts
Get answers to your technical questions
Share your knowledge with peers
Please enter a title.
You can not post a blank message. Please type your message and try again.
Oracle Database + Options
This discussion is archived
on Jan 30, 2013 5:32 PM by Roger Ford-Oracle
Is possible define different lexer/tokenizer per section of document?
Jan 30, 2013 4:35 PM
I have documents in format:
And I want have <url> indexed with tokens "test1", "test2" ...
but I want to have content of <email> as one token "email@example.com".
How create oracle text index for this search:
I want content of <email> be indexed with user_lexer with printjoins containing "@ ."
but in other part of document I want to be "." token separator.
I have the same question
Show 0 Likes
This content has been marked as final.
Show 1 reply
Re: Is possible define different lexer/tokenizer per section of document?
Jan 30, 2013 5:32 PM
in response to
You can't have different PRINTJOINS for different sections, unfortunately (it's on our to-do list!)
Your best bet might be to pre-process the text in some fashion such that the punctuations in the URL are replaced by spaces.