I have documents in format:
And I want have <url> indexed with tokens "test1", "test2" ...
but I want to have content of <email> as one token "firstname.lastname@example.org".
How create oracle text index for this search:
I want content of <email> be indexed with user_lexer with printjoins containing "@ ."
but in other part of document I want to be "." token separator.
You can't have different PRINTJOINS for different sections, unfortunately (it's on our to-do list!)
Your best bet might be to pre-process the text in some fashion such that the punctuations in the URL are replaced by spaces.