Return every instance of a word in a document
ant7, May 25 2012 (edited May 30 2012)
I have a 1400-page PDF document that contains an ID in the format WARxxxxxx (nine characters in total) every two pages or so. I am looking for a way to query out every instance of these IDs. For example:
WAR123123
WAR010101
WAR987654
The table that stores the document uses a standard text index. I tried using ctx_doc.highlight and storing the results in another table, but I got PDF gibberish instead of readable English. Is this type of query and result possible through the ctx functions?
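To make the result I am after concrete, here is a minimal Python sketch of the matching step only. It assumes the PDF has already been converted to plain text by some other means, and that the six characters after WAR are always digits, as in the examples above. The names `ID_PATTERN`, `find_ids`, and `sample` are made up for illustration and are not part of any ctx function.

```python
import re

# Assumption: an ID is the literal prefix "WAR" followed by exactly six digits.
# Widen the character class (e.g. [0-9A-Z]) if letters can also appear.
ID_PATTERN = re.compile(r"\bWAR\d{6}\b")


def find_ids(text):
    """Return every ID occurrence in document order, duplicates included."""
    return ID_PATTERN.findall(text)


# Hypothetical sample standing in for text extracted from the PDF.
sample = "Page 1 ... WAR123123 ... Page 3 ... WAR010101 ... WAR987654"
print(find_ids(sample))
```

The word boundaries keep the pattern from matching inside longer tokens, so something like WAR1234567 is skipped rather than truncated to nine characters.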
Thanks, Tony