12 Replies Latest reply: Jun 4, 2013 5:22 PM by Yannis_r2 RSS

    Searching imported PDF with Greek char through APEX does not return results

    Yannis_r2
      Hi,

      I am using the old good DocLib Apex application to have a nice document management system (I wonder if there will be a newer version anytime soon)
      Lately I uploaded Scanned PDF containing Greek characters.
      I realized that the search within ODF is not working at all!!!!!

      I have Apex 4.2 on Oracle XE 11g beta
      I use Oracle text to create an index to do full text search within files.

      Here is how the index is created:

      begin
      ctx_ddl.drop_preference ('docman_multi');
      ctx_ddl.create_preference('docman_multi', 'MULTI_COLUMN_DATASTORE');
      ctx_ddl.set_attribute('docman_multi', 'columns', 'CONTENTS, DESCRIPTION, CONTENTS, ATTR1 , ATTR2, ATTR3, RECEPIENTS, TAGS, COMMENTS, PROTOCOL_NO');
      end;

      create index DOCLIB_DOCS_IDX2 on DOCLIB_DOCUMENTS (CONTENTS)
      indextype is ctxsys.context
      parameters ('datastore docman_multi section group CTXSYS.AUTO_SECTION_GROUP stoplist ctxsys.empty_stoplist sync (on commit)')



      Then I use a search like:

      select * from doclib_documents
      Where contains( contents, convert(:P11_REPORT_SEARCH,'utf8'), 1)

      All PDFs are searchable.

      When I search words in English it works like charm
      But when I use Greek words, it returns no rows
      Any ideas or suggestions>

      Thank you