This discussion is archived
8 Replies Latest reply: Dec 14, 2011 1:08 AM by Roger Ford RSS

Oracle Content server source issues

user10285200 Newbie
Currently Being Moderated
Hi,

I have created a Oracle content server source in Oracle SES 11.1.2.2.0. when i run the scheduler for this source, it says document are fetched and indexed.
But when i try to search the the documets the results are not shown. Below is the log generated by scheduler.

===============================================================================================
19:06:24:835 INFO     main          =================== Crawling results ===================
19:06:24:835 INFO     main          Crawling started at 12/7/11 6:53 PM
19:06:24:835 INFO     main          Crawling stopped at 12/7/11 7:06 PM
19:06:24:835 INFO     main          Total crawling time = 0:13:12
19:06:24:835 INFO     main          
19:06:24:835 INFO     main          Source type is "Oracle Content Server"
19:06:24:835 INFO     main          Source is "XLUCM"
19:06:24:835 INFO     main          
19:06:24:835 INFO     main          Total number of documents fetched = 363
19:06:24:835 INFO     main          Document fetch failures = 0
19:06:24:835 INFO     main          Document conversion failures = 0
19:06:24:835 INFO     main          Total number of documents queued for indexing = 363
19:06:24:835 INFO     main          Total number of documents indexed = 363
19:06:24:835 INFO     main          Total data queued for indexing = 18,971,107 bytes
19:06:24:835 INFO     main          Total data indexed = 18,971,107 bytes
19:06:24:835 INFO     main          Total number of non-indexable documents = 0
19:06:24:835 INFO     main          Average size of indexed document = 52,262 bytes
19:06:24:835 INFO     main          
19:06:24:835 INFO     main          Total indexing time = 0:0:41 for 18,971,107 bytes of data
19:06:24:835 INFO     main          Number of documents queued for indexing per hour = 1,650
19:06:24:835 INFO     main          
19:06:24:835 INFO     main          Number of times disk cache is full = 0
===========================================================================================

Please help me in this and let me know if you need any other information.

thanks in advance.

Yogesh.
  • 1. Re: Oracle Content server source issues
    Roger Ford Expert
    Currently Being Moderated
    Probably a permissions issue.

    1/ Are you using an identity manager? Which one?
    2/ What user are you logging onto SES with?
    3/ Does that user definitely have access to the documents within Content Server?
  • 2. Re: Oracle Content server source issues
    user10285200 Newbie
    Currently Being Moderated
    Hi,

    Thanks for replay.

    1) I am not using and identity manager.
    2) I am logging with a default SES user eqsys.
    3) Runnig the scheduler with this default user. How can i check if the user has access to documents to the content server.
  • 3. Re: Oracle Content server source issues
    Roger Ford Expert
    Currently Being Moderated
    Oracle Content Server (OCS) is a secure source. Only users who have access to the documents will be able to see them in SES.

    So you need to configure SES to use the same identity manager as OCS (which might be the internal OCS directory or might be Oracle Internet Directory (OID) or even Microsoft Active Directory (AD).

    Then you will need to log into the SES query application as an authorized user before you can find documents indexed from OCS.

    Don't confuse the user logging into the Admin utility (EQSYS) with the user logging in to the query application.
  • 4. Re: Oracle Content server source issues
    user10285200 Newbie
    Currently Being Moderated
    Hi Roger,

    Thanks for reply.

    This is sample log for one document which is fetched and indexed in SES is

    19:05:25:413 INFO     filter_0          submitting doc...idcplg?IdcService=GET_FILE&dDocName=IFLEXPKW261IFL005207&allowInterrupt=1&Rendition=web&RevisionSelectionMethod=latestReleased
    19:05:25:413 INFO     filter_0          Processing idcplg?IdcService=GET_FILE&dDocName=IFLEXPKW261IFL005207&allowInterrupt=1&Rendition=web&RevisionSelectionMethod=latestReleased
    19:05:27:051 INFO     filter_0          Releasing lock on idcplg?IdcService=GET_FILE&dDocName=IFLEXPKW261IFL005207&allowInterrupt=1&Rendition=web&RevisionSelectionMethod=latestReleased
    19:05:27:051 INFO     filter_0          Retrieved idcplg?IdcService=GET_FILE&dDocName=IFLEXPKW261IFL005208&allowInterrupt=1&Rendition=web&RevisionSelectionMethod=latestReleased for thread filter_0
    19:05:27:051 INFO     filter_0          Returned idcplg?IdcService=GET_FILE&dDocName=IFLEXPKW261IFL005208&allowInterrupt=1&Rendition=web&RevisionSelectionMethod=latestReleased for thread filter_0
    19:05:27:051 INFO     filter_0          RSSMultiSAXQueue: Returned idcplg?IdcService=GET_FILE&dDocName=IFLEXPKW261IFL005208&allowInterrupt=1&Rendition=web&RevisionSelectionMethod=latestReleased for thread filter_0
    19:05:27:051 INFO     filter_0          URIHandler initialized for the URI http://IFLEXPKW261.i-flex.com:16200/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTENT&file=df2011-12-07-18-52-14-1673024875_content19.pdf&source=default
    19:05:27:956 INFO     filter_0          HTTP status code: 200

    I am using default SES query application to search the documents. The url for this is http://hostname:port/search/query/search

    Could you please advise if the above document is indexed in SES and what query do i need to enter in query application to search above doc?

    Yogesh
  • 5. Re: Oracle Content server source issues
    user10285200 Newbie
    Currently Being Moderated
    Hi Roger,
    Wanted to add one more thing here...

    Our OCS is using its internal directory for security purposes. We have not setup SES to use OCS users.
    If this is the problem, can you please guide how we can set this up.


    Thanks,
    Yogesh.
  • 6. Re: Oracle Content server source issues
    Roger Ford Expert
    Currently Being Moderated
    You do need to configure SES with this directory.

    Go to Global Settings -> Identity Management Setup in the SES admin utility.

    Choose Oracle Content Server and click "Activate"

    Enter the endpoint, typically http://yourserver/idc/idcplg

    Enter the OCS admin username (could be "idc")

    Enter the password

    Enter NATIVE for the Authentication Type

    Ignore the other settings. Click "Finish".

    Now when you go to the SES query screen, you should see a Login link at the top right, and you can login as any OCS user.
  • 7. Re: Oracle Content server source issues
    user10285200 Newbie
    Currently Being Moderated
    Hi Roger,
    Have done the security setting and now we can see the login link on the search page. But even after logging in using OCS ID we are not able to see any results for our search.

    Tried taking snapshot in OCS under SESCrawlerexport and running crawler in SES again, but even then no results. And note that crawler given status as below, which seems to be successful in indexing all content from OCS. Not sure where we are going wrong!!! ...please help.


    18:26:26:199 INFO     main          =================== Crawling results ===================
    18:26:26:199 INFO     main          Crawling started at 12/9/11 6:12 PM
    18:26:26:199 INFO     main          Crawling stopped at 12/9/11 6:26 PM
    18:26:26:199 INFO     main          Total crawling time = 0:14:15
    18:26:26:199 INFO     main          
    18:26:26:199 INFO     main          Source type is "Oracle Content Server"
    18:26:26:199 INFO     main          Source is "XLUCM"
    18:26:26:199 INFO     main          
    18:26:26:199 INFO     main          Total number of documents fetched = 368
    18:26:26:199 INFO     main          Document fetch failures = 0
    18:26:26:199 INFO     main          Document conversion failures = 0
    18:26:26:199 INFO     main          Total number of documents queued for indexing = 368
    18:26:26:199 INFO     main          Total number of documents indexed = 368
    18:26:26:199 INFO     main          Total data queued for indexing = 20,009,506 bytes
    18:26:26:199 INFO     main          Total data indexed = 20,009,506 bytes
    18:26:26:199 INFO     main          Total number of non-indexable documents = 0
    18:26:26:199 INFO     main          Average size of indexed document = 54,373 bytes
    18:26:26:199 INFO     main          
    18:26:26:199 INFO     main          Total indexing time = 0:1:2 for 20,009,506 bytes of data
    18:26:26:199 INFO     main          Bytes indexed per hour = 1,161,842,283
    18:26:26:199 INFO     main          Number of documents queued for indexing per hour = 1,549
    18:26:26:199 INFO     main          
    18:26:26:199 INFO     main          Number of times disk cache is full = 0


    Thanks,
    Yogesh
  • 8. Re: Oracle Content server source issues
    Roger Ford Expert
    Currently Being Moderated
    If the OCS user you're logging on with should have access rights to the document in OCS, then he should be able to search for them in SES. I think at this point you probably need to raise an SR with Oracle Support so they can help you figure out why it isn't working.

Legend

  • Correct Answers - 10 points
  • Helpful Answers - 5 points