This content has been marked as final. Show 21 replies
I'm confused too.
Data Miner 3.2.
SQL Developer 3.2.09
Java(TM) Platform 1.6.0_35
Oracle IDE 3.2.09.30
Versioning Support 3.2.09.30
Oracle Database 11g Enterprise Edition Release 18.104.22.168.0 64 bit version.
All this installed on Win 7 Enterprise.
Should i try reinstall the db and miner?
We would like to try text mining and text clustering in our company. We would like to make cluster related to themes rather then to tokens.
The theme generation only works in English and French locales. I just want to confirm that, but it seems your locale is English, correct?
I guess you can reinstall the system and see if thing will improve.
If you decide to reinstall, try install the latest db and Data Miner.
I changed locale to US/English and, it works now. Thanks for Your help.
What i'm not happy about is, if i can successfullly run the text mining on others languages then english and french locales? I know, that we need knowledge base for our language. But now i see, there are some other limitations. Is there a chance that the theme generation will be working on other locales then english/french. Can locale oracle branch (Czech Republic/Prague) help me with this?
Once more, Thank You.
Currently, Data Miner only support default knowledge base (English and French).
However, you can add custom knowledge base to the Oracle Text (for your own processing). See doc below:
We may consider adding custom knowledge base support in the Data Miner in future release.
So if i understand, when we prepare another knowledge base, it won't work with data miner , only with oracle text? Right?
Correct, Data Miner only supports English and French knowledge base.
However, there is a chance it may work if export the workflow and make a minor change in the XML file and import it back to the Data Miner (this is not supported though, just a workaround maybe, but not guarantee it will work):
<Token Policy="" StoplistId="1" Frequency="IDF" MaxNumberAllDocs="3000" MaxNumberPerDoc="50">
<Lexer Type="Basic" Name="">
<Attribute Type="String" ValueString="CZECH" Name="THEME_LANGUAGE"/> // ADD THIS LINE TO TELL ORACLE TEXT TO USE CUSTOM KNOWLEDGE BASE FOR CZECH
<Language Type="SingleByte" Name="CZECH"/> // CHANGE THE LANGUAGE NAME TO CZECH