This content has been marked as final. Show 1 reply
Being new to mining you have really set off on a ambitious mining project :)
Couple of technical pointers:
*1) Version of Data Miner being Used*
You are using the original Data Miner release.
I would download the latest SQL Dev release that contains the current Data Miner client and repository installation.
SQL Developer 3.2.2 RTM Version 3.2.20.09 Build MAIN-09.87
Drop the old repository and start with this latest one, assuming you are just getting started and have no significant mining worklows created.
You can always export the workflows to disk if you want to import them to the repository.
Alternatively you can migrate the older repository, but I would avoid unless you really need to, as it requires Data Miner to hold on to some older repository definitions.
*2) Handling of text*
It seems your primary source of data for the clustering process will be the cs_uri_query.
You might find better results processing it as text data rather than as categorical data.
You can use the Build Text node to transform cs_uri_query into a nested column that contains text tokens.
*3) Methodology definition*
This is probably your biggest challenge really.
What is the overall methodology to produce the desired result.
You stated your objective is: develop an intelligent recommend model based on queries recorded in the web log
Once you create clusters from this data, what are your next steps?
What type of recommendation do you want to generate?