user962305 wrote:Take a look in the oracle.kv.hadoop.KVInputFormat javadoc. It discusses how to use Oracle NoSQL Database with Hadoop as well as referring to an example which is included in the distribution.
I would like to know if there are some examples on how to run run map/reduce with Oracle NoSQL.
Is there any source code any where? Can you send me one example?
Where can we download all the necessary tools?It would be used with both, whether or not you were on the BDA. You use the KVInputFormat to read data from Oracle NoSQL Database into Hadoop during map/reduce processing.
In Oracle Big Data Appliance is map/reduce used with Oracle NoSQL or with Hadoop?
user962305 wrote:Download [Oracle NoSQL Database from OTN|http://www.oracle.com/technetwork/database/nosqldb/downloads/index.html] .
Can you please, explain where and what to download and install for this case?
Should we also install hadoop on the same replication nodes as Oracle NoSQL?It depends on your access patterns. In general, probably not, but there may be cases where you achieve better performance with Hadoop and the Rep Nodes co-located.
Is it possible to have an example with pre-loaded keys on Oracle NoSQL to perform the test?Look at the quickstart guide that comes with the above Oracle NoSQL Database package. There is a small HelloWorld example which you can use as the basis for creating a data set.
Is there a version of Oracle NoSQL which comes with some key/value pairs?
I understand the following. Data in Oracle NoSQL will be loaded in hadoop first and then map/reduce is performed in haddop. Is it right?Hadoop is a framework, which among other things happens to run Map/Reduce jobs. Your Map/Reduce job would use the KVInputFormat to read data from Oracle NoSQL Database and process it however it sees fit. It might write the output of the M/R to (say) HDFS. Or it might write it to (say) Oracle RDBMS. Or it might write it back to (say) Oracle NoSQL Database.
I would like to know: Does that mean Oracle NoSQL can not run parallel operations? What is the aim in loading data to hadoop first if Oracle is able do perform parallel operations? Loading data from Oracle NoSQL to hadoop may take enormous time I suppose.I am not sure I understand your question. Hadoop, by its nature will break a job into many subtasks. Those subtasks run in parallel, generally across many Hadoop nodes. Those subtasks may access Oracle NoSQL Database data. Hence, Oracle NoSQL Database is able to perform operations in parallel either on the same or different Rep Nodes.