Ken - 

Riak contains its own implementation of a MapReduce API. This API is written in 
Erlang and C and shares no code with Hadoop. We have, on occasion, described 
our API as "Hadoop like" to help people understand the differences between our 
implementation and others. For example:

* CouchDB's MapReduce incrementally builds a BTree thru the execution of views. 
Riak doesn't cache or store MapReduce results. Each job executes "from scratch" 
and returns results to caller. It is up to the caller to cache or store these 
results if needed.

* Hadoop MapReduce jobs use HDFS or HDFS-adapted resources. Riak MapReduce jobs 
use data stored in a Riak cluster.

* Riak MapReduce jobs take advantage of their execution environment. Map 
functions exploit data locality. This means map functions are sent to the node 
hosting the required data rather than streaming the data to a central 
coordinating node. The most recent version of Riak also includes improvements 
which has significantly boosted the efficiency of mapping over entire buckets 
of data.

* Riak supports writing MapReduce functions in two languages: Erlang and 
Javascript. Erlang is the absolute fastest, in terms of raw speed, but 
Javascript runs a close second and is easier for most people to use.

Depending on what your jobs are doing and which language, Erlang or Javascript, 
there are a few tunable parameters we can tweak to improve performance. More 
information about what your job is doing, how your cluster is set up, and the 
kinds of performance you're seeing would help us debug the situation.

--Kevin
On Sep 7, 2010, at 10:52 PM, Ken Matsumoto wrote:

> Hi all,
> 
> Is there difference between "MapReduce in Hadoop" and "MapReduce in RIAK"?
> I tested these performances and there were difference even in the same 
> machine, same data.
> I'd like to know the difference like:
> 
> - Version? Which is newer or branched?
> - Does RIAK build in on source level or binary?
> - Number of performance tuning parameters?
> - What else?
> 
> Regards,
> 
> Ken.
> 
> -- 
> Ken Matsumoto
> VP / Research & Development
> Nomura Research Institute America, Inc.
> NRI Pacific
> 1400 Fashion Island Blvd., Suite 1010
> San Mateo, CA 94404, U.S.A.
> 
> PLEASE READ:This e-mail is confidential and intended for the named recipient 
> only. If you are not an intended recipient, please notify the sender and 
> delete this e-mail.
> 
> 
> _______________________________________________
> riak-users mailing list
> [email protected]
> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com


_______________________________________________
riak-users mailing list
[email protected]
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to