[jira] [Commented] (HAMA-990) GSoC'16: Apache Hama benchmark against Spark and Flink

Behroz Sikander (JIRA) Thu, 19 May 2016 18:33:07 -0700

    [ 
https://issues.apache.org/jira/browse/HAMA-990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292519#comment-15292519
 ]


Behroz Sikander commented on HAMA-990:
--------------------------------------

>> With this, we may be able to derive insight from the results (this should be 
>> our goal)
OK. What kind of insights are you interested in?

>> I think I heard that Flink uses its own serialization techniques and shows 
>> good performance but is unstable.
OK.

>> Just FYI, MRQL also can be used for K-Means and PageRank.
Yes, we can use that. Good idea.

>> If you can write some scripts that make it possible to auto-produce 
>> benchmark results on clouds such as Amazon or Google cloud, I can help.

I have never written a script like this before, but I am interested in learning 
how. Would the script install HDFS/Hama/Spark/Flink, then execute the commands 
for each job (K-Means, PageRank, etc.), and finally copy the results back to a 
local directory? Also, should it be a simple bash script or a Chef/Puppet 
script?
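
To make the question concrete, here is a minimal sketch of only the
"execute the jobs and collect the results" part, written in Python rather
than bash or Chef/Puppet purely for illustration. Everything in it is an
assumption: the `JOBS` table, the `run_benchmarks` helper, and the CSV
layout are hypothetical, and each command is a no-op placeholder that would
have to be replaced by the real submission command for each framework.
Installing HDFS/Hama/Spark/Flink and copying files back from cloud nodes
are not covered.

```python
import csv
import subprocess
import sys
import time
from pathlib import Path

# Placeholder command: a Python no-op. It stands in for the real job
# submission line, which this sketch deliberately does not guess at.
NOOP = [sys.executable, "-c", "pass"]

# Hypothetical job table: (framework, job) -> command to execute.
JOBS = {
    ("hama", "kmeans"): NOOP,
    ("spark", "kmeans"): NOOP,
    ("flink", "kmeans"): NOOP,
    ("hama", "pagerank"): NOOP,
    ("spark", "pagerank"): NOOP,
    ("flink", "pagerank"): NOOP,
}

FIELDS = ["framework", "job", "attempt", "seconds", "exit_code"]


def run_benchmarks(jobs, results_file, repeats=3):
    """Run each command `repeats` times and record wall-clock time."""
    rows = []
    for (framework, job), cmd in jobs.items():
        for attempt in range(1, repeats + 1):
            start = time.perf_counter()
            proc = subprocess.run(cmd, capture_output=True, text=True)
            elapsed = time.perf_counter() - start
            rows.append({
                "framework": framework,
                "job": job,
                "attempt": attempt,
                "seconds": round(elapsed, 3),
                "exit_code": proc.returncode,
            })

    # Write all measurements to one CSV file in a local directory.
    results_file = Path(results_file)
    results_file.parent.mkdir(parents=True, exist_ok=True)
    with results_file.open("w", newline="") as fh:
        writer = csv.DictWriter(fh, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(rows)
    return rows


if __name__ == "__main__":
    for row in run_benchmarks(JOBS, "results/benchmark.csv"):
        print(row)
```

Repeating each job several times and keeping the exit code next to the
timing makes it easier to spot failed or unstable runs before comparing
the frameworks.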

> GSoC'16: Apache Hama benchmark against Spark and Flink
> ------------------------------------------------------
>
>                 Key: HAMA-990
>                 URL: https://issues.apache.org/jira/browse/HAMA-990
>             Project: Hama
>          Issue Type: Documentation
>            Reporter: Behroz Sikander
>            Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
