Hi Vikalp,

finally I got to run Giraph successfully :-)

Following steps:
#Preparation

 *

   su

 *

   rm -rf /usr/local/giraph; mkdir /usr/local/giraph

 *

   chown hadoop:hadoop /usr/local/giraph

 *

   mkdir ~/setup; cd ~/setup

#Sourcecode (Version 1.1.0)

 *

   git clone https://git-wip-us.apache.org/repos/asf/giraph.git

 *

   mv giraph giraph_src; cd giraph_src

 *

   mvn clean

 *

   mvn -Phadoop_yarn -Dhadoop.version=2.4.0 -DskipTests package

#Installation (little ugly)

 *

   rm -rf myGiraph; mkdir -p myGiraph/lib; mkdir myGiraph/source

 *

   find . -name \*.jar -exec cp {} myGiraph/lib \;

 *

   cp -r conf myGiraph

 *

   cp -r bin myGiraph

 *

   cp CH* CO* RE* LI* NO* myGiraph

 *

   cd myGiraph

 *

   mv lib/*source*.jar source

 *

   mv lib/giraph*.jar .

 *

   rename 's/-SNAPSHOT-for//' *.jar

 *

   rm -rf ../myGiraph; cd ..; mv myGiraph ..

 *

   cd ../myGiraph; rm -rf /usr/local/giraph/*; cp -r * /usr/local/giraph

Now you have to put a variable into hadoop-env.sh (at the end is ok):

     o

       /MY_LIBS="$HADOOP_HOME/share/myLib/*.jar"/

     o

       /export HADOOP_TASKTRACKER_OPTS="-classpath ${MY_LIBS}
       ${HADOOP_TASKTRACKER_OPTS}"/

The folder myLib you have to create and put the Giraph-Jar:

 * mkdir /usr/local/hadoop/share/myLib

 *

   cp
   
/usr/local/giraph/giraph-examples-1.1.0-hadoop-2.4.0-jar-with-dependencies.jar
   /usr/local/hadoop/share/myLib

#Configuration
Put
<property>
<name>mapred.map.tasks</name>
<value>4</value>
</property>

into mapred-site.xml

 *

   echo export PATH=\$PATH:/usr/local/giraph/bin >> ~/.bashrc

 *

   echo export GIRAPH_HOME=/usr/local/giraph >> ~/.bashrc

 *

   source ~/.bashrc

Now you can test:
#creating testdata

 *

   rm -rf ~/giraph; mkdir ~/giraph; cd ~/giraph

 * cat << EOF > tiny_graph.txt
 * [0,0,[[1,1],[3,3]]]
   [1,0,[[0,1],[2,2],[3,1]]]
   [2,0,[[1,2],[4,4]]]
   [3,0,[[0,3],[1,1],[4,4]]]
   [4,0,[[3,4],[2,4]]]
 * EOF

#Run example-Jar

 *

   jar=giraph-examples-1.1.0-hadoop-2.4.0-jar-with-dependencies.jar

 *

   runner=org.apache.giraph.GiraphRunner

 *

   computation=org.apache.giraph.examples.SimpleShortestPathsComputation

 *

   
informat=org.apache.giraph.io.formats.JsonLongDoubleFloatDoubleVertexInputFormat

 *

   outformat=org.apache.giraph.io.formats.IdWithValueTextOutputFormat

 *

   hdfs dfs -rm -r shortestpaths

 *

   yarn jar $GIRAPH_HOME/$jar $runner -Dgiraph.yarn.task.heap.mb=3000
   $computation -vif $informat -vip tiny_graph.txt -vof $outformat -op
   shortestpaths -w 1 -yj $jar

 *

   hdfs dfs -cat shortestpaths/*


Finished!! Hope it functions on your site.

Best regards,
Alexander




On 12.08.2014 15:52, Vikalp Handa wrote:
Hi Alexander,

I looked into my gam-stderr.log file and found *Error: Could not find or load main class org.apache.giraph.yarn.GiraphApplicationMaster* As you have already mentioned in your reply about that for loop in *hadoop-env.sh. * So can you please tell me what is *mylib *and***share/myLib/giraph** ? *Because I am having only doc and hadoop directories inside $HADOOP_HOME/share/

Also how to get *giraph-1.1.0-hadoop-2.4.0.jar *and*giraph-examples-1.1.0-hadoop-*2.4.0.jar (without the dependencies) as I am having only giraph-examples-1.1.0-SNAPSHOT-for-hadoop-2.4.0-jar-with-dependencies.jar inside giraph-examples/target/munged/.





Reply via email to