Hi All,
I am running Decision forest in Mahout, below are the commands that I have used to implement the algo: Info file: mahout org.apache.mahout.df.tools.Describe -p /user/an32665/KDD/KDDTrain+.arff -f /user/an32665/KDD/KDDTrain+.info -d N 3 C 2 N C 4 N C 8 N 2 C 19 N L Building Forest: mahout org.apache.mahout.df.mapreduce.BuildForest -Dmapred.max.split.size=1874231 -oob -d /user/an32665/KDD/KDDTrain+.arff -ds /user/an32665/KDD/KDDTrain+.info -sl 5 -p -t 100 -o nsl-forest Testing Forest: mahout org.apache.mahout.df.mapreduce.TestForest -i /user/an32665/KDD/KDDTest+.arff -ds /user/an32665/KDD/KDDTrain+.info -m nsl-forest -a -mr -o predictions So while building the forest we use "-P" for implementing partial implementation. I just wanted to know the difference in algorithm when we use "-p" and when we don't use "-p". Regards, Akshay Nowal