After 9 hours of compute, it failed. It never got past the colloc combiner pass :(
I will have to tag Drew along to identify the possible cause of this out-of-memory error:

java.lang.OutOfMemoryError: Java heap space
    at org.apache.mahout.utils.nlp.collocations.llr.Gram.<init>(Gram.java:67)
    at org.apache.mahout.utils.nlp.collocations.llr.CollocCombiner.reduce(CollocCombiner.java:62)
    at org.apache.mahout.utils.nlp.collocations.llr.CollocCombiner.reduce(CollocCombiner.java:30)
    at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.combineAndSpill(MapTask.java:921)
    at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.mergeParts(MapTask.java:1077)
    at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.flush(MapTask.java:719)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:233)
    at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2216)