Trevor Grant created MAHOUT-1936: ------------------------------------ Summary: FactorMap finds column maximums incorrectly on large data sets Key: MAHOUT-1936 URL: https://issues.apache.org/jira/browse/MAHOUT-1936 Project: Mahout Issue Type: Bug Components: Algorithms Affects Versions: 0.13.0 Reporter: Trevor Grant Fix For: 0.13.0
FactorMap's fit method does not properly find the maximum of the column. Likely due to an impropper allreduceBlock here https://github.com/apache/mahout/blob/master/math-scala/src/main/scala/org/apache/mahout/math/algorithms/preprocessing/AsFactor.scala#L40 Also, factorMap in this instance might be more appropriately named "factorMax" -- This message was sent by Atlassian JIRA (v6.3.15#6346)