[ 
https://issues.apache.org/jira/browse/MAHOUT-1368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13838638#comment-13838638
 ] 

Suneel Marthi commented on MAHOUT-1368:
---------------------------------------

Ted, we need to hold off on committing this patch until we fix the issue with 
ClusterQualitySummarizer which is broken after applying this patch.  I'll look 
at it tomorrow, too late in the night to wrap my head around the issue.

> Convert OnlineSummarizer to use the new TDigest
> -----------------------------------------------
>
>                 Key: MAHOUT-1368
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1368
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Ted Dunning
>             Fix For: 0.9
>
>         Attachments: MAHOUT-1368.patch
>
>
> The new TDigest provides better accuracy for quartile estimation as well as 
> producing any other quantile you might like.  The current quartile estimation 
> of the OnlineSummarizer fails for highly skewed distributions and can't 
> really be extended to provide other quantiles.  The TDigest handles all of 
> this.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to