[ https://issues.apache.org/jira/browse/DATAFU-16?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13878635#comment-13878635 ]
jian wang commented on DATAFU-16:
---------------------------------

According to Matt's feedback on review request https://reviews.apache.org/r/17058/, we need to rethink how we implement the reservoir sample with exponential jumps. I will do an offline simulation by this weekend.

> weighted reservoir sampling with exponential jumps UDF
> ------------------------------------------------------
>
>                 Key: DATAFU-16
>                 URL: https://issues.apache.org/jira/browse/DATAFU-16
>             Project: DataFu
>          Issue Type: New Feature
>         Environment: Mac, Linux
>                      pig-0.11
>            Reporter: jian wang
>
> Create a weightedReservoirSampleWithExpJump UDF to implement the weighted
> reservoir sampling algorithm with exponential jumps. Investigation is tracked
> in https://github.com/linkedin/datafu/issues/80. This task is part of an
> experiment with different weighted sampling algorithms.

--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
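
For readers unfamiliar with the algorithm named in the issue, here is a minimal offline simulation sketch in Python. It assumes the "A-ExpJ" variant of weighted reservoir sampling described by Efraimidis and Spirakis, and it is not the Pig UDF under review in r/17058. The function name `weighted_reservoir_sample_exp_jump` and the `(value, weight)` input shape are hypothetical, chosen only for illustration. Keys are kept in log space (log(u) / weight) to avoid underflow; that is a choice made for this sketch.

```python
import heapq
import math
import random


def weighted_reservoir_sample_exp_jump(items, k, rng=None):
    """Sample up to k values from an iterable of (value, weight) pairs.

    Sketch of weighted reservoir sampling with exponential jumps.
    All weights must be positive. Hypothetical helper, not DataFu code.
    """
    if k <= 0:
        return []
    rng = rng or random.Random()
    heap = []  # min-heap of (log_key, index, value); index breaks ties
    stream = iter(enumerate(items))

    def log_key(weight, low=0.0):
        # log of a key drawn as r ** (1 / weight), r uniform in (low, 1]
        if weight <= 0:
            raise ValueError("weights must be positive")
        u = 1.0 - rng.random()  # in (0, 1]
        return math.log(low + (1.0 - low) * u) / weight

    def next_jump():
        # Total weight to skip before the next replacement.
        min_log_key = heap[0][0]
        if min_log_key >= 0.0:  # smallest key is 1: nothing can replace it
            return math.inf
        return math.log(1.0 - rng.random()) / min_log_key

    # Phase 1: fill the reservoir with the first k items.
    for index, (value, weight) in stream:
        heapq.heappush(heap, (log_key(weight), index, value))
        if len(heap) == k:
            break
    if len(heap) < k:
        return [value for _, _, value in heap]

    # Phase 2: jump over items until the skipped weight is used up.
    jump = next_jump()
    for index, (value, weight) in stream:
        if weight <= 0:
            raise ValueError("weights must be positive")
        jump -= weight
        if jump <= 0:
            # New key is drawn from (threshold ** weight, 1] so that it
            # exceeds the current smallest key in the reservoir.
            low = math.exp(heap[0][0] * weight)
            heapq.heapreplace(heap, (log_key(weight, low), index, value))
            jump = next_jump()
    return [value for _, _, value in heap]
```

With k = 1 the chance that an item is returned is its weight divided by the total weight, which gives a quick sanity check for an offline simulation: run many trials with a fixed seed and compare the observed frequencies against that ratio.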