Chengxiang Li created FLINK-2535:
------------------------------------
Summary: Fixed size sample algorithm optimization
Key: FLINK-2535
URL: https://issues.apache.org/jira/browse/FLINK-2535
Project: Flink
Issue Type: Improvement
Components: Core
Reporter: Chengxiang Li
Priority: Minor
Fixed size sample algorithm is known to be less efficient than sample
algorithms with fraction, but sometime it's necessary. Some optimization could
significantly reduce the storage size and computation cost, such as the
algorithm described in [this
paper|http://machinelearning.wustl.edu/mlpapers/papers/icml2013_meng13a].
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)