GitHub user dwmclary opened a pull request: https://github.com/apache/spark/pull/122
SPARK-1170-pyspark-histogram: added histogram method, added max and min to statscounter Here's an implementation of histogram which relies on a small change to statscounter and returns a defaultdict of tuples. You can merge this pull request into a Git repository by running: $ git pull https://github.com/dwmclary/spark SPARK-1170-pyspark-histogram Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/122.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #122 ---- commit 491601600a09de1f7f3b46b7459e320d4933cb08 Author: Dan McClary <dan.mccl...@gmail.com> Date: 2014-03-11T06:58:32Z added histogram method, added max and min to statscounter commit eaf89d957e84d3b926f6c5f3f65acb8764c7ec2f Author: Dan McClary <dan.mccl...@gmail.com> Date: 2014-03-11T07:34:30Z added correct doctest for histogram ---- --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---