Re: computing median and percentiles

2010-02-24 Thread tom kersnick
Jerome... Excellent. That would be the most straightforward. Thanks for the insight. /tom On Wed, Feb 24, 2010 at 10:08 AM, Jerome Boulon wrote: > The 50th percentile would be the median. > Does that answer your question? > /Jerome. > > > On 2/24/10 8:41 AM, "tom kersnick" wrote: > > Gents,

Re: computing median and percentiles

2010-02-24 Thread Jerome Boulon
The 50th percentile would be the median. Does that answer your question? /Jerome. On 2/24/10 8:41 AM, "tom kersnick" wrote: > Gents, > > Any word on this? I'm very curious about how you plan to calculate the median. There > are many flavors. > > I'm hoping that you take the simplistic approach on this

Re: computing median and percentiles

2010-02-24 Thread tom kersnick
Gents, Any word on this? I'm very curious about how you plan to calculate the median. There are many flavors. I'm hoping that you take the simplistic approach on this task. Thanks in advance. /tom On Fri, Feb 19, 2010 at 2:12 PM, Zheng Shao wrote: > Hi Jerome, > > Is there any update on this? > htt

Re: computing median and percentiles

2010-02-19 Thread Zheng Shao
Hi Jerome, Is there any update on this? https://issues.apache.org/jira/browse/HIVE-259 Zheng On Fri, Feb 5, 2010 at 9:34 AM, Jerome Boulon wrote: > Hi Bryan, > I'm working on Hive-259. I'll post an update early next week. > /Jerome. > > > On 2/4/10 9:08 PM, "Bryan Talbot" wrote: > >> What's th

Re: computing median and percentiles

2010-02-05 Thread Jerome Boulon
Hi Bryan, I'm working on Hive-259. I'll post an update early next week. /Jerome. On 2/4/10 9:08 PM, "Bryan Talbot" wrote: > What's the best way to compute median and other percentiles using Hive 0.40? > I've run across http://issues.apache.org/jira/browse/HIVE-259 but there > doesn't seem to be

Re: computing median and percentiles

2010-02-04 Thread Zheng Shao
I would say, just create a histogram of (value, count) pairs, sort at the end, and return the value at the percentile. This assumes that the number of unique values is not large, which can be easily enforced by using round(number, digits). Zheng On Thu, Feb 4, 2010 at 9:08 PM, Bryan Talbot wrote: > What's the
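
For readers following the thread, the histogram approach described in the message above can be sketched outside Hive as follows. This is only an illustration in Python, not the HIVE-259 implementation: the function name `percentile_from_histogram`, its parameters, and the choice of the nearest-rank definition of a percentile are all assumptions made for the example (as noted elsewhere in the thread, there are several flavors). Rounding with `round(value, digits)` keeps the number of distinct histogram keys small.

```python
import math
from collections import Counter


def percentile_from_histogram(values, p, digits=2):
    """Nearest-rank percentile computed from a (value, count) histogram.

    Hypothetical helper for illustration only; `p` is in (0, 100].
    """
    if not values:
        raise ValueError("values must be non-empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in (0, 100]")

    # Build the histogram; rounding bounds the number of unique keys.
    hist = Counter(round(v, digits) for v in values)
    total = sum(hist.values())

    # Nearest-rank: the smallest rank covering at least p percent of rows.
    rank = math.ceil(p / 100 * total)

    # Sort the (value, count) pairs once, then walk the cumulative count.
    seen = 0
    for value, count in sorted(hist.items()):
        seen += count
        if seen >= rank:
            return value


if __name__ == "__main__":
    data = [1, 2, 3, 4, 5]
    print(percentile_from_histogram(data, 50))   # the median under nearest-rank
    print(percentile_from_histogram(data, 100))  # the maximum
```

Under this definition the 50th percentile plays the role of the median; an interpolating definition would give a different answer for even-sized inputs.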

computing median and percentiles

2010-02-04 Thread Bryan Talbot
What's the best way to compute median and other percentiles using Hive 0.40? I've run across http://issues.apache.org/jira/browse/HIVE-259 but there doesn't seem to be any planned implementation yet. -Bryan