Trending patterns

2011-06-18 Thread Mark
Sorry if this isn't the right place to ask but how would I go about finding trending data over a certain period of time. For example: http://www.ebay.com has a section "Trends on eBay" that is updated daily. I was wondering how this can be accomplished using Mahout (if possible) For input I

Re: Trending patterns

2011-06-18 Thread Josh Patterson
I think the most simple way to do this woulds be to bucket/group-by the timestamp and then look for the most frequent search/item/product in each bucket. Fairly simple MapReduce job. Josh On Sat, Jun 18, 2011 at 1:52 PM, Mark wrote: > Sorry if this isn't the right place to ask but how would I go

Re: Trending patterns

2011-06-19 Thread Ted Dunning
Two things will help in addition to what Josh suggested: a) when looking for items that are trending hot, use the difference in the log rank as a score. For most internetly things, rank is proportional to 1/rate so log rank is -log rate. Refining this slightly to -log (epsilon + 1/rank) makes th

Re: Trending patterns

2011-06-21 Thread Mat Kelcey
two of my most popular blog posts are on this exactly! a trending topics algorithm (note: not a _frequent_ topics algorithm, a _trending_ topics algorithm) http://matpalm.com/blog/2010/04/27/trending-topics-in-tweets-about-cheese-part1/ and implemented in pig http://matpalm.com/blog/2010/05/01/t