For binary classification, any click-through data (like online ad click-through data) is extremely unbalanced. Of the order of <0.5% positive examples.
Yahoo has some large data sets of this nature, that can be downloaded free for research purposes from Yahoo Research (I think it's research.yahoo.com) N On 9 Mar 2012, at 09:35, Venkatesh U <venkates...@gmail.com> wrote: > Dear friends, > I am working on an algorithm which works well on imbalanced data, I need > some data sets available in public domain which I can use to test my > algorithm for addressing class imbalance. Any pointers to data sets with > class imbalance appreciated. > > Thanks, > Venkatesh