For binary classification, any click-through data (like online ad click-through 
data) is extremely unbalanced. Of the order of <0.5% positive examples.

Yahoo has some large data sets of this nature, that can be downloaded free for 
research purposes from Yahoo Research (I think it's research.yahoo.com)

N

On 9 Mar 2012, at 09:35, Venkatesh U <venkates...@gmail.com> wrote:

> Dear friends,
> I am working on an algorithm which works well on imbalanced data, I need
> some data sets available in public domain which I can use to test my
> algorithm for addressing class imbalance. Any pointers to data sets with
> class imbalance appreciated.
> 
> Thanks,
> Venkatesh

Reply via email to