Check out http://infochimps.org/datasets for a number of free,
moderately sized datasets. They won't be terabytes in size, but
they're a good place to start if you're looking for real data to play
with.

On Mon, Apr 5, 2010 at 7:37 PM, Zhanlei Ma <[email protected]> wrote:
> Hi all:
>    When testing Hadoop, it is hard to find good examples and large
> test data. Terasort is good, but the data generated by teragen is not
> stable. Does anyone have other good examples and data sources to
> recommend? Thanks.
>



-- 
Eric Sammer
phone: +1-917-287-2675
twitter: esammer
data: www.cloudera.com
