Check out http://infochimps.org/datasets for a number of free, moderately sized, datasets. They won't be terabytes in size, but they're a good place to start if you're looking for real data to play with.
On Mon, Apr 5, 2010 at 7:37 PM, Zhanlei Ma <[email protected]> wrote: > HI all: > In the test of hadoop, it is a problem to find the good example and the > test big data.Terasort is good, but the data generated by teragen is not > stable. Does Anyone has other good examples and the data source to recommend > me. Thanks for you. > -- Eric Sammer phone: +1-917-287-2675 twitter: esammer data: www.cloudera.com
