Hi Sameer,
I've faced this issue before. They don't show up on
http://s3.amazonaws.com/big-data-benchmark/. But you can directly use:
`sc.textFile(s3n://big-data-benchmark/pavlo/text/tiny/crawl)`
The gotcha is that you also need to supply which dataset you want: crawl,
uservisits, or rankings
Hi Burak,Thanks, I will then start benchmarking the cluster.
Date: Wed, 27 Aug 2014 11:52:05 -0700
From: bya...@stanford.edu
To: ssti...@live.com
CC: user@spark.apache.org
Subject: Re: Amplab: big-data-benchmark
Hi Sameer,
I've faced this issue before. They don't show up on
http