S3 will add network latency, since the data has to be fetched over the network. With HDFS, if your Spark executors are running on the same nodes as the DataNodes, you have the advantage of data locality.
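If you want to see how big the difference is on your own cluster, a rough timing comparison along these lines might help. This is only a sketch: `time_action` and `compare_reads` are hypothetical helper names, `sc` is assumed to be an existing SparkContext, and the two paths are whatever S3 and HDFS locations hold the same text data.

```python
import time


def time_action(action, repeats=3):
    """Run a zero-argument callable `repeats` times; return the best wall-clock time in seconds."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        action()
        timings.append(time.perf_counter() - start)
    return min(timings)


def compare_reads(sc, s3_path, hdfs_path, repeats=3):
    """Time a full scan of the same text data from S3 and from HDFS.

    `sc` is assumed to be a SparkContext; both paths are supplied by the caller.
    count() is used because it forces Spark to read every line.
    """
    return {
        "s3": time_action(lambda: sc.textFile(s3_path).count(), repeats),
        "hdfs": time_action(lambda: sc.textFile(hdfs_path).count(), repeats),
    }
```

You would call it as `compare_reads(sc, <your S3 path>, <your HDFS path>)` and compare the two numbers. The result depends heavily on file sizes, cluster placement and network bandwidth, so treat it as a rough indication only.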
Thanks
Best Regards

On Thu, Jul 9, 2015 at 12:05 PM, Brandon White <bwwintheho...@gmail.com> wrote:

> Are there any significant performance differences between reading text
> files from S3 and hdfs?