Re: S3 SubFolder Write Issues

2015-03-12 Thread Harshvardhan Chauhan
I use s3n://BucketName/SomeFoler/OutputFolder   and it works for my app.

On Wed, Mar 11, 2015 at 12:14 AM, Akhil Das ak...@sigmoidanalytics.com
wrote:

 Does it write anything in BUCKET/SUB_FOLDER/output?

 Thanks
 Best Regards

 On Wed, Mar 11, 2015 at 10:15 AM, cpalm3 cpa...@gmail.com wrote:

 Hi All,

 I am hoping someone has seen this issue before with S3, as I haven't been
 able to find a solution for this problem.

 When I try to save as Text file to s3 into a subfolder, it only ever
 writes
 out to the bucket level folder
 and produces block level generated file names and not my output folder as
 I
 specified.
 Below is the sample code in Scala, I have also seen this behavior in the
 Java code.

  val out =  inputRdd.map {ir = mapFunction(ir)}.groupByKey().mapValues {
 x
 = mapValuesFunction(x) }
.saveAsTextFile(s3://BUCKET/SUB_FOLDER/output

 Any ideas on how to get saveAsTextFile to write to an S3 subfolder?

 Thanks,
 Chris



 --
 View this message in context:
 http://apache-spark-user-list.1001560.n3.nabble.com/S3-SubFolder-Write-Issues-tp21997.html
 Sent from the Apache Spark User List mailing list archive at Nabble.com.

 -
 To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
 For additional commands, e-mail: user-h...@spark.apache.org





-- 
*Harshvardhan Chauhan*  |  Software Engineer
*GumGum* http://www.gumgum.com/  |  *Ads that stick*
310-260-9666  |  ha...@gumgum.com


Re: S3 SubFolder Write Issues

2015-03-11 Thread Calum Leslie
You want the s3n:// (native) protocol rather than s3://. s3:// is a block
filesystem based on S3 that doesn't respect paths.

More information on the Hadoop site: https://wiki.apache.org/hadoop/AmazonS3

Calum.

On Wed, 11 Mar 2015 04:47 cpalm3 cpa...@gmail.com wrote:

 Hi All,

 I am hoping someone has seen this issue before with S3, as I haven't been
 able to find a solution for this problem.

 When I try to save as Text file to s3 into a subfolder, it only ever writes
 out to the bucket level folder
 and produces block level generated file names and not my output folder as I
 specified.
 Below is the sample code in Scala, I have also seen this behavior in the
 Java code.

  val out =  inputRdd.map {ir = mapFunction(ir)}.groupByKey().mapValues {
 x
 = mapValuesFunction(x) }
.saveAsTextFile(s3://BUCKET/SUB_FOLDER/output

 Any ideas on how to get saveAsTextFile to write to an S3 subfolder?

 Thanks,
 Chris



 --
 View this message in context: http://apache-spark-user-list.
 1001560.n3.nabble.com/S3-SubFolder-Write-Issues-tp21997.html
 Sent from the Apache Spark User List mailing list archive at Nabble.com.

 -
 To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
 For additional commands, e-mail: user-h...@spark.apache.org




Re: S3 SubFolder Write Issues

2015-03-11 Thread Akhil Das
Does it write anything in BUCKET/SUB_FOLDER/output?

Thanks
Best Regards

On Wed, Mar 11, 2015 at 10:15 AM, cpalm3 cpa...@gmail.com wrote:

 Hi All,

 I am hoping someone has seen this issue before with S3, as I haven't been
 able to find a solution for this problem.

 When I try to save as Text file to s3 into a subfolder, it only ever writes
 out to the bucket level folder
 and produces block level generated file names and not my output folder as I
 specified.
 Below is the sample code in Scala, I have also seen this behavior in the
 Java code.

  val out =  inputRdd.map {ir = mapFunction(ir)}.groupByKey().mapValues { x
 = mapValuesFunction(x) }
.saveAsTextFile(s3://BUCKET/SUB_FOLDER/output

 Any ideas on how to get saveAsTextFile to write to an S3 subfolder?

 Thanks,
 Chris



 --
 View this message in context:
 http://apache-spark-user-list.1001560.n3.nabble.com/S3-SubFolder-Write-Issues-tp21997.html
 Sent from the Apache Spark User List mailing list archive at Nabble.com.

 -
 To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
 For additional commands, e-mail: user-h...@spark.apache.org




S3 SubFolder Write Issues

2015-03-10 Thread cpalm3
Hi All, 

I am hoping someone has seen this issue before with S3, as I haven't been
able to find a solution for this problem.

When I try to save as Text file to s3 into a subfolder, it only ever writes
out to the bucket level folder
and produces block level generated file names and not my output folder as I
specified.
Below is the sample code in Scala, I have also seen this behavior in the
Java code.

 val out =  inputRdd.map {ir = mapFunction(ir)}.groupByKey().mapValues { x
= mapValuesFunction(x) }
   .saveAsTextFile(s3://BUCKET/SUB_FOLDER/output

Any ideas on how to get saveAsTextFile to write to an S3 subfolder?

Thanks,
Chris



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/S3-SubFolder-Write-Issues-tp21997.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org