Re: S3 SubFolder Write Issues
I use s3n://BucketName/SomeFoler/OutputFolder and it works for my app. On Wed, Mar 11, 2015 at 12:14 AM, Akhil Das ak...@sigmoidanalytics.com wrote: Does it write anything in BUCKET/SUB_FOLDER/output? Thanks Best Regards On Wed, Mar 11, 2015 at 10:15 AM, cpalm3 cpa...@gmail.com wrote: Hi All, I am hoping someone has seen this issue before with S3, as I haven't been able to find a solution for this problem. When I try to save as Text file to s3 into a subfolder, it only ever writes out to the bucket level folder and produces block level generated file names and not my output folder as I specified. Below is the sample code in Scala, I have also seen this behavior in the Java code. val out = inputRdd.map {ir = mapFunction(ir)}.groupByKey().mapValues { x = mapValuesFunction(x) } .saveAsTextFile(s3://BUCKET/SUB_FOLDER/output Any ideas on how to get saveAsTextFile to write to an S3 subfolder? Thanks, Chris -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/S3-SubFolder-Write-Issues-tp21997.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org -- *Harshvardhan Chauhan* | Software Engineer *GumGum* http://www.gumgum.com/ | *Ads that stick* 310-260-9666 | ha...@gumgum.com
Re: S3 SubFolder Write Issues
You want the s3n:// (native) protocol rather than s3://. s3:// is a block filesystem based on S3 that doesn't respect paths. More information on the Hadoop site: https://wiki.apache.org/hadoop/AmazonS3 Calum. On Wed, 11 Mar 2015 04:47 cpalm3 cpa...@gmail.com wrote: Hi All, I am hoping someone has seen this issue before with S3, as I haven't been able to find a solution for this problem. When I try to save as Text file to s3 into a subfolder, it only ever writes out to the bucket level folder and produces block level generated file names and not my output folder as I specified. Below is the sample code in Scala, I have also seen this behavior in the Java code. val out = inputRdd.map {ir = mapFunction(ir)}.groupByKey().mapValues { x = mapValuesFunction(x) } .saveAsTextFile(s3://BUCKET/SUB_FOLDER/output Any ideas on how to get saveAsTextFile to write to an S3 subfolder? Thanks, Chris -- View this message in context: http://apache-spark-user-list. 1001560.n3.nabble.com/S3-SubFolder-Write-Issues-tp21997.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org
Re: S3 SubFolder Write Issues
Does it write anything in BUCKET/SUB_FOLDER/output? Thanks Best Regards On Wed, Mar 11, 2015 at 10:15 AM, cpalm3 cpa...@gmail.com wrote: Hi All, I am hoping someone has seen this issue before with S3, as I haven't been able to find a solution for this problem. When I try to save as Text file to s3 into a subfolder, it only ever writes out to the bucket level folder and produces block level generated file names and not my output folder as I specified. Below is the sample code in Scala, I have also seen this behavior in the Java code. val out = inputRdd.map {ir = mapFunction(ir)}.groupByKey().mapValues { x = mapValuesFunction(x) } .saveAsTextFile(s3://BUCKET/SUB_FOLDER/output Any ideas on how to get saveAsTextFile to write to an S3 subfolder? Thanks, Chris -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/S3-SubFolder-Write-Issues-tp21997.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org
S3 SubFolder Write Issues
Hi All, I am hoping someone has seen this issue before with S3, as I haven't been able to find a solution for this problem. When I try to save as Text file to s3 into a subfolder, it only ever writes out to the bucket level folder and produces block level generated file names and not my output folder as I specified. Below is the sample code in Scala, I have also seen this behavior in the Java code. val out = inputRdd.map {ir = mapFunction(ir)}.groupByKey().mapValues { x = mapValuesFunction(x) } .saveAsTextFile(s3://BUCKET/SUB_FOLDER/output Any ideas on how to get saveAsTextFile to write to an S3 subfolder? Thanks, Chris -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/S3-SubFolder-Write-Issues-tp21997.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org