RE: SparkR dataFrame read.df fails to read from aws s3

2015-07-09 Thread Sun, Rui
Hi, Ben


1)  I guess this may be a JDK version mismatch. Could you check the JDK 
version?

2)  I believe this is a bug in SparkR. I will file a JIRA issue for it.
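
For point 1, a quick way to check the JDK version from within the R session might look like the sketch below. It assumes the `java` binary on the PATH of the R process is the same one the Spark backend was launched with, which may not hold on every EMR setup; the variable names are only illustrative.

```r
# Locate the `java` binary on the PATH seen by this R session.
java_path <- Sys.which("java")

if (nzchar(java_path)) {
  # `java -version` prints the JDK/JRE version (it writes to stderr).
  system("java -version")
} else {
  message("No java binary found on PATH")
}

# JAVA_HOME, when set, usually decides which JDK the Spark launch scripts use.
# Assumption: the cluster relies on JAVA_HOME rather than a hard-coded path.
java_home <- Sys.getenv("JAVA_HOME", unset = NA)
print(java_home)
```

Comparing this output with the JDK that the Spark 1.4 build was compiled against should show whether there is a mismatch.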

From: Ben Spark [mailto:ben_spar...@yahoo.com.au]
Sent: Thursday, July 9, 2015 12:14 PM
To: user
Subject: SparkR dataFrame read.df fails to read from aws s3

I have Spark 1.4 deployed on AWS EMR, but the SparkR DataFrame read.df 
method cannot load data from AWS S3.

1) read.df error message

 read.df(sqlContext, "s3://some-bucket/some.json", "json")

15/07/09 04:07:01 ERROR r.RBackendHandler: loadDF on org.apache.spark.sql.api.r.SQLUtils failed
java.lang.IllegalArgumentException: invalid method loadDF for object org.apache.spark.sql.api.r.SQLUtils
    at org.apache.spark.api.r.RBackendHandler.handleMethodCall(RBackendHandler.scala:143)
    at org.apache.spark.api.r.RBackendHandler.channelRead0(RBackendHandler.scala:74)
    at org.apache.spark.api.r.RBackendHandler.channelRead0(RBackendHandler.scala:36)
    at io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:105)

2) jsonFile works, though it produces a warning message

Warning message:
In normalizePath(path) :
  path[1]=s3://rea-consumer-data-dev/cbr/profiler/output/20150618/part-0: No such file or directory


SparkR dataFrame read.df fails to read from aws s3

2015-07-08 Thread Ben Spark
I have Spark 1.4 deployed on AWS EMR, but the SparkR DataFrame read.df 
method cannot load data from AWS S3.
1) read.df error message 
read.df(sqlContext, "s3://some-bucket/some.json", "json")
15/07/09 04:07:01 ERROR r.RBackendHandler: loadDF on org.apache.spark.sql.api.r.SQLUtils failed
java.lang.IllegalArgumentException: invalid method loadDF for object org.apache.spark.sql.api.r.SQLUtils
    at org.apache.spark.api.r.RBackendHandler.handleMethodCall(RBackendHandler.scala:143)
    at org.apache.spark.api.r.RBackendHandler.channelRead0(RBackendHandler.scala:74)
    at org.apache.spark.api.r.RBackendHandler.channelRead0(RBackendHandler.scala:36)
    at io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:105)
2) jsonFile works, though it produces a warning message
Warning message:
In normalizePath(path) :
  path[1]=s3://rea-consumer-data-dev/cbr/profiler/output/20150618/part-0: No such file or directory