jsonFiles in your code is schemaRDD rather than RDD[Array]. If it is a column in schemaRDD, you can first use Spark SQL query to get a certain column. Or schemaRDD support some SQL like operation such as select / where can also get specific column.
> 在 2014年11月24日,上午4:01,Daniel Haviv <danielru...@gmail.com> 写道: > > Hi, > I have a column in my schemaRDD that is a map but I'm unable to convert it to > a map.. I've tried converting it to a Tuple2[String,String]: > val converted = jsonFiles.map(line=> { > line(10).asInstanceOf[Tuple2[String,String]]}) > > but I get ClassCastException: > 14/11/23 11:51:30 WARN scheduler.TaskSetManager: Lost task 0.0 in stage 1.0 > (TID 2, localhost): java.lang.ClassCastException: > org.apache.spark.sql.catalyst.expressions.GenericRow cannot be cast to > scala.Tuple2 > > And if if convert it to Iterable[String] I can only get the values without > the keys. > > What it the correct data type I should convert it to ? > > Thanks, > Daniel --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org