We are seeing class exceptions when converting to a DataFrame.
Anyone out there with some suggestions on what is going on?
Our original intention was to use a HiveContext to write ORC and we say the
error there and have narrowed it down.
This is an example of our code:
---
def
at 2:56 PM, Chad Urso McDaniel cha...@gmail.com
wrote:
We are seeing class exceptions when converting to a DataFrame.
Anyone out there with some suggestions on what is going on?
Our original intention was to use a HiveContext to write ORC and we say
the error there and have narrowed it down
While it does feel like a filter is what you want to do, a common way to
handle this is to map to different keys.
Using your rddList example it becomes like this (scala style):
---
val rddSplit: RDD[(Int, Any)] = rdd.map(x = (*createKey*(x), x))
val rddBuckets: RDD[(Int, Iterable[Any])] =