Re: Repeating Records w/ Spark + Avro?

2016-03-12 Thread Chris Miller
removed. Finally, if I add rdd.persist(), then it doesn't work. I guess I would need to do .map(_._1.datum) again before the map that does the real work. -- Chris Miller On Sat, Mar 12, 2016 at 4:15 PM, Chris Miller wrote: > Wow! That sure is buried in the documentation! But yeah, that&#x

Re: Repeating Records w/ Spark + Avro?

2016-03-12 Thread Chris Miller
tln(record.get("myValue")) }) * What am I doing wrong? -- Chris Miller On Sat, Mar 12, 2016 at 1:48 PM, Peyman Mohajerian wrote: > Here is the reason for the behavior: > '''Note:''' Because Hadoop's RecordReader class re-uses the same Writable > objec

Repeating Records w/ Spark + Avro?

2016-03-11 Thread Chris Miller
one the datum? Seems I'm not the only one who ran into this problem: https://github.com/GoogleCloudPlatform/DataflowJavaSDK/issues/102. I can't figure out how to fix it in my case without hacking away like the person in the linked PR did. Suggestions? -- Chris Miller

Re: Avro Hive SerDe Issue

2016-02-29 Thread Chris Miller
kSQL, but I get the aforementioned error running the same query. This issue is specific to the Avro decoder and SparkSQL. How do I get Avro enum fields to work with SparkSQL? Maybe this is a bug in org.apache.avro.io.ResolvingDecoder.java? -- Chris Miller On Mon, Feb 29, 2016 at 4:41 PM, Chris Mille

Avro Hive SerDe Issue

2016-02-29 Thread Chris Miller
les are valid because I can pull them down off S3, decode them, and see all the data I expect. I'm pretty sure Hive is reading my files because "MetaSchema" is the first field in the schema. Anyone have any idea what is going on here or how I can further debug? -- Chris Miller

Re: fromJson is failing with null as uniontype

2016-02-29 Thread Chris Miller
Did you ever figure this out? I was having the same problem. -- Chris Miller On Fri, Feb 19, 2016 at 2:53 AM, Siva wrote: > Can someone help on this? Is anyone faced similar issue? > > Thanks, > Sivakumar Bhavanari. > > On Wed, Feb 17, 2016 at 4:21 PM, Siva wrote: > >

Re: Avro sample/test files

2015-12-13 Thread Chris Miller
It's really pretty simple. Everything is explained here with examples: http://avro.apache.org/docs/1.7.7/spec.html. That's what I used to figure out how to build my schema. -- Chris Miller On Mon, Dec 14, 2015 at 1:09 AM, John Lilley wrote: > Does anyone know of a website/reposi

Concatenate multiple files?

2015-11-25 Thread Chris Miller
ing logged also has the same delimiter. What's the best way to deal with this? I couldn't find anything in the standard that supports multiple Avro files concatenated into the same file. -- Chris Miller