Marcos, Parquet currently depends on Hadoop for IO operations and compression. You just need to include hadoop-client in your classpath and it should work fine.
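
As a rough sketch of what that could look like in an sbt build (you mentioned Scala): the fragment below only puts the Hadoop client classes on the classpath next to a Parquet module. The choice of parquet-avro as the object model and both version numbers are assumptions for illustration, not something your setup requires, so adjust them to what you actually use.

```scala
// build.sbt (sketch; artifact choice and versions are assumptions)
libraryDependencies ++= Seq(
  // Assumed object model: Parquet's Avro binding. Another binding could be used instead.
  "org.apache.parquet" % "parquet-avro" % "1.9.0",
  // Hadoop classes that Parquet uses for IO and compression.
  // This is a library dependency only; it does not require a running Hadoop cluster.
  "org.apache.hadoop" % "hadoop-client" % "2.7.3"
)
```

With both on the classpath, a writer such as parquet-avro's AvroParquetWriter should be usable from plain JVM code.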

rb

On Fri, Jan 6, 2017 at 10:27 AM, marcos rebelo <[email protected]> wrote:

> Hi all
>
> I'm receiving a set of csv/json files on S3 and I would like to transform
> them to parquet. Considering the restrictions of Lambda functions, I would
> like to create some code that can generate the parquet file. I didn't find
> how to do it without Hadoop. Considering the simplicity of the task (file
> conversion), I can't believe that it is hard to do something like this.
>
> Note: I'm a Scala developer, but I can code/adapt any java code.
>
> Can someone give me a hand with this task?
>
> Best Regards
> Marcos Rebelo

--
Ryan Blue
Software Engineer
Netflix
