Re: Parquet generation on AWS Lambda Function

Ryan Blue Fri, 06 Jan 2017 14:23:45 -0800

Marcos,

Parquet currently depends on Hadoop for IO operations and compression. You
just need to include hadoop-client in your classpath and it should work
fine.


rb

On Fri, Jan 6, 2017 at 10:27 AM, marcos rebelo <[email protected]> wrote:

> Hi all
>
> I'm receiving a set of csv/json files on S3 and I would like to transform
> them to parquet. Considering the restriction of Lambda Function I would
> like to create some code that can generate the parquet file. I didn't found
> how to do it without Hadoop. Considering the simplicity of the task (file
> conversion), I can't believe that is hard to do something similar to it.
>
> Note: I'm a Scala developer, but I can code/adapt any java code.
>
> Can someone give me one hand on this task?
>
> Best Regards
> Marcos Rebelo
>



-- 
Ryan Blue
Software Engineer
Netflix

Re: Parquet generation on AWS Lambda Function

Reply via email to