On Wed, Nov 19, 2014 at 12:41 AM, Daniel Haviv <danielru...@gmail.com> wrote: > > Another problem I have is that I get a lot of small json files and as a > result a lot of small parquet files, I'd like to merge the json files into > a few parquet files.. how I do that? >
You can use `coalesce` on any RDD to merge files.