Hello guys,
I have to write my batch data (Dataset<Row>) to a file format. Actually
what I need to do is:

   1. split the data if it exceeds some size threshold  (by line count or
   max MB)
   2. compress the output data (possibly without converting to the hadoop
   format)

Are there any suggestions / recommendations about that?

Best,
Flavio

Reply via email to