[GitHub] [parquet-mr] dbtsai commented on pull request #793: PARQUET-1866: Replace Hadoop ZSTD with JNI-ZSTD

2020-06-01 Thread GitBox


dbtsai commented on pull request #793:
URL: https://github.com/apache/parquet-mr/pull/793#issuecomment-637195534


   @shangxinli do we have benchmark comparing to native hadoop codec both in 
size and speed? Thanks.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [parquet-mr] dbtsai commented on pull request #793: PARQUET-1866: Replace Hadoop ZSTD with JNI-ZSTD

2020-06-01 Thread GitBox


dbtsai commented on pull request #793:
URL: https://github.com/apache/parquet-mr/pull/793#issuecomment-637193519


   +1 @shangxinli and thank you for this contribution. 
   
   This will allow users who are on order versions of hadoop that don't support 
native ZSTD to use ZSTD compression in Parquet, and also, users don't have to 
go through the very complicated hadoop native installation. For developers, we 
will be able to easily test this out in different local envs.  
   
   cc @rdblue 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org