dongjoon-hyun commented on code in PR #1141: URL: https://github.com/apache/orc/pull/1141#discussion_r882344779
########## java/bench/fetch-data.sh: ########## @@ -15,8 +15,8 @@ # See the License for the specific language governing permissions and # limitations under the License. mkdir -p data/sources/taxi -(cd data/sources/taxi; wget -O - https://s3.amazonaws.com/nyc-tlc/trip+data/yellow_tripdata_2015-11.csv | gzip > yellow_tripdata_2015-11.csv.gz ) -(cd data/sources/taxi; wget -O - https://s3.amazonaws.com/nyc-tlc/trip+data/yellow_tripdata_2015-12.csv | gzip > yellow_tripdata_2015-12.csv.gz ) +(cd data/sources/taxi; wget https://s3.amazonaws.com/nyc-tlc/trip+data/yellow_tripdata_2015-11.parquet ) Review Comment: BTW, could you update the following accordingly because the new file are 10 times smaller now? https://github.com/apache/orc/blob/1afc31d6c04729d7e194a6423c690af4519aab33/java/bench/README.md#L27 ``` m1max orc:$ ls -alh yello* -rw-r--r-- 1 dongjoon staff 1.7G May 25 23:05 yellow_tripdata_2015-11.csv -rw-r--r-- 1 dongjoon staff 150M May 25 23:06 yellow_tripdata_2015-11.parquet ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
