dongjoon-hyun commented on code in PR #1141:
URL: https://github.com/apache/orc/pull/1141#discussion_r882344779


##########
java/bench/fetch-data.sh:
##########
@@ -15,8 +15,8 @@
 # See the License for the specific language governing permissions and
 # limitations under the License.
 mkdir -p data/sources/taxi
-(cd data/sources/taxi; wget -O - https://s3.amazonaws.com/nyc-tlc/trip+data/yellow_tripdata_2015-11.csv | gzip > yellow_tripdata_2015-11.csv.gz )
-(cd data/sources/taxi; wget -O - https://s3.amazonaws.com/nyc-tlc/trip+data/yellow_tripdata_2015-12.csv | gzip > yellow_tripdata_2015-12.csv.gz )
+(cd data/sources/taxi; wget https://s3.amazonaws.com/nyc-tlc/trip+data/yellow_tripdata_2015-11.parquet )

Review Comment:
   BTW, could you update the following accordingly, because the new files are about 10 times smaller now?
   
https://github.com/apache/orc/blob/1afc31d6c04729d7e194a6423c690af4519aab33/java/bench/README.md#L27
   
   ```
   m1max orc:$ ls -alh yello*
   -rw-r--r--  1 dongjoon  staff   1.7G May 25 23:05 yellow_tripdata_2015-11.csv
   -rw-r--r--  1 dongjoon  staff   150M May 25 23:06 yellow_tripdata_2015-11.parquet
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.