We currently have data in avro format and we do joins between avro and sequence file data. Will storing these datasets in Parquet make joins any faster ?
The dataset sizes are beyond are between 500 to 1000 GB. -- Deepak
We currently have data in avro format and we do joins between avro and sequence file data. Will storing these datasets in Parquet make joins any faster ?
The dataset sizes are beyond are between 500 to 1000 GB. -- Deepak