Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/19113 With 2.7G data, I ran a simple Java problem with 2.5.4 and 2.2.1 with `CsvParser`, and simple e2e read tests. Elapsed time diff was roughly -1.7% ~ +1.2%. I think virtually no diff (or 0.5 improvement). I think we generally trust other communities and libraries we decided to add such as ORC, Parquet, Jackson and etc., and de-duplicate such efforts with the community support. I think we discussed about this before.
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org