[ https://issues.apache.org/jira/browse/SPARK-24244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16470947#comment-16470947 ]
Apache Spark commented on SPARK-24244: -------------------------------------- User 'MaxGekk' has created a pull request for this issue: https://github.com/apache/spark/pull/21296 > Parse only required columns of CSV file > --------------------------------------- > > Key: SPARK-24244 > URL: https://issues.apache.org/jira/browse/SPARK-24244 > Project: Spark > Issue Type: Improvement > Components: SQL > Affects Versions: 2.3.0 > Reporter: Maxim Gekk > Priority: Minor > > uniVocity parser allows to specify only required column names or indexes for > parsing like: > {code} > // Here we select only the columns by their indexes. > // The parser just skips the values in other columns > parserSettings.selectIndexes(4, 0, 1); > CsvParser parser = new CsvParser(parserSettings); > {code} > Need to modify *UnivocityParser* to extract only needed columns from > requiredSchema -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org