[ https://issues.apache.org/jira/browse/SPARK-26577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dongjoon Hyun closed SPARK-26577.
---------------------------------

> Add input optimizer when reading Hive table by SparkSQL
> -------------------------------------------------------
>
>                 Key: SPARK-26577
>                 URL: https://issues.apache.org/jira/browse/SPARK-26577
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.4.1
>            Reporter: Deegue
>            Priority: Minor
>
> When using SparkSQL, for example the ThriftServer, if we set
> `spark.sql.hive.inputFormat.optimizer.enabled=true`
> we can optimize the InputFormat to CombineTextInputFormat automatically if
> it's TextInputFormat before. And we can also change the max/min size of input
> splits by setting, for example
> `mapreduce.input.fileinputformat.split.maxsize=268435456`
> `mapreduce.input.fileinputformat.split.minsize=134217728`
>
> Otherwise, we have to modify Hive Configs and structure of tables.

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
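
To make the split sizes quoted in the issue concrete, here is a rough sketch in plain Python. It converts the two byte values to MiB and simulates how combining many small text files into larger splits might reduce the number of input splits. The packing function `combine_splits` and the table of 100 files of 10 MiB each are hypothetical, invented for illustration. The greedy packing is a simplified model and not Hadoop's actual combine logic, which also takes node and rack locality into account. Note also that `spark.sql.hive.inputFormat.optimizer.enabled` is the setting proposed in this issue, so it should not be assumed to exist in a released Spark version.

```python
MIB = 1024 * 1024

# Values quoted in the issue description, in bytes.
SPLIT_MAX_SIZE = 268435456  # mapreduce.input.fileinputformat.split.maxsize
SPLIT_MIN_SIZE = 134217728  # mapreduce.input.fileinputformat.split.minsize


def combine_splits(file_sizes, max_size):
    """Greedily pack whole files into combined splits of at most max_size bytes.

    Hypothetical helper, simplified for illustration: it ignores data
    locality and never chops a file, so a file larger than max_size
    simply ends up alone in its own split.
    """
    splits = []
    current, current_size = [], 0
    for size in file_sizes:
        # Close the current split if adding this file would exceed the cap.
        if current and current_size + size > max_size:
            splits.append(current)
            current, current_size = [], 0
        current.append(size)
        current_size += size
    if current:
        splits.append(current)
    return splits


# Hypothetical table: 100 small files of 10 MiB each.
small_files = [10 * MIB] * 100
combined = combine_splits(small_files, SPLIT_MAX_SIZE)

print(SPLIT_MAX_SIZE // MIB, SPLIT_MIN_SIZE // MIB)  # 256 128
print(len(small_files), len(combined))               # 100 4
```

Under this model, a per-file input format would yield one split (and so roughly one task) per file, while the combined layout needs only four, each holding 250 MiB. The minimum size is not used by the sketch; it is shown only to put the quoted value in readable units.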