Gopal V created HIVE-10940:
------------------------------
Summary: HiveInputFormat::pushFilters serializes PPD objects for
each getRecordReader call
Key: HIVE-10940
URL: https://issues.apache.org/jira/browse/HIVE-10940
Project: Hive
Issue Type: Bug
Components: File Formats
Affects Versions: 1.2.0
Reporter: Gopal V
Assignee: Sergey Shelukhin
{code}
String filterText = filterExpr.getExprString();
String filterExprSerialized = Utilities.serializeExpression(filterExpr);
{code}
the serializeExpression initializes Kryo and produces a new packed object for
every split.
HiveInputFormat::getRecordReader -> pushProjectionAndFilters -> pushFilters.
And Kryo is very slow to do this for a large filter clause.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)