tao meng created HUDI-2674:
------------------------------

             Summary: hudi hive reader should not log read values
                 Key: HUDI-2674
                 URL: https://issues.apache.org/jira/browse/HUDI-2674
             Project: Apache Hudi
          Issue Type: Bug
          Components: Hive Integration
    Affects Versions: 0.9.0
         Environment: hudi 0.9.0, hive 3.1.1, hadoop 3.1.1
            Reporter: tao meng
            Assignee: tao meng
             Fix For: 0.10.0

Currently, when we query a Hudi table through Hive with

set hive.input.format=org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat;

every value that is read is printed to the log at INFO level. This can cause performance problems and data security problems. For example:

xxxxxxx 20:10:45,045 | INFO | main | Reading from record reader | HoodieCombineRealtimeRecordReader.java:69
xxxxxx 20:10:45,045 | INFO | main | "values_0.158268513314199_10": {"value0":"20211102192749","type0":"Text","value1":"null","type1":"unknown","value2":"null","type2":"unknown","value3":"null","type3":"unknown","value4":"null","type4":"unknown","value5":"16","type5":"IntWritable","value6":"16jack","type6":"Text","value7":"null","type7":"unknown","value8":"null","type8":"unknown","value9":"null","type9":"unknown"} | HoodieCombineRealtimeRecordReader.java:70
xxxxxxx 20:10:45,045 | INFO | main | Reading from record reader | HoodieCombineRealtimeRecordReader.java:69
xxxxxxx 20:10:45,045 | INFO | main | "values_0.16924293134429924_10": {"value0":"20211102192749","type0":"Text","value1":"null","type1":"unknown","value2":"null","type2":"unknown","value3":"null","type3":"unknown","value4":"null","type4":"unknown","value5":"96","type5":"IntWritable","value6":"96jack","type6":"Text","value7":"null","type7":"unknown","value8":"null","type8":"unknown","value9":"null","type9":"unknown"} | HoodieCombineRealtimeRecordReader.java:70
2021-11-02 20:10:45,045 | INFO | main | Reading from record reader | HoodieCombineRealtimeRecordReader.java:69

--
This message was sent by Atlassian Jira
(v8.3.4#803005)