> On 1月 14, 2016, 5:36 p.m., Prasad Mujumdar wrote: > > My apologies for jumping in late in the review > > > > I am curious about why you chose HiveServer2 interface to implement this. I > > guess one can work it around by 'INSERT OVERWRITE DIRECTORY .. SELECT ..' > > and so one of the options could be to solve this in the query processing > > (eg rewrite the query during parsing etc). The problem for HS2 solution is > > that it's not going to be accessible to high level clients like > > JDBC/ODBC/Beeline. One has to write a custom HS2 client driver in order to > > pass the new parameters. > > > > Regarding this patch, one comment below.
Thank you for your review and comments, Prasad! > I am curious about why you chose HiveServer2 interface to implement this. When some JDBC clients download large data from HiveServer2, HiveServer2 will be a bottle neck. This bypass is for between JDBC clients and HDFS so that JDBC clients can download data from HDFS directly. > I guess one can work it around by 'INSERT OVERWRITE DIRECTORY .. SELECT ..' > and so one of the options could be to solve this in the query processing (eg > rewrite the query during parsing etc). > The problem for HS2 solution is that it's not going to be accessible to high > level clients like JDBC/ODBC/Beeline. What do you mean by that? I rather think this bypass would not work 'INSERT OVERWRITE DIRECTORY .. SELECT ..'. Because it does not return results to JDBC clients. > On 1月 14, 2016, 5:36 p.m., Prasad Mujumdar wrote: > > service/if/TCLIService.thrift, line 272 > > <https://reviews.apache.org/r/40867/diff/4/?file=1187218#file1187218line272> > > > > Does it need to be a required parameter ? Adding a new required > > parameter in the middle will make it break the backward compatibility for > > existing clients. Yes, it is necessary to be a required parameter since this patch had TypeDescriptor created by the typename in ColumnDescriptor. (Please see last Sergey's review.) Next patch fixes the order of the parameters. - Takanobu ----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/40867/#review114504 ----------------------------------------------------------- On 1月 8, 2016, 7:18 a.m., Takanobu Asanuma wrote: > > ----------------------------------------------------------- > This is an automatically generated e-mail. To reply, visit: > https://reviews.apache.org/r/40867/ > ----------------------------------------------------------- > > (Updated 1月 8, 2016, 7:18 a.m.) > > > Review request for hive. > > > Repository: hive-git > > > Description > ------- > > This is a WIP patch for HIVE-11527 > > * I added a new configuration whose name is > hive.server2.webhdfs.bypass.enabled. The default is false. When this value is > true, clients use the bypass. > > * I still have not considered security such as Kerberos and SSL at present. > > * I have not implement Statement#setFetchSize for bypass yet. > > > Diffs > ----- > > common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 479fa46 > jdbc/src/java/org/apache/hive/jdbc/HiveQueryResultSet.java 245c6a3 > jdbc/src/java/org/apache/hive/jdbc/HiveStatement.java 3f9024a > ql/src/java/org/apache/hadoop/hive/ql/Driver.java 29e6315 > ql/src/java/org/apache/hadoop/hive/ql/exec/FetchTask.java 1634143 > service/if/TCLIService.thrift baf583f > service/src/gen/thrift/gen-cpp/TCLIService_types.h b078c99 > service/src/gen/thrift/gen-cpp/TCLIService_types.cpp b852379 > > service/src/gen/thrift/gen-javabean/org/apache/hive/service/cli/thrift/TColumnDesc.java > b01fadb > > service/src/gen/thrift/gen-javabean/org/apache/hive/service/cli/thrift/TExecuteStatementResp.java > 0b9aa0f > > service/src/gen/thrift/gen-javabean/org/apache/hive/service/cli/thrift/TProtocolVersion.java > c936ada > service/src/gen/thrift/gen-py/TCLIService/ttypes.py ef5f5f5 > service/src/gen/thrift/gen-rb/t_c_l_i_service_types.rb f004ec4 > service/src/java/org/apache/hive/service/cli/CLIService.java adc9809 > service/src/java/org/apache/hive/service/cli/ColumnDescriptor.java f0bbf14 > service/src/java/org/apache/hive/service/cli/operation/Operation.java > d2b3f9c > > service/src/java/org/apache/hive/service/cli/operation/OperationManager.java > 92135cd > service/src/java/org/apache/hive/service/cli/operation/SQLOperation.java > 2eaab4a > service/src/java/org/apache/hive/service/cli/session/HiveSession.java > 4f4e92d > service/src/java/org/apache/hive/service/cli/session/HiveSessionImpl.java > 50c912e > service/src/java/org/apache/hive/service/cli/thrift/ThriftCLIService.java > 8434965 > > Diff: https://reviews.apache.org/r/40867/diff/ > > > Testing > ------- > > I have tested few simple queries and they worked well. But I think there are > some problems for some queries. I'm going to test more queries and fix bugs. > I'm also going to add unit tests. > > > Thanks, > > Takanobu Asanuma > >