[ https://issues.apache.org/jira/browse/HBASE-20334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16453175#comment-16453175 ]
Sean Busbey edited comment on HBASE-20334 at 4/25/18 10:15 PM:
---------------------------------------------------------------

here's my plan:

* start single node hadoop cluster via CLI Minicluster ([docs for 2.7|http://hadoop.apache.org/docs/r2.7.6/hadoop-project-dist/hadoop-common/CLIMiniCluster.html], [docs for 3.0.1|http://hadoop.apache.org/docs/r3.0.2/hadoop-project-dist/hadoop-common/CLIMiniCluster.html])
* start HBase [Standalone-over-HDFS mode|http://hbase.apache.org/book.html#standalone.over.hdfs]
* load example TSV file to HDFS via Hadoop CLI
* import from tsv using hbase-shaded-mapreduce
* use utility program to scan result of import and compare it to data in HDFS, using shaded hbase client and hadoop dependencies (shaded hadoop client in the case of Hadoop 3)

two questions that I'd like feedback on, but will just pick something if needed:

a) where does the utility program live? is it in our code repo? do I generate it in the test? hbase-downstreamer?

b) where does this test go? I could make it a yetus plugin. that would let us choose running it in precommit in addition to nightly if we wanted. Or I could just add it as a non-yetus step to our nightly builds, ala the "check source artifact" one.
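For the last step of the plan, the comparison itself could look roughly like the sketch below. It is only an illustration under stated assumptions: the class name {{TsvImportCheck}} and both method names are hypothetical, the TSV is assumed to have exactly two columns (row key, then a single value), and the "scanned" map stands in for whatever the utility program collects from a table scan through the shaded hbase client. The HDFS read and the HBase scan are deliberately left out.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper for illustration only; not part of HBase or Hadoop. */
final class TsvImportCheck {

  /**
   * Parses lines of the assumed form rowkey<TAB>value into an ordered map.
   * Empty lines are skipped; anything without a tab is rejected.
   */
  static Map<String, String> parseTsv(List<String> lines) {
    Map<String, String> expected = new LinkedHashMap<>();
    for (String line : lines) {
      if (line.isEmpty()) {
        continue;
      }
      String[] fields = line.split("\t", 2);
      if (fields.length != 2) {
        throw new IllegalArgumentException("malformed TSV line: " + line);
      }
      expected.put(fields[0], fields[1]);
    }
    return expected;
  }

  /**
   * Compares what the TSV says should be in the table with what a scan
   * returned. An empty result means the two agree.
   */
  static List<String> diff(Map<String, String> expected, Map<String, String> scanned) {
    List<String> problems = new ArrayList<>();
    for (Map.Entry<String, String> e : expected.entrySet()) {
      String actual = scanned.get(e.getKey());
      if (actual == null) {
        problems.add("missing row: " + e.getKey());
      } else if (!actual.equals(e.getValue())) {
        problems.add("value mismatch for row: " + e.getKey());
      }
    }
    for (String row : scanned.keySet()) {
      if (!expected.containsKey(row)) {
        problems.add("unexpected row: " + row);
      }
    }
    return problems;
  }
}
```

The test would pass when {{diff}} returns an empty list and fail (printing the list) otherwise. A real version would presumably need to handle multiple column families and qualifiers, which this sketch ignores.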
was (Author: busbey):
here's my plan:

* start single node hadoop cluster via CLI Minicluster ([docs for 2.7|http://hadoop.apache.org/docs/r2.7.6/hadoop-project-dist/hadoop-common/CLIMiniCluster.html], [docs for 3.0.1|http://hadoop.apache.org/docs/r3.0.2/hadoop-project-dist/hadoop-common/CLIMiniCluster.html])
* start HBase [Standalone-over-HDFS mode|http://hbase.apache.org/book.html#standalone.over.hdfs]
* load example TSV file to HDFS via Hadoop CLI
* import from tsv using shaded mapreduce
* use utility program to scan result of import and compare it to data in HDFS, using shaded hbase client and hadoop dependencies (shaded hadoop client in the case of Hadoop 3)

two questions that I'd like feedback on, but will just pick something if needed:

a) where does the utility program live? is it in our code repo? do I generate it in the test? hbase-downstreamer?

b) where does this test go? I could make it a yetus plugin. that would let us choose running it in precommit in addition to nightly if we wanted. Or I could just add it as a non-yetus step to our nightly builds, ala the "check source artifact" one.

> add a test that expressly uses both our shaded client and the one from hadoop 3
> -------------------------------------------------------------------------------
>
>                 Key: HBASE-20334
>                 URL: https://issues.apache.org/jira/browse/HBASE-20334
>             Project: HBase
>          Issue Type: Sub-task
>          Components: hadoop3, shading
>    Affects Versions: 2.0.0
>            Reporter: Sean Busbey
>            Assignee: Sean Busbey
>            Priority: Major
>
> Since we're making a shaded client that bleeds out of our namespace and into
> Hadoop's, we should ensure that we can show our clients coexisting. Even if
> it's just an IT that successfully talks to both us and HDFS via our
> respective shaded clients, that'd be a big help in keeping us proactive.

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)