[ https://issues.apache.org/jira/browse/HBASE-15223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15141782#comment-15141782 ]
stack commented on HBASE-15223:
-------------------------------

No TestImportExport error this time. I just noted org.apache.hadoop.hbase.snapshot.TestFlushSnapshotFromClient as flaky. Looking at the unit test failures, I'd say commit [~jerryhe]

> Make convertScanToString public for Spark
> -----------------------------------------
>
> Key: HBASE-15223
> URL: https://issues.apache.org/jira/browse/HBASE-15223
> Project: HBase
> Issue Type: Improvement
> Reporter: Jerry He
> Assignee: Jerry He
> Fix For: 2.0.0, 1.3.0
>
> Attachments: HBASE-15223-branch-1-v2.patch,
> HBASE-15223-branch-1.patch, HBASE-15223-master.patch, HBASE-15223-master.patch
>
>
> One way to access HBase from Spark is to use newAPIHadoopRDD, which can take
> TableInputFormat as the class name. But we are not able to set a Scan object
> there, for example to set an HBase filter.
> In MR, the public API TableMapReduceUtil.initTableMapperJob() or an equivalent
> is used, which can take a Scan object. But this call cannot be used
> conveniently from Spark.
> We need to make TableMapReduceUtil.convertScanToString() public, so that a
> Scan object can be created, populated, and then converted to the property
> and used by Spark. They are now package private.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)