> On Oct. 1, 2013, 7:49 p.m., Jarek Cecho wrote:
> > Thank you Alexandre for taking up this ticket!
> > 
> > Would you mind adding automated tests to ensure that the functionality is 
> > indeed working?
> 
> Alexandre Normand wrote:
>     I'd like to be able to integration test bulk load but this is still not a 
> possibility in non-distributed mode. We currently have the same problems in 
> some of our map/reduce jobs that use bulk loading. The way we've approached 
> it with those is that we have two modes for each job: bulk load enabled/bulk 
> load disabled. The integration tests always run with bulk load disabled. The 
> "bulk load mode" is tested on a real cluster with automated tests that run 
> from jenkins. 
>     
>     For what it's worth, I've done the manual testing on our cluster for this 
> patch.

Thank you for the feedback, Alexandre! Do you happen to know if there is already 
a JIRA for adding bulk load support to the HBase MiniCluster? I searched just 
now but didn't find one.
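
For readers following the thread, the two-mode arrangement Alexandre describes above (one job, bulk load enabled or disabled by a flag, with integration tests always taking the disabled path) could be sketched roughly as follows. This is only an illustration: RowSink, DirectPutSink, BulkLoadSink, and runJob are hypothetical names, not part of Sqoop or HBase, and the sinks only record what they would do instead of talking to a cluster.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical sketch: one job whose write path is chosen by a flag. */
public class TwoModeJobSketch {

    /** Hypothetical abstraction over how rows reach the table. */
    interface RowSink {
        void write(String rowKey, String value);
        void finish();          // called once after the last row
        List<String> log();     // what the sink did, for inspection in tests
    }

    /** Stand-in for the regular path: every row is sent as it arrives. */
    static class DirectPutSink implements RowSink {
        private final List<String> log = new ArrayList<>();
        public void write(String rowKey, String value) { log.add("put " + rowKey); }
        public void finish() { log.add("flush"); }
        public List<String> log() { return log; }
    }

    /** Stand-in for the bulk path: rows are staged, then loaded in one step. */
    static class BulkLoadSink implements RowSink {
        private final List<String> staged = new ArrayList<>();
        private final List<String> log = new ArrayList<>();
        public void write(String rowKey, String value) { staged.add(rowKey); }
        public void finish() { log.add("bulk-load " + staged.size() + " rows"); }
        public List<String> log() { return log; }
    }

    /** The only place where the mode flag is consulted. */
    static RowSink sinkFor(boolean bulkLoadEnabled) {
        return bulkLoadEnabled ? new BulkLoadSink() : new DirectPutSink();
    }

    /** Runs the same job logic against whichever sink the flag selects. */
    static List<String> runJob(boolean bulkLoadEnabled, List<String> rowKeys) {
        RowSink sink = sinkFor(bulkLoadEnabled);
        for (String key : rowKeys) {
            sink.write(key, "v");
        }
        sink.finish();
        return sink.log();
    }

    public static void main(String[] args) {
        List<String> rows = Arrays.asList("r1", "r2");
        System.out.println(runJob(false, rows)); // mode used by integration tests
        System.out.println(runJob(true, rows));  // mode exercised on a real cluster
    }
}
```

Keeping the flag check in a single factory method means the rest of the job is identical in both modes, so tests that run with bulk load disabled still cover everything except the final load step.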


> On Oct. 1, 2013, 7:49 p.m., Jarek Cecho wrote:
> > src/java/org/apache/sqoop/mapreduce/HBaseBulkImportJob.java, lines 149-152
> > <https://reviews.apache.org/r/14240/diff/5/?file=359511#file359511line149>
> >
> >     This seems very dangerous, since at some point everyone can read the 
> > files. Wouldn't it be more secure to simply change the owner to the hbase user?
> 
> Alexandre Normand wrote:
>     It would be more secure but it doesn't work. I just tried and I get:
>     Caused by: org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.security.AccessControlException): Non-super user cannot change owner
>     
>     The actual alternative would be to use the new approach enabled by 
> https://issues.apache.org/jira/browse/HBASE-5498 and described in 
> https://cwiki.apache.org/confluence/display/HCATALOG/HBase+Secure+Bulk+Load
>     
>     But sqoop can't assume this patch is available, right?

Yeah, we can't assume that HBASE-5498 is available at this point. Fair 
enough then.


- Jarek


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/14240/#review26576
-----------------------------------------------------------


On Oct. 4, 2013, 5:45 p.m., Alexandre Normand wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/14240/
> -----------------------------------------------------------
> 
> (Updated Oct. 4, 2013, 5:45 p.m.)
> 
> 
> Review request for Sqoop, Jarek Cecho, Jean-Marc Spaggiari, and Vasanth kumar 
> RJ.
> 
> 
> Repository: sqoop-trunk
> 
> 
> Description
> -------
> 
> This is the patch to address some of the comments on Zhancheng Deng's review 
> request (https://reviews.apache.org/r/13052/) which was basically the change 
> to add the --bulk-load-dir option to support the HBase doBulkLoad function.
> 
> 
> Diffs
> -----
> 
>   src/java/org/apache/sqoop/SqoopOptions.java 01805f9 
>   src/java/org/apache/sqoop/hbase/HBasePutProcessor.java 9ceb5bd 
>   src/java/org/apache/sqoop/hbase/ToStringPutTransformer.java 5ccf311 
>   src/java/org/apache/sqoop/manager/SqlManager.java 2a4992d 
>   src/java/org/apache/sqoop/mapreduce/HBaseBulkImportJob.java PRE-CREATION 
>   src/java/org/apache/sqoop/mapreduce/HBaseBulkImportMapper.java PRE-CREATION 
>   src/java/org/apache/sqoop/mapreduce/ImportJobBase.java ab7f21e 
>   src/java/org/apache/sqoop/tool/BaseSqoopTool.java 0eca991 
>   src/test/com/cloudera/sqoop/TestSqoopOptions.java 03e2504 
> 
> Diff: https://reviews.apache.org/r/14240/diff/
> 
> 
> Testing
> -------
> 
> 
> Thanks,
> 
> Alexandre Normand
> 
>
