[ 
https://issues.apache.org/jira/browse/PIG-1825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bill Graham updated PIG-1825:
-----------------------------

    Attachment: PIG-1825_2.patch

Adding a second patch with new tests for option 2, which asserts that the WAL 
is disabled when createPut is called. There's still no assertion that createPut 
actually is used though, just as the original tests couldn't assert that 
disable WAL was called. 

Ideally we could somehow mock the Put or HTable or something to verify, but I 
think this would disrupt the code quite a bit for a simple test.

Agreed, disabling the WAL should only be done with extreme caution. 

> ability to turn off the write ahead log for pig's HBaseStorage
> --------------------------------------------------------------
>
>                 Key: PIG-1825
>                 URL: https://issues.apache.org/jira/browse/PIG-1825
>             Project: Pig
>          Issue Type: Improvement
>    Affects Versions: 0.8.0
>            Reporter: Corbin Hoenes
>            Priority: Minor
>         Attachments: HBaseStorage_noWAL.patch, PIG-1825_1.patch, 
> PIG-1825_2.patch
>
>
> Added an option to allow a caller of HBaseStorage to turn off the 
> WriteAheadLog feature while doing bulk loads into hbase.
> From the performance tuning wikipage: 
> http://wiki.apache.org/hadoop/PerformanceTuning
> "To speed up the inserts in a non critical job (like an import job), you can 
> use Put.writeToWAL(false) to bypass writing to the write ahead log."
> We've tested this on HBase 0.20.6 and it helps dramatically.  
> The -noWAL options is passed in just like other options for hbase storage:
> STORE myalias INTO 'MyTable' USING 
> org.apache.pig.backend.hadoop.hbase.HBaseStorage('mycolumnfamily:field1 
> mycolumnfamily:field2','-noWAL');
> This would be my first patch so please educate me with any steps I need to 
> do.  

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to