ability to turn off the write ahead log for pig's HBaseStorage
--------------------------------------------------------------

                 Key: PIG-1825
                 URL: https://issues.apache.org/jira/browse/PIG-1825
             Project: Pig
          Issue Type: Improvement
    Affects Versions: 0.8.0
            Reporter: Corbin Hoenes
            Priority: Minor


Added an option that lets a caller of HBaseStorage turn off the write-ahead
log (WAL) while doing bulk loads into HBase.

From the performance tuning wiki page:
http://wiki.apache.org/hadoop/PerformanceTuning
"To speed up the inserts in a non critical job (like an import job), you can 
use Put.writeToWAL(false) to bypass writing to the write ahead log."

We've tested this on HBase 0.20.6 and it helps dramatically.  

The -noWAL option is passed in just like the other HBaseStorage options:

STORE myalias INTO 'MyTable' USING
    org.apache.pig.backend.hadoop.hbase.HBaseStorage(
        'mycolumnfamily:field1 mycolumnfamily:field2', '-noWAL');
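
To illustrate how a flag like this could be picked out of the options string,
here is a minimal sketch in plain Java. It is not the code from the patch: the
class name NoWalOptionSketch, the method parseNoWal, and the placeholder
option -someOtherOption are all hypothetical, and the real HBaseStorage may
parse its options differently.

```java
import java.util.Arrays;

public class NoWalOptionSketch {

    // Hypothetical helper: returns true when the whitespace-separated
    // option string contains the -noWAL flag from this issue.
    public static boolean parseNoWal(String optString) {
        if (optString == null || optString.trim().isEmpty()) {
            return false; // no options given, so the WAL stays enabled
        }
        return Arrays.asList(optString.trim().split("\\s+")).contains("-noWAL");
    }

    public static void main(String[] args) {
        // -someOtherOption is a made-up placeholder, not a real HBaseStorage option.
        System.out.println(parseNoWal("-noWAL"));                  // prints "true"
        System.out.println(parseNoWal("-someOtherOption value"));  // prints "false"
        System.out.println(parseNoWal(""));                        // prints "false"
    }
}
```

In a storer, the resulting boolean would then decide whether each Put skips
the write-ahead log, which is the behaviour the wiki quote above describes.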

This is my first patch, so please let me know of any steps I still need to take.
