[ https://issues.apache.org/jira/browse/PIG-1825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13031068#comment-13031068 ]
Dmitriy V. Ryaboy commented on PIG-1825:
----------------------------------------

Cool. At this point I don't think we need testStoreToHBase_2_no_WAL()? HBase itself doesn't actually test noWAL directly. I'm OK with not testing the full path, just testing that we are using the HBase API correctly.

I do almost want to make it "-noSafety" just to be clear about what one is doing when invoking this "optimization"...

> ability to turn off the write ahead log for pig's HBaseStorage
> --------------------------------------------------------------
>
>                 Key: PIG-1825
>                 URL: https://issues.apache.org/jira/browse/PIG-1825
>             Project: Pig
>          Issue Type: Improvement
>    Affects Versions: 0.8.0
>            Reporter: Corbin Hoenes
>            Priority: Minor
>         Attachments: HBaseStorage_noWAL.patch, PIG-1825_1.patch, PIG-1825_2.patch
>
>
> Added an option to allow a caller of HBaseStorage to turn off the
> WriteAheadLog feature while doing bulk loads into HBase.
> From the performance tuning wiki page:
> http://wiki.apache.org/hadoop/PerformanceTuning
> "To speed up the inserts in a non critical job (like an import job), you can
> use Put.writeToWAL(false) to bypass writing to the write ahead log."
> We've tested this on HBase 0.20.6 and it helps dramatically.
> The -noWAL option is passed in just like other options for HBase storage:
> STORE myalias INTO 'MyTable' USING
>     org.apache.pig.backend.hadoop.hbase.HBaseStorage('mycolumnfamily:field1 mycolumnfamily:field2', '-noWAL');
> This would be my first patch, so please educate me on any steps I need to
> take.
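
For readers following the discussion about testing only that the option is wired up correctly (rather than the full write path), here is a minimal sketch of how a "-noWAL" flag in the option string could be turned into a boolean. The class NoWalOptionSketch and the method parseNoWal are hypothetical names used for illustration; this is not the code in the attached patches, and the real HBaseStorage may parse its options differently. The sketch deliberately has no HBase dependency; the comment marks where the Put-level WAL toggle quoted from the wiki would presumably be applied.

```java
import java.util.Arrays;

// Hypothetical illustration only; not the HBaseStorage implementation.
public class NoWalOptionSketch {

    /** Returns true when the whitespace-separated option string contains -noWAL. */
    static boolean parseNoWal(String optString) {
        if (optString == null || optString.trim().isEmpty()) {
            return false;
        }
        // Exact, case-sensitive match on a whole token.
        return Arrays.asList(optString.trim().split("\\s+")).contains("-noWAL");
    }

    public static void main(String[] args) {
        boolean noWal = parseNoWal("-noWAL");
        // Assumption: in a real storage function, this flag would decide whether
        // each Put is asked to bypass the write ahead log before it is sent.
        System.out.println("skip WAL: " + noWal);
    }
}
```

A unit test along these lines only checks that the flag is recognized and would be passed on, which matches the narrower testing scope suggested in the comment above.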