-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/16533/#review31537
-----------------------------------------------------------



build.xml
<https://reviews.apache.org/r/16533/#comment60056>

    Remove this line



src/org/apache/pig/backend/hadoop/accumulo/AccumuloStorage.java
<https://reviews.apache.org/r/16533/#comment60058>

    We can remove this line


That's much better, thanks.

The caster is added. For AccumuloBinaryConverter, though, we should keep the 
data in binary form. For example, toBytes(Integer) should not convert the 
integer to a String and then take the bytes of that string; it should keep the 
4-byte integer representation (like Bytes.toBytes(Integer) in HBase; I am not 
sure what is best in Accumulo). Also, in addition to specifying the caster as a 
constructor parameter, it would be better to have a configuration entry for it. 
I assume users who prefer BinaryConverter will always use BinaryConverter and 
will not want to specify the option every time.
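
To make the difference concrete, here is a minimal sketch of the two encodings 
using only the JDK. The class and method names are hypothetical and are not 
taken from the patch; big-endian byte order is an assumption, chosen because it 
is ByteBuffer's default.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical illustration only; not code from the patch under review.
public class IntEncodingSketch {

    // Text-style encoding: the decimal digits of the integer as UTF-8 bytes.
    // The length varies with the magnitude of the value.
    static byte[] toBytesAsText(int value) {
        return Integer.toString(value).getBytes(StandardCharsets.UTF_8);
    }

    // Binary-style encoding: always 4 bytes, big-endian (ByteBuffer's default).
    static byte[] toBytesAsBinary(int value) {
        return ByteBuffer.allocate(Integer.BYTES).putInt(value).array();
    }

    // Reverse of toBytesAsBinary: read a 4-byte big-endian integer.
    static int fromBinary(byte[] bytes) {
        return ByteBuffer.wrap(bytes).getInt();
    }

    public static void main(String[] args) {
        int age = 42;
        System.out.println(Arrays.toString(toBytesAsText(age)));   // [52, 50]
        System.out.println(Arrays.toString(toBytesAsBinary(age))); // [0, 0, 0, 42]
        System.out.println(fromBinary(toBytesAsBinary(age)));      // 42
    }
}
```

The binary form has a fixed width and round-trips without parsing, which is 
the property a binary converter would be expected to preserve.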

I still feel the syntax for AccumuloStorage is not straightforward. Could users 
use the following style (similar to HBaseStorage)?

a = load 'accumulo://....' using AccumuloStorage('info:age info2:address', 
'other options');
-- produces a 3-item tuple (key, info:age, info2:address). info/info2 are the 
colFams, age/address are the colQuals
-- the user might optionally specify info:*, which will produce a map including 
every colQual in that colFam
-- I see an aggregate flag in the code, but I cannot think of many use cases 
where it would be useful. What do you think?

store x into 'accumulo://....' using AccumuloStorage('info:age info2:address', 
'other options');
-- the input tuple contains 3 items: (key, info:age, info2:address)
-- mirroring the load side, the user can pass a map instead, in which case he 
needs to use a wildcard in the constructor: AccumuloStorage('info:*');
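
As a rough illustration of how such a column specification could be 
interpreted, the sketch below splits a string like 'info:age info2:address' 
into (colFam, colQual) pairs and flags the '*' wildcard. The class and method 
names are hypothetical, and the exact grammar (whitespace-separated 
colFam:colQual tokens) is an assumption based on the examples above, not on 
the patch.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not code from the patch under review.
public class ColumnSpecSketch {

    /** One requested column: a column family plus a qualifier ("*" = all qualifiers). */
    static final class Column {
        final String family;
        final String qualifier;

        Column(String family, String qualifier) {
            this.family = family;
            this.qualifier = qualifier;
        }

        // A wildcard column would map to a Pig map of every colQual in the colFam.
        boolean isWildcard() {
            return "*".equals(qualifier);
        }

        @Override
        public String toString() {
            return family + ":" + qualifier;
        }
    }

    /** Splits a whitespace-separated list of colFam:colQual tokens. */
    static List<Column> parse(String spec) {
        List<Column> columns = new ArrayList<>();
        for (String token : spec.trim().split("\\s+")) {
            if (token.isEmpty()) {
                continue; // an empty spec yields no columns
            }
            int sep = token.indexOf(':');
            if (sep <= 0 || sep == token.length() - 1) {
                throw new IllegalArgumentException("Expected colFam:colQual but got: " + token);
            }
            columns.add(new Column(token.substring(0, sep), token.substring(sep + 1)));
        }
        return columns;
    }

    public static void main(String[] args) {
        System.out.println(parse("info:age info2:address")); // [info:age, info2:address]
        System.out.println(parse("info:*").get(0).isWildcard()); // true
    }
}
```

With the row key in the first position, a spec with N columns would correspond 
to a tuple of N + 1 items on both the load and the store side.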

- Daniel Dai


On Jan. 10, 2014, 7:20 p.m., Josh Elser wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/16533/
> -----------------------------------------------------------
> 
> (Updated Jan. 10, 2014, 7:20 p.m.)
> 
> 
> Review request for pig.
> 
> 
> Bugs: PIG-3573
>     https://issues.apache.org/jira/browse/PIG-3573
> 
> 
> Repository: pig-git
> 
> 
> Description
> -------
> 
> Provides basic StoreFunc and LoadFunc implementations. Based off of code that 
> was in an Accumulo contrib project.
> 
> 
> Diffs
> -----
> 
>   build.xml 575c9ae 
>   ivy.xml 180eb2c 
>   ivy/libraries.properties 14abdf8 
>   src/org/apache/pig/backend/hadoop/accumulo/AbstractAccumuloStorage.java PRE-CREATION 
>   src/org/apache/pig/backend/hadoop/accumulo/AccumuloBinaryConverter.java PRE-CREATION 
>   src/org/apache/pig/backend/hadoop/accumulo/AccumuloStorage.java PRE-CREATION 
>   src/org/apache/pig/backend/hadoop/accumulo/AccumuloStorageOptions.java PRE-CREATION 
>   src/org/apache/pig/backend/hadoop/accumulo/FixedByteArrayOutputStream.java PRE-CREATION 
>   src/org/apache/pig/backend/hadoop/accumulo/Utils.java PRE-CREATION 
>   test/org/apache/pig/backend/hadoop/accumulo/AbstractAccumuloStorageTest.java PRE-CREATION 
>   test/org/apache/pig/backend/hadoop/accumulo/AccumuloPigClusterTest.java PRE-CREATION 
>   test/org/apache/pig/backend/hadoop/accumulo/AccumuloStorageConfigurationTest.java PRE-CREATION 
>   test/org/apache/pig/backend/hadoop/accumulo/AccumuloStorageOptionsTest.java PRE-CREATION 
>   test/org/apache/pig/backend/hadoop/accumulo/AccumuloStorageTest.java PRE-CREATION 
> 
> Diff: https://reviews.apache.org/r/16533/diff/
> 
> 
> Testing
> -------
> 
> Local tests reading, writing and JOIN'ing Accumulo tables. Tested against 
> Hadoop-1.0.4 and 2.2.0, with Accumulo 1.5.0
> 
> 
> Thanks,
> 
> Josh Elser
> 
>
