[ 
https://issues.apache.org/jira/browse/ACCUMULO-274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13198513#comment-13198513
 ] 

Keith Turner commented on ACCUMULO-274:
---------------------------------------

Good question, we created an example using the file system because its a type 
of data everyone has.  We wanted to show an efficient way of storing a file 
system tree in Accumulo as an example.  We wanted to show how to store large 
data blobs in Accumulo through chunking.  

Some reasons to store in Accumulo are that you can insert/update/delete and you 
can insert/update/delete metadata about the file.  To insert/update/delete from 
a super file (like HAR), you would need to do something similar to the 
compactions that Accumulo does automatically.  You can run analytics over the 
file contents via map reduce and push that back into the table as metadata 
about the file.  Could also index the files and reference the content in 
Accumulo.  Of course this type of metadata and index could also reference the 
content in HAR.

                
> Add descriptions to README.filedata
> -----------------------------------
>
>                 Key: ACCUMULO-274
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-274
>             Project: Accumulo
>          Issue Type: Improvement
>          Components: docs
>            Reporter: David Medinets
>            Assignee: Billie Rinaldi
>            Priority: Minor
>             Fix For: 1.4.0
>
>
> README.filedata says the following:
> {noformat}
>   This example is coupled with the dirlist example.  See README.dirlist for 
> instructions.
> {noformat}
> I don't see the filedata class describes or references in the README.dirlist 
> file. The FileDataIngest class has a main method but I don't see information 
> about how it would be used. I do see this:
> {noformat}
>  * FileDataIngest - Takes a list of files and archives them into Accumulo 
> keyed on the SHA1 hashes of the files.
> {noformat}
> How is filedata.FileDataIngest different from the dirlist.Ingest?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to