[ https://issues.apache.org/jira/browse/ACCUMULO-274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13198513#comment-13198513 ]
Keith Turner commented on ACCUMULO-274: --------------------------------------- Good question, we created an example using the file system because its a type of data everyone has. We wanted to show an efficient way of storing a file system tree in Accumulo as an example. We wanted to show how to store large data blobs in Accumulo through chunking. Some reasons to store in Accumulo are that you can insert/update/delete and you can insert/update/delete metadata about the file. To insert/update/delete from a super file (like HAR), you would need to do something similar to the compactions that Accumulo does automatically. You can run analytics over the file contents via map reduce and push that back into the table as metadata about the file. Could also index the files and reference the content in Accumulo. Of course this type of metadata and index could also reference the content in HAR. > Add descriptions to README.filedata > ----------------------------------- > > Key: ACCUMULO-274 > URL: https://issues.apache.org/jira/browse/ACCUMULO-274 > Project: Accumulo > Issue Type: Improvement > Components: docs > Reporter: David Medinets > Assignee: Billie Rinaldi > Priority: Minor > Fix For: 1.4.0 > > > README.filedata says the following: > {noformat} > This example is coupled with the dirlist example. See README.dirlist for > instructions. > {noformat} > I don't see the filedata class describes or references in the README.dirlist > file. The FileDataIngest class has a main method but I don't see information > about how it would be used. I do see this: > {noformat} > * FileDataIngest - Takes a list of files and archives them into Accumulo > keyed on the SHA1 hashes of the files. > {noformat} > How is filedata.FileDataIngest different from the dirlist.Ingest? -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira