[ https://issues.apache.org/jira/browse/MAILBOX-170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13219979#comment-13219979 ]
Eric Charles commented on MAILBOX-170: -------------------------------------- Hi Ioan, imho the storage of the raw mail in hdfs sequence file can be an option. We will need to measure the efficiency of this implementation compared to a pure hbase one (you know the story "... hdfs is for very very large files...". Upon a distributed mailbox locker (JAMES-1388) we also need a mechanism to query efficiently the mailbox (in case of imap search queries for example - not covered neither in the current hbase impl). btw, please ensure the existing mailbox-hbase remain as such (without hadoop), and start the implementation in a mailbox-hadoop project. > Store mailboxes in HDFS SequenceFile > ------------------------------------ > > Key: MAILBOX-170 > URL: https://issues.apache.org/jira/browse/MAILBOX-170 > Project: James Mailbox > Issue Type: Improvement > Components: hbase > Affects Versions: 0.4 > Reporter: Ioan Eugen Stan > Assignee: Ioan Eugen Stan > Fix For: 0.5 > > > The current implementation stores messages directly in HBase. I believe a > better approach is to store the messages as SequenceFiles in the <mail_ID>: > <message_data>. HBase will store sequence File offests in the SequenceFile > for each mailbox for fast access similar to a hadoop MapFile. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: server-dev-unsubscr...@james.apache.org For additional commands, e-mail: server-dev-h...@james.apache.org