Well the documents can be in various formats like pdf,doc,xls, video file,ppt 
etc and can vary in size from MBs to GBs.
I have read that if the file sizes are going bigger than MBs then it is not 
good to use Hbase for storage.

Any Comments

From: Ted Dunning [mailto:tdunn...@maprtech.com]
Sent: Tuesday, November 22, 2011 11:43 AM
To: hdfs-user@hadoop.apache.org
Subject: Re: Version control of files present in HDFS

How big is that?
On Mon, Nov 21, 2011 at 9:26 PM, Stuti Awasthi 
<stutiawas...@hcl.com<mailto:stutiawas...@hcl.com>> wrote:
Hi Ted,
Well in my case document size can be big, which is not good to keep in Hbase. 
So I rule out this option.

Thanks

From: Ted Dunning [mailto:tdunn...@maprtech.com<mailto:tdunn...@maprtech.com>]
Sent: Monday, November 21, 2011 8:57 PM
To: hdfs-user@hadoop.apache.org<mailto:hdfs-user@hadoop.apache.org>
Subject: Re: Version control of files present in HDFS

HDFS is a filesystem that is designed to support map-reduce computation.  As 
such, the semantics differ from what SVN or GIT would want to have.

HBase provides versioned values.  That might suffice for your needs.
On Mon, Nov 21, 2011 at 9:58 AM, Stuti Awasthi 
<stutiawas...@hcl.com<mailto:stutiawas...@hcl.com>> wrote:
Do we have any support from GIT or SVN to attain that ?

-----Original Message-----
From: Harsh J [mailto:ha...@cloudera.com<mailto:ha...@cloudera.com>]
Sent: Monday, November 21, 2011 3:23 PM
To: hdfs-user@hadoop.apache.org<mailto:hdfs-user@hadoop.apache.org>
Subject: Re: Version control of files present in HDFS

HDFS is merely a filesystem today. You'll need to write your own versioning 
layer on top of it if you need such a functionality.

On Mon, Nov 21, 2011 at 2:52 PM, Stuti Awasthi 
<stutiawas...@hcl.com<mailto:stutiawas...@hcl.com>> wrote:
> Hi All,
> I want to provide version control for documents which I keep in HDFS. What I 
> can do to attain this. Any ideas ?
>
> Regards,
> Stuti Awasthi
>
>
> ::DISCLAIMER::
> ----------------------------------------------------------------------
> -------------------------------------------------
>
> The contents of this e-mail and any attachment(s) are confidential and 
> intended for the named recipient(s) only.
> It shall not attach any liability on the originator or HCL or its
> affiliates. Any views or opinions presented in this email are solely those of 
> the author and may not necessarily reflect the opinions of HCL or its 
> affiliates.
> Any form of reproduction, dissemination, copying, disclosure,
> modification, distribution and / or publication of this message
> without the prior written consent of the author of this e-mail is
> strictly prohibited. If you have received this email in error please delete 
> it and notify the sender immediately. Before opening any mail and attachments 
> please check them for viruses and defect.
>
> ----------------------------------------------------------------------
> -------------------------------------------------
>



--
Harsh J


Reply via email to