Re: Using HBase in standalone mode in production

2014-07-08 Thread lars hofhansl
@hbase.apache.org Sent: Tuesday, July 8, 2014 12:55 AM Subject: Using HBase in standalone mode in production Hi all, So this question might be stupid, retarded even, but it has been bugging me for a while and I cannot think of a better place to ask this. I am really impressed with the way HBase

Re: Using HBase in standalone mode in production

2014-07-07 Thread Ted Yu
ze with flexibility of growing and shrinking. > > > Artem Ervits > Data Analyst > New York Presbyterian Hospital > > - Original Message - > From: Arun Allamsetty [mailto:arun.allamse...@gmail.com] > Sent: Monday, July 07, 2014 08:37 PM > To: user@hbase.apache.o

Re: Using HBase in standalone mode in production

2014-07-07 Thread Artem Ervits
Message - From: Arun Allamsetty [mailto:arun.allamse...@gmail.com] Sent: Monday, July 07, 2014 08:37 PM To: user@hbase.apache.org Subject: Re: Using HBase in standalone mode in production I have never tried MySQL's blob or varbinary. I guess I can look into that. Thanks for answering my ques

Re: Using HBase in standalone mode in production

2014-07-07 Thread Arun Allamsetty
I have never tried MySQL's blob or varbinary. I guess I can look into that. Thanks for answering my questions. Arun On Jul 7, 2014 6:22 PM, "Dima Spivak" wrote: > Does MySQL's BLOB or VARBINARY satisfy your use case? > > As for converting a pseudo-distributed cluster to a distributed one, unless

Re: Using HBase in standalone mode in production

2014-07-07 Thread Dima Spivak
Does MySQL's BLOB or VARBINARY satisfy your use case? As for converting a pseudo-distributed cluster to a distributed one, unless I'm mistaken, you should have no problem doing so. HDFS is quite good with scaling, whether it's from 10 machines to 20 or 1 to 10 and I don't know of any reason that H

Re: Using HBase in standalone mode in production

2014-07-07 Thread Arun Allamsetty
I understand. But for example, my use case is where even if I don't have a lot of data, what if I would rather store serialized objects. For this traditional RDBMS are not suitable. If I can forego the fail safe capabilities, then what is a good choice (if not HBase). Also, on a different note, if

Re: Using HBase in standalone mode in production

2014-07-07 Thread Dima Spivak
In general, production systems run in distributed mode because they leverage HBase's scalability and reliability; HBase really only shows its worth when it's charged with managing terabytes of data on a fault-tolerant file system like HDFS. You lose both of these when you run in standalone mode, so

Re: Using HBase in standalone mode in production

2014-07-07 Thread Arun Allamsetty
Hi Ted, I have. So the book says there are two types of distributed modes. One is pseudo distributed, which is used when we want to test HBase's distributed capabilities using a single machine. As far as I understood, this is just to verify the use cases and the requirements. Then we have the full

Re: Using HBase in standalone mode in production

2014-07-07 Thread Ted Yu
Have you read http://hbase.apache.org/book.html#standalone_dist ? Cheers On Mon, Jul 7, 2014 at 3:55 PM, Arun Allamsetty wrote: > Hi all, > > So this question might be stupid, retarded even, but it has been bugging me > for a while and I cannot think of a better place to ask this. I am really

Using HBase in standalone mode in production

2014-07-07 Thread Arun Allamsetty
Hi all, So this question might be stupid, retarded even, but it has been bugging me for a while and I cannot think of a better place to ask this. I am really impressed with the way HBase works (as a key-value store). Since it stores everything as a byte array, I find it really convenient to store