HBase Stack

2011-11-13 Thread Em
Hello list, I was asked whether it is a good idea to replace the M in LAMP with Hbase as well as the P with Java-Servlet (i.e. Tomcat) so that you run your webserver, your hbase-instance, hadoop etc. on the same machine. Are the differences compared to a LAMP-Stack in terms of performance large?

RE: MR - Input from Hbase output to HDFS

2011-11-13 Thread Stuti Awasthi
Hi, I think that issue is with Filesystem Configuration, as in config, it is picking HbaseConfiguration. When I modified my output directory path to absolute path of HDFS : FileOutputFormat.setOutputPath(job, new Path("hdfs://master:54310/MR/stuti3")); The MR jobs runs successfully and I am able

RE: MR - Input from Hbase output to HDFS

2011-11-13 Thread Stuti Awasthi
Hi Joey, Thanks for pointing this. After importing "FileOutputFormat" as you suggested, I am able to run MR job from eclipse (Windows) the only problem is I am not able to see the output directory this code is creating. HDFS and HBase are on Linux machine. Code : Configuration c

Re: Web analytics and HBase

2011-11-13 Thread inder . pall
A note - storing time series data in hbase can cause hot spots and splits...have you looked at opentsdb Inder Sent from BlackBerry® on Airtel -Original Message- From: Date: Sun, 13 Nov 2011 23:39:51 To: Reply-To: user@hbase.apache.org Subject: Re: Web analytics and HBase On Nov 13,

Re: Web analytics and HBase

2011-11-13 Thread victor.hong
On Nov 13, 2011, at 6:11 PM, ext Samuel García Martínez wrote: > Hi everyone, i had a question about HBase. > > * Background: > I'm working on analytics project and, so far, we are using MySQL as DBMS > and Hadoop for data processing and aggregation. By now, we collect data > analytics over HTTP

Web analytics and HBase

2011-11-13 Thread Samuel García Martínez
Hi everyone, i had a question about HBase. * Background: I'm working on analytics project and, so far, we are using MySQL as DBMS and Hadoop for data processing and aggregation. By now, we collect data analytics over HTTP and pushes to Hadoop. Every day (in fact, every night :P) we run Hadoop jobs

Re: HBase cluster on heterogeneous filesystems

2011-11-13 Thread Fuad Efendi
Hi Otis, I had super ugly experience with Amazon EC2 virtual nodes and I even found bug reports related to Ubuntu... Problems with unpredictable "wall time" when everything stops and ZooKeeper sessions expire... I don't have any problems with dedicated servers and CentoOS. It is super risky t

Re: Row get very slow

2011-11-13 Thread Stack
On Sun, Nov 13, 2011 at 7:13 AM, Arvind Jayaprakash wrote: > A common confusion is b/w MAX_FILESIZE and BLOCKSIZE. Given that > MAX_FILESIZE is not listed on :60010/master.jsp, one tends to assume > BLOCKSIZE represents that value. > We should fix that. What would you like to see Arind? St.Ack

Re: HMaster daemon is disappearing immidietly after i ran start-hbase.sh

2011-11-13 Thread Stack
On Fri, Nov 11, 2011 at 10:23 PM, Vamshi Krishna wrote: > Hi Anil, thank you very much for ur help. It worked out well to resolve the > problem. I changed the port in /etc/zookeeper/zoo.cfg . Later i got client > protocol mismatch, so i replaced the hadoop jar in HBase_Home/lib. Now its > fine. Hb

Re: Row get very slow

2011-11-13 Thread Arvind Jayaprakash
A common confusion is b/w MAX_FILESIZE and BLOCKSIZE. Given that MAX_FILESIZE is not listed on :60010/master.jsp, one tends to assume BLOCKSIZE represents that value. On Nov 10, lars hofhansl wrote: >"BLOCKSIZE => '536870912'" > > >You set your blocksize to 512mb? The default is 64k (65536), try t