The performance you're seeing is definitely not typical. 'couple of further questions: - How large are your KVs (columns)?- Do you delete data? Do you run major compactions? - Can you measure: CPU, IO, context switches, etc, during the scanning? - Do you have many versions of the columns?
Note that HBase is a key value store, i.e. the storage is sparse. Each column is represented by its own key value pair, and HBase has to do the work to reassemble the data. -- Lars ________________________________ From: Vimal Jain <vkj...@gmail.com> To: user@hbase.apache.org Sent: Monday, July 1, 2013 4:44 AM Subject: Re: How many column families in one table ? Hi, We had some hardware constraints along with the fact that our total data size was in GBs. Thats why to start with Hbase , we first began with pseudo distributed mode and thought if required we would upgrade to fully distributed mode. On Mon, Jul 1, 2013 at 5:09 PM, Ted Yu <yuzhih...@gmail.com> wrote: > bq. I have configured Hbase in pseudo distributed mode on top of HDFS. > > What was the reason for using pseudo distributed mode in production setup ? > > Cheers > > On Mon, Jul 1, 2013 at 1:44 AM, Vimal Jain <vkj...@gmail.com> wrote: > > > Thanks Dhaval/Michael/Ted/Otis for your replies. > > Actually , i asked this question because i am seeing some performance > > degradation in my production Hbase setup. > > I have configured Hbase in pseudo distributed mode on top of HDFS. > > I have created 17 Column families :( . I am actually using 14 out of > these > > 17 column families. > > Each column family has around on average 8-10 column qualifiers so total > > around 140 columns are there for each row key. > > I have around 1.6 millions rows in the table. > > To completely scan the table for all 140 columns , it takes around 30-40 > > minutes. > > Is it normal or Should i redesign my table schema ( probably merging 4-5 > > column families into one , so that at the end i have just 3-4 cf ) ? > > > > > > > > On Sat, Jun 29, 2013 at 12:06 AM, Otis Gospodnetic < > > otis.gospodne...@gmail.com> wrote: > > > > > Hm, works for me - > > > > > > > > > http://search-hadoop.com/m/qOx8l15Z1q42/column+families+fb&subj=Re+HBase+Column+Family+Limit+Reasoning > > > > > > Shorter version: http://search-hadoop.com/m/qOx8l15Z1q42 > > > > > > Otis > > > -- > > > Solr & ElasticSearch Support -- http://sematext.com/ > > > Performance Monitoring -- http://sematext.com/spm > > > > > > > > > > > > On Fri, Jun 28, 2013 at 8:40 AM, Vimal Jain <vkj...@gmail.com> wrote: > > > > Hi All , > > > > Thanks for your replies. > > > > > > > > Ted, > > > > Thanks for the link, but its not working . :( > > > > > > > > > > > > On Fri, Jun 28, 2013 at 5:57 PM, Ted Yu <yuzhih...@gmail.com> wrote: > > > > > > > >> Vimal: > > > >> Please also refer to: > > > >> > > > >> > > > > > > http://search-hadoop.com/m/qOx8l15Z1q42/column+families+fb&subj=Re+HBase+Column+Family+Limit+Reasoning > > > >> > > > >> On Fri, Jun 28, 2013 at 1:37 PM, Michel Segel < > > > michael_se...@hotmail.com > > > >> >wrote: > > > >> > > > >> > Short answer... As few as possible. > > > >> > > > > >> > 14 CF doesn't make too much sense. > > > >> > > > > >> > Sent from a remote device. Please excuse any typos... > > > >> > > > > >> > Mike Segel > > > >> > > > > >> > On Jun 28, 2013, at 12:20 AM, Vimal Jain <vkj...@gmail.com> > wrote: > > > >> > > > > >> > > Hi, > > > >> > > How many column families should be there in an hbase table ? Is > > > there > > > >> any > > > >> > > performance issue in read/write if we have more column families > ? > > > >> > > I have designed one table with around 14 column families in it > > with > > > >> each > > > >> > > having on average 6 qualifiers. > > > >> > > Is it a good design ? > > > >> > > > > > >> > > -- > > > >> > > Thanks and Regards, > > > >> > > Vimal Jain > > > >> > > > > >> > > > > > > > > > > > > > > > > -- > > > > Thanks and Regards, > > > > Vimal Jain > > > > > > > > > > > -- > > Thanks and Regards, > > Vimal Jain > > > -- Thanks and Regards, Vimal Jain