Regarding RegExp support in ColumnFilter

2008-11-16 Thread Balamurugan Rajendran
Dear, From the API docs I understand that to filter on a RegExp of the RowKey we can use RegExpRowFilter. Is there any filter available that does a RegExp comparison on a ColumnValue? I think the existing ColumnValue filter does not support it. Otherwise, do we need to use MapReduce to achieve

Re: How to detect when the mapper is called the last time?

2008-11-16 Thread Thibaut_
Thanks St.Ack, That's exactly what I needed :-). I will modify the MultithreadedMapRunner class and the Mapper interface to add setup/teardown logic. Thanks, Thibaut

RE: Few questions about map reduce in Hbase

2008-11-16 Thread Jonathan Gray
You get two sorted maps in HBase. The first is row key, the second is columns within each family. Beyond those two "indexes" you would have to build something separately. Others have done things internally, but I have no experience with that. For our uses, part of which is extensive merging and

Re: Few questions about map reduce in Hbase

2008-11-16 Thread Nishant Khurana
Hi Jonathan, Thanks for your reply. That made things a lot clearer to me. But there are more questions :). -- What is the best way to build an index over a field in Hbase? Do I have to build it in a custom way and store it on HDFS? If I have a query (not on HQL) like selection over 2 fields out of whi

Re: How to detect when the mapper is called the last time?

2008-11-16 Thread Michael Stack
Thibaut_ wrote: Hi, As each row of my hbase table can take a lot of time to process (waiting on answers from other hosts), I would like to create a few threads to process that data in parallel. I would then use the last call to the map function to wait for all threads to finish their job and

RE: Few questions about map reduce in Hbase

2008-11-16 Thread Jonathan Gray
> Hi, I am new to Hadoop and Hbase. I am trying to understand how to use map reduce with Hbase as source and sink and have the following questions. I would appreciate it if someone can answer them and maybe point me to some sample code: -- As far as I understood, the tables get stored in dif

How to detect when the mapper is called the last time?

2008-11-16 Thread Thibaut_
Hi, As each row of my hbase table can take a lot of time to process (waiting on answers from other hosts), I would like to create a few threads to process that data in parallel. I would then use the last call to the map function to wait for all threads to finish their job and only return the la

Few questions about map reduce in Hbase

2008-11-16 Thread Nishant Khurana
Hi, I am new to Hadoop and Hbase. I am trying to understand how to use map reduce with Hbase as source and sink and have the following questions. I would appreciate it if someone can answer them and maybe point me to some sample code: -- As far as I understood, the tables get stored in different regions i

Re: create table is time consuming

2008-11-16 Thread Przemek
Michael Stack wrote: > Things take a while usually because messaging is done on a period; most of the elapsed time is just waiting on the period to pass. See in src/test where we have an hbase-site.xml with different config. from hbase defaults. Most of the config. here are tunings to make