Re: How to perform fuzzyRowFilter with range search

2014-07-09 Thread Hotec04
Yep...I went with the old design: loop through the dates and apply a SubstringComparator for each date. The performance seems good, so we will go with this solution for now. -- View this message in context: http://apache-hbase.679495.n3.nabble.com/How-to-perform-fuzzyRowFilter-with-range-sear
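
A minimal sketch of the "loop through the dates" idea, in plain Java so it runs without a cluster. The class and method names are hypothetical, and the `contains` check is only a client-side stand-in for what a per-date SubstringComparator would match on the server; it assumes row keys embed the date as yyyy-MM-dd.

```java
import java.time.LocalDate;
import java.util.ArrayList;
import java.util.List;

public class DateLoopSketch {
    // Enumerate every day in [start, end] as a yyyy-MM-dd string.
    public static List<String> daysBetween(LocalDate start, LocalDate end) {
        List<String> days = new ArrayList<>();
        for (LocalDate d = start; !d.isAfter(end); d = d.plusDays(1)) {
            days.add(d.toString()); // LocalDate.toString() is ISO-8601, e.g. 2014-07-09
        }
        return days;
    }

    // Client-side stand-in for the per-date substring match (assumption:
    // one substring comparison per day, combined with OR).
    public static boolean matchesAnyDay(String rowKey, List<String> days) {
        for (String day : days) {
            if (rowKey.contains(day)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        List<String> days = daysBetween(LocalDate.of(2014, 7, 30), LocalDate.of(2014, 8, 2));
        System.out.println(days);
        System.out.println(matchesAnyDay("1234567892014-07-31T00:11:22.333-04:00", days));
    }
}
```

One comparison per day keeps each match simple, at the cost of more comparisons as the range grows.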

Re: How to perform fuzzyRowFilter with range search

2014-07-09 Thread Hotec04
Yes, RegexStringComparator is only valid with EQUAL or NOT_EQUAL, but I can build a regex from the start date and end date. It will be something like: "[0-9]|[0-9]|[0-9]|2014-0[7-8]-[07-31]T??:??:??.???-04:00" I still need to figure out how to write the regular expression in RegexStringComparator. Will thi
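
A hedged sketch of how such a pattern might be written in Java regex syntax, tested here with java.util.regex only. It assumes the row key is a 9-digit customer id followed by an ISO-8601 timestamp with a -04:00 offset, as in the example key 1234567892014-07-09T00:11:22.333-04:00; the class name and the July to August 2014 range are illustrative. Note that `[07-31]` is a character class rather than a numeric range, and `?` is a quantifier rather than a wildcard, so the day and time fields are spelled out explicitly.

```java
import java.util.regex.Pattern;

public class RowKeyRegexSketch {
    // Assumed layout: 9-digit customer id + yyyy-MM-ddTHH:mm:ss.SSS-04:00.
    // Matches any day 01..31 in July or August 2014.
    public static final String JULY_AUGUST_2014 =
        "^[0-9]{9}2014-0[78]-(0[1-9]|[12][0-9]|3[01])"
        + "T[0-9]{2}:[0-9]{2}:[0-9]{2}\\.[0-9]{3}-04:00$";

    private static final Pattern PATTERN = Pattern.compile(JULY_AUGUST_2014);

    public static boolean matches(String rowKey) {
        return PATTERN.matcher(rowKey).matches();
    }

    public static void main(String[] args) {
        System.out.println(matches("1234567892014-07-09T00:11:22.333-04:00")); // true
        System.out.println(matches("1234567892014-06-30T23:59:59.999-04:00")); // false
    }
}
```

The same pattern string could then be passed to the RegexStringComparator constructor. A range that starts or ends mid-month would need a narrower day alternation than the full 01..31 used here.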

Re: How to perform fuzzyRowFilter with range search

2014-07-09 Thread Hotec04
hmmm...just read more on RegexStringComparator...looks like this is the best option for me now. Let me play around with this api... -- View this message in context: http://apache-hbase.679495.n3.nabble.com/How-to-perform-fuzzyRowFilter-with-range-search-tp4061159p4061168.html Sent from the HBas

Re: How to perform fuzzyRowFilter with range search

2014-07-09 Thread Hotec04
SubstringComparator does not support GREATER_OR_EQUAL and LESS_OR_EQUAL for CompareOp, and I think I need those for a date range (greater than the start date and less than the end date). Right now we are using HBase's UI; maybe in the future we can create one on top of HBase's UI. Thank you! -- View this messag

Re: How to perform fuzzyRowFilter with range search

2014-07-09 Thread Hotec04
This one does not work well for a date range...Records outside the date range are being returned. Is there a BinaryPostfixComparator so I can match from the end of the key? Thank you! -- View this message in context: http://apache-hbase.679495.n3.nabble.com/How-to-perform-fuzzyRowFilter-with-range-sear

Re: How to perform fuzzyRowFilter with range search

2014-07-09 Thread Hotec04
I implemented a range search with RowFilter for a given customer id + timestamp. That one works fine. This just came to mind...should I use 0 to 9 as the customer id, followed by the timestamp, in RowFilter with CompareOp and BinaryComparator? So the start key will be 02014-07-07T00:0

Re: How to perform fuzzyRowFilter with range search

2014-07-09 Thread Hotec04
I searched on Cloudera; it looks like they only support up to 0.94.16. -- View this message in context: http://apache-hbase.679495.n3.nabble.com/How-to-perform-fuzzyRowFilter-with-range-search-tp4061159p4061160.html Sent from the HBase User mailing list archive at Nabble.com.

How to perform fuzzyRowFilter with range search

2014-07-09 Thread Hotec04
Hi All, I tried to find APIs for fuzzyRowFilter with range search but did not find any. Here is what I want to achieve: show the row keys in a certain date range. Our row key design is customer id + timestamp, for example: 1234567892014-07-09T00:11:22.333-04:00. Input should be the start date a
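
To make the goal concrete, here is a small client-side sketch of the date-range test on such a row key. It is plain Java, not an HBase filter. The 9-character id width is an assumption read off the example key, and the lexicographic comparison only orders timestamps correctly when every key uses the same fixed-width ISO-8601 format and the same UTC offset. All names are hypothetical.

```java
public class RowKeyRangeSketch {
    // Assumption: the customer id is a fixed 9 characters, as in the example key.
    public static final int CUSTOMER_ID_LENGTH = 9;

    // True when the timestamp part of the key falls in [startInclusive, endExclusive).
    // Bounds are date or timestamp prefixes such as "2014-07-07".
    public static boolean inRange(String rowKey, String startInclusive, String endExclusive) {
        String timestamp = rowKey.substring(CUSTOMER_ID_LENGTH);
        return timestamp.compareTo(startInclusive) >= 0
            && timestamp.compareTo(endExclusive) < 0;
    }

    public static void main(String[] args) {
        String key = "1234567892014-07-09T00:11:22.333-04:00";
        System.out.println(inRange(key, "2014-07-07", "2014-07-10")); // true
        System.out.println(inRange(key, "2014-07-10", "2014-07-12")); // false
    }
}
```

Because the customer id comes first in the key, a single start/stop row range cannot express this test across all customers, which is why the thread looks at filters instead.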

Re: [Discuss] HBase with multiple threads

2014-06-09 Thread Hotec04
Never mind. I removed the synchronized block from the code. Thanks to everyone! -- View this message in context: http://apache-hbase.679495.n3.nabble.com/Discuss-HBase-with-multiple-threads-tp4060081p4060235.html Sent from the HBase User mailing list archive at Nabble.com.

Re: [Discuss] HBase with multiple threads

2014-06-09 Thread Hotec04
I ran it without the block and it seems to still work. Other than latency, will it be an issue if the code keeps the synchronized block? Do I have to remove it? Thank you! -- View this message in context: http://apache-hbase.679495.n3.nabble.com/Discuss-HBase-with-multiple-threads-tp4060

Re: [Discuss] HBase with multiple threads

2014-06-08 Thread Hotec04
We removed the setAutoFlush(false) statement so the flush time is determined by HTable. We just put the Put creation part and table.put(p) inside the synchronized block. -- View this message in context: http://apache-hbase.679495.n3.nabble.com/Discuss-HBase-with-multiple-threads-tp4060081p4060126.h

Re: [Discuss] HBase with multiple threads

2014-06-07 Thread Hotec04
It looks like we resolved this issue by giving each thread its own connection. Thanks to everyone. -- View this message in context: http://apache-hbase.679495.n3.nabble.com/Discuss-HBase-with-multiple-threads-tp4060081p4060116.html Sent from the HBase User mailing list archive at Nabble.com.

Re: [Discuss] HBase with multiple threads

2014-06-07 Thread Hotec04
Yes, I set autoflush to false. At first I tried to use table.put to add rows and do one flush at the end. But in that situation, only one row was added to the HTable. Then I changed it to flush after every put. The message drop happens in this scenario: I have 4 threads running, each

Re: [Discuss] HBase with multiple threads

2014-06-07 Thread Hotec04
One more piece of information: HBase is not pre-splitting. Will that have an impact on multithreading? -- View this message in context: http://apache-hbase.679495.n3.nabble.com/Discuss-HBase-with-multiple-threads-tp4060081p4060113.html Sent from the HBase User mailing list archive at Nabble.com.

Re: [Discuss] HBase with multiple threads

2014-06-06 Thread Hotec04
Hi Jonathan, I followed your suggestion and changed it to HConnection. It runs faster but still has the message dropping issue. I used a synchronized block to protect table.flushCommits. It looks like when one thread is doing flushCommits while another thread is doing it as well, then one of their mess

Re: [Discuss] HBase with multiple threads

2014-06-06 Thread Hotec04
How did it work? My design is like this: each thread calls the write method. In the write method, the first statement gets the table, like: table = connection.getTable(this.tableName); then I set auto flush to false, and then I put the table.put statement in a synchronized block like this:

Re: [Discuss] HBase with multiple threads

2014-06-06 Thread Hotec04
The version is 0.94.15-cdh4.6.0. I am using the 0.94.6 API. Yeah...I know this is not thread safe, so I added the synchronized block around table.flushCommits(). -- View this message in context: http://apache-hbase.679495.n3.nabble.com/Discuss-HBase-with-multiple-threads-tp4060081p406008

[Discuss] HBase with multiple threads

2014-06-06 Thread Hotec04
Hi HBase users, I am trying to implement multiple threads for the HBase writing process, but it looks like table.flushCommits() breaks easily under multiple threads. I then added a synchronized block for it. It runs, but when I publish 50 messages, about 2 of them are missing...I am using HTablePool as well, so whenever

Re: How to override the hbase-site.xml

2014-05-19 Thread Hotec04
Besides, if I add hbase-site.xml and core-site.xml to the resources folder, it works great. -- View this message in context: http://apache-hbase.679495.n3.nabble.com/How-to-override-the-hbase-site-xml-tp4059326p4059328.html Sent from the HBase User mailing list archive at Nabble.com.

Re: How to override the hbase-site.xml

2014-05-19 Thread Hotec04
I think I got it, I am using addResource(InputStream, String) instead of addResource(String). -- View this message in context: http://apache-hbase.679495.n3.nabble.com/How-to-override-the-hbase-site-xml-tp4059326p4059329.html Sent from the HBase User mailing list archive at Nabble.com.

How to override the hbase-site.xml

2014-05-19 Thread Hotec04
Hi there, I am trying to use an hbase-site.xml from another directory instead of from the class path. But configuration.addResource() does not work. For example, I have an hbase-site.xml sitting at catalina_home/hbase/hbase-site.xml, I used configuration.addResource("{catalina.base}/hbase/hbase-sit

Re: How to override the hbase-site.xml

2014-05-19 Thread Hotec04
These are the resources in the configuration: [hbase-default.xml, hbase-site.xml, /tomcat/hadoop/hbase-site.xml, /tomcat/hadoop/hdfs-site.xml, /tomcat/hadoop/core-site.xml] Is there a way I can remove resources? Or how can I override hbase-site.xml? Any help is appreciated! -- View this message