Yep...I went with the old design...loop through the date and perform
SubstringComparator for each date. The performance seems good. Thus we will
go with this solution for now.
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/How-to-perform-fuzzyRowFilter-with-range-sear
Yes, RegexStringComparator only valid with EQUAl or NOT_EQUAL, but I can
regex given the start date and end date. It will be something like:
"[0-9]|[0-9]|[0-9]|2014-0[7-8]-[07-31]T??:??:??.???-04:00" I still need to
figure out how to write the regular expression in RegexStringComparator.
Will thi
hmmm...just read more on RegexStringComparator...looks like this is the best
option for me now. Let me play around with this api...
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/How-to-perform-fuzzyRowFilter-with-range-search-tp4061159p4061168.html
Sent from the HBas
SubstringComparator does not have Greater_OR_EQUAL and LESS_OR_EQUAL for
CompareOp, I think I need that for date range (greater than the start date
and less than the end date).
Right now we are using HBase's UI, maybe in future we can create one above
HBase's UI.
Thank you!
--
View this messag
This one does not perform well on date range...There are records out of the
date range been returned. Is there a BinaryPostfixComparator so I can go
from backward?
Thank you!
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/How-to-perform-fuzzyRowFilter-with-range-sear
I implemented a range search on rowFilter for given customer id + timestamp.
That one works fine. Just come out of my mind...should I use 0 to
9 as customer id then plus the timestamp in rowFilter with CompareOp
and BinaryComparator? so the start key will be
02014-07-07T00:0
Searched on cloudera, looks like they only support up to 0.94.16.
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/How-to-perform-fuzzyRowFilter-with-range-search-tp4061159p4061160.html
Sent from the HBase User mailing list archive at Nabble.com.
Hi All,
I tried to find apis for fuzzyRowFilter with range search but did not find
any. Here is what I want to achieve: show the row key in a certain date
range. Our row key design is like customer id + timestamp. Example is:
1234567892014-07-09T00:11:22.333-04:00. Input should be the start date a
Never mind. I removed the synchronized block from the code.
Thanks to every one!
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/Discuss-HBase-with-multiple-threads-tp4060081p4060235.html
Sent from the HBase User mailing list archive at Nabble.com.
I run it without the block and seems it is still working. Will this be an
issue if the code go with the synchronized block other than latency? Do I
have to remove it?
Thank you!
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/Discuss-HBase-with-multiple-threads-tp4060
We removed the setFlush(false) statement so the flush time is determined by
HTable. We just put the Put creation part and table.put(p) inside the
synchronized block.
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/Discuss-HBase-with-multiple-threads-tp4060081p4060126.h
It looks like we resolved this issue by giving each thread their own
connection. Thanks to everyone.
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/Discuss-HBase-with-multiple-threads-tp4060081p4060116.html
Sent from the HBase User mailing list archive at Nabble.com.
Yes, I set the autoflush to false. At first I tried to use table.put to add
rows and at last do one flush. But in that situation, only one row is added
into HTable. Then I changed to flush every time putting a row.
The message drop situation happens in this scenario: I have 4 threads
running, each
One more information:
HBase is not pre-splitting, will that impact on multithreading?
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/Discuss-HBase-with-multiple-threads-tp4060081p4060113.html
Sent from the HBase User mailing list archive at Nabble.com.
Hi Jonathan,
I followed your suggestion to modify it to HConnection, it runs faster but
still have message dropping issue. I used a synchronized block to protect
table.flushCommits. It looks like when one thread is doing flushCommits
while another thread is doing it as well, then one of their mess
How did it work?
My design is like this:
each thread will call the write method, in the write method, the first
statement is creating the table like:
table = connection.getTable(this.tableName);
then I set auto flush to false
then I put the table.put statement in a synchronized block like this:
This is the version 0.94.15-cdh4.6.0. I am using api on 0.94.6.
Yeah...I know this is not thread safe, then I added the synchronized block
around table.flushCommits().
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/Discuss-HBase-with-multiple-threads-tp4060081p406008
Hi HBase users,
I try to implement multiple threads for HBase writing process, but looks
like the table.flushCommites() breaks thread easily. Then I add a
synchronized blocker for it, it flows but when I publish 50 messages, about
2 of them are missing...I am using HTablePool as well, so when ever
Besides, if I add hbase-site.xml, and core-site.xml into resources folder, it
works great.
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/How-to-override-the-hbase-site-xml-tp4059326p4059328.html
Sent from the HBase User mailing list archive at Nabble.com.
I think I got it, I am using addResource(InputStream, String) instead of
addResource(String).
--
View this message in context:
http://apache-hbase.679495.n3.nabble.com/How-to-override-the-hbase-site-xml-tp4059326p4059329.html
Sent from the HBase User mailing list archive at Nabble.com.
Hi there,
I am trying to use an hbase-site.xml from other directory instead from class
path. But the the configuration.addResource() does not work. For example, I
have an hbase-site.xml sitting at catalina_home/hbase/hbase-site.xml, I used
configuration.addResource("{catalina.base}/hbase/hbase-sit
These are the resources in configuration:
[hbase-default.xml, hbase-site.xml, /tomcat/hadoop/hbase-site.xml,
/tomcat/hadoop/hdfs-site.xml, /tomcat/hadoop/core-site.xml]
Is there a way I can remove resources? Or How can I override hbase-site.xml?
Any helps are appreciated!
--
View this message
22 matches
Mail list logo