passing a parameter to an observer coprocessor

2013-08-22 Thread Wei Tan
Hi all, I want to add some dynamic behavior to my observer cp, say: postPut() { if (flag) { function1(); } else { function2(); } } Is there a way to dynamically change the value of flag? One feasible approach is to change a value in the table descriptor, but then I need to restart the tab

Re: passing a parameter to an observer coprocessor

2013-08-22 Thread Wei Tan
RE: passing a parameter to an observer coprocessor
>> Add fake cf + column to your Put operation
>> Put put = new Put(row);
>> put.add("COMMAND".getBytes(), "flag".getBytes(), "true".getBytes());
>> Be

RE: passing a parameter to an observer coprocessor

2013-08-22 Thread Wei Tan
We would like to avoid such interruption. A global hashmap storing such a setting would be more desirable. Thanks, Wei - Wei Tan, PhD Research Staff Member IBM T. J. Watson Research Center http://researcher.ibm.com/person/us-wtan From: Vladimir Rodionov To
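
A minimal sketch of the "global hashmap" idea mentioned above, in plain Java with no HBase dependency. The names ObserverSettings, setFlag, isSet, and onPostPut are hypothetical and not part of any HBase API; in a real RegionObserver the branch would sit inside postPut(), and how the map gets updated at runtime is left open here.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

// Hypothetical holder for runtime-tunable observer settings (not an HBase class).
final class ObserverSettings {
    // Static, so shared by every caller that loads this class through the same class loader.
    private static final ConcurrentMap<String, Boolean> FLAGS = new ConcurrentHashMap<>();

    private ObserverSettings() {}

    // Changes a setting without touching the table descriptor.
    static void setFlag(String name, boolean value) {
        FLAGS.put(name, value);
    }

    // Unset flags read as false.
    static boolean isSet(String name) {
        return FLAGS.getOrDefault(name, false);
    }

    // Stand-in for the body of postPut(): picks a branch from the current flag value.
    static String onPostPut() {
        return isSet("flag") ? "function1" : "function2";
    }
}
```

Calling ObserverSettings.setFlag("flag", true) switches the branch on the next call. The setting lives in memory only, so it is lost when the process restarts and is not shared across region servers.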

coprocessor status query

2013-10-21 Thread Wei Tan
Hello, I want to check the status of each coprocessor, in a given table. Let's say I have 3 CPs and one of them is removed due to some unhandled exception, so I want to see this status (3 deployed, 2 currently alive). I found this from http://blogs.apache.org/hbase/entry/coprocessor_introductio

Re: coprocessor status query

2013-10-21 Thread Wei Tan
Hi Gary, thanks! It seems that the removal of a region observer is per region and NOT per coprocessor. So do I have to query each region to get the per-region health status? Or is there a table-level API telling me something like: I have 10 regions and an observer has been removed in 2

Re: coprocessor status query

2013-10-22 Thread Wei Tan
Hi Gary, thanks for your clarification and yes, I totally agree with your statement. The class is not removed, but the CP is effectively removed and no longer active after an unhandled exception. I will take a look at the JIRA you mentioned. Best regards, Wei From: Gary Helmling To: user@hbase.

Re: How can I insert large image or video into HBase?

2013-10-23 Thread Wei Tan
Roman, thanks for sharing your experience. Is your approach somewhat similar to Facebook's image store, Haystack? I am very interested in knowing your use case, and what you actually mean by class abstraction, internal write buffer, etc. Best regards, Wei From: Roman Nikitchenko To:

connect to the region from coprocessor

2012-10-09 Thread Wei Tan
Hi, In my prePut coprocessor I would like to get the old value of the row being written. Now I am creating an HTable instance and using the get interface; functionally it works fine. Given that the row is physically in the same region as the cp, is there a more lightweight way to do that? Thanks!

RE: connect to the region from coprocessor

2012-10-11 Thread Wei Tan
anted overhead. -Anoop- From: Wei Tan [w...@us.ibm.com] Sent: Wednesday, October 10, 2012 2:05 AM To: hbase-u...@hadoop.apache.org Subject: connect to the region from coprocessor Hi, In my prePut coprocessor I would like to get the old value of the row being written. Now I am creatin

readRequestsCount: 2 in one read

2012-10-16 Thread Wei Tan
Hi, I am monitoring the readRequestsCount shown in the "Requests" column in the web GUI of a server/region. I observe that, while a put corresponds to ONE write request, a get corresponds to 2 readRequestsCount. Is that true, and is there a reason for that? I got the same number in a table with

RE: Retrieve Put timestamp

2012-11-13 Thread Wei Tan
I wonder if there is any follow-up on this issue, i.e., whether a put can return the timestamp of the record? Thanks! Best Regards, Wei From: Wei Tan/Watson/IBM To: user@hbase.apache.org, Date: 08/02/2012 12:37 PM Subject: RE: Retrieve Put timestamp +1. So far I think timestamp is

RE: Hbase MapReduce

2012-11-24 Thread Wei Tan
Actually, coprocessors can be used to implement MR-like functions without using the Hadoop framework. Best Regards, Wei From: Dalia Sobhy To: "user@hbase.apach

Measure server time of Get/Scan - through RPC logging?

2012-12-02 Thread Wei Tan
Hi, I am using YCSB to measure the performance of some complex Get/Scan operations, i.e., with some filters. I found that I got unusually high latency from the YCSB client, but both the HBase server and the YCSB client are NOT saturated (in terms of network, I/O, CPU). So I want to measure the server time a single Get/Scan

Re: Measure server time of Get/Scan - through RPC logging?

2012-12-08 Thread Wei Tan
Hmm... I would appreciate it if anybody could point me to how to measure server-side Get/Scan time... Thanks! Best Regards, Wei From: Wei Tan/Watson/IBM To: hbase-u...@hadoop.apache.org, Date: 12/02/2012 12:21 PM Subject: Measure server time of Get/Scan - through RPC logging? Hi I am

Re: Coprocessor / threading model

2013-01-15 Thread Wei Tan
Andrew, could you explain more about why doing cross-table operations is an anti-pattern when using CPs? Durability might be an issue, as far as I understand. Thanks, Best Regards, Wei From: Andrew Purtell To: "user@hbase.apache.org", Date: 01/12/2013 09:39 PM Subject: Re: Coprocess

Re: Coprocessor / threading model

2013-01-15 Thread Wei Tan
avoid RPC. Otherwise, if you can tolerate inconsistency but not latency, you put RPCs in a queue and process them in the background. By this means you can have a "global" index with some lag. Best Regards, Wei
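
The queue-and-background-processing idea above can be sketched in plain Java as follows. This is an illustration under stated assumptions, not code from the thread: LaggedIndexUpdater and its methods are hypothetical names, the "remote index" is simulated by an in-memory list, and a real coprocessor would issue the remote put where the comment indicates.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.TimeUnit;

// Hypothetical sketch: the write path only enqueues; a background thread applies
// the (simulated) remote index update later, so the index lags behind the data.
final class LaggedIndexUpdater {
    private final BlockingQueue<String> pending = new LinkedBlockingQueue<>();
    private final List<String> applied = new ArrayList<>(); // stands in for the remote index table
    private volatile boolean running = true;
    private final Thread worker;

    LaggedIndexUpdater() {
        worker = new Thread(this::drain, "index-updater");
        worker.setDaemon(true);
        worker.start();
    }

    // Called on the write path: returns immediately, no RPC is made here.
    void enqueue(String indexEntry) {
        pending.add(indexEntry);
    }

    // Background loop: keeps going until shutdown is requested and the queue is empty.
    private void drain() {
        try {
            while (running || !pending.isEmpty()) {
                String entry = pending.poll(50, TimeUnit.MILLISECONDS);
                if (entry != null) {
                    synchronized (applied) {
                        applied.add(entry); // a real version would issue the remote put here
                    }
                }
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    // Signals the worker to finish, waits for it to drain the queue, and returns what was applied.
    List<String> shutdownAndGet() {
        running = false;
        try {
            worker.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        synchronized (applied) {
            return new ArrayList<>(applied);
        }
    }
}
```

The trade-off is the one named in the message: writes stay fast, but the index is only eventually up to date, and entries still sitting in this in-memory queue are lost if the process dies.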

Which version has FuzzyRowFilter?

2013-01-23 Thread Wei Tan
Hi, I did a quick lookup and found these two threads related to FuzzyRowFilter status: https://issues.apache.org/jira/browse/HBASE-7628 https://issues.apache.org/jira/browse/HBASE-6509 It seems that FuzzyRowFilter will be available in 0.96 and there is a plan to port it to 0.94.5. So that is th

copy table AND descriptor

2013-01-29 Thread Wei Tan
Hi, I have a table in cluster 1 with quite lengthy metadata (i.e., the descriptor). How can I replicate the data as well as the descriptor to another table? CopyTable and Export/Import do not handle the table descriptor. Thanks, Wei

Re: copy table AND descriptor

2013-01-29 Thread Wei Tan
Hi Ted, Thanks for the reminder. I did it in two steps: 1. write a Java program to create a table using the original table's descriptor. 2. use ./bin/hbase org.apache.hadoop.hbase.mapreduce.CopyTable to copy the data. Best Regards, Wei

maven junit test of a coprocessor

2013-01-30 Thread Wei Tan
Hi, I am writing a Maven JUnit test for an HBase coprocessor. The problem is that I want to write a JUnit test that deploys the cp jar into a cluster and tests its function. However, the test phase runs before install, so I cannot get a cp jar to deploy at that time. Is this like a chicken-and-egg problem? An

Re: Coprocessor HDFS jar hot deployment

2013-01-31 Thread Wei Tan
Thanks. I may try this approach later. Now I am using a remote cluster to test and I have this workaround: 1. mvn install and skip tests. 2. copy the cp to the remote cluster. 3. mvn install with tests. I doubt it is a good approach, but it works. Best Regards, Wei From: Adrien Mogenet To: user@

HBase /lib directory contains any json processing util?

2013-01-31 Thread Wei Tan
We need to parse JSON in a coprocessor, and if the HBase /lib directory contains any JSON processing utility, we can avoid introducing additional jars. Thanks! Best Regards, Wei

Re: coprocessor enabled put very slow, help please~~~

2013-02-17 Thread Wei Tan
Does your checkAndPut involve a local or remote READ? Due to the nature of LSM, a read is much slower than a write... Best Regards, Wei From: Prakash Kadel To: "user@hbase.apache.org", Date: 02/17/2013 07:49 PM Subject: coprocessor enabled put very slow, help please~~~

Re: coprocessor enabled put very slow, help please~~~

2013-02-18 Thread Wei Tan
n a lot of cases, RPC may NOT be the bottleneck. Remember a "local" put also involves RPC -- when writing the WAL to HDFS. Best Regards, Wei From: Prakash Kadel

Re: coprocessor enabled put very slow, help please~~~

2013-02-19 Thread Wei Tan
or data processing can be seen as
>>> "unnecessary overhead".
>>>
>>> Thanks
>>>
>>> yong
>>> On Mon, Feb 18, 2013 at 10:35 AM, Michael Segel wrote:
>>>> Why?

Re: Retrieve Put timestamp

2012-08-01 Thread Wei Tan
We have a similar requirement and here is the solution we have in mind: 1. add a coprocessor; in prePut(), get the current ms and set it on the put (the current implementation gets the current ms and sets it in put()). 2. return the ms generated in prePut() to the client. For now put() does not return any value. we

RE: Retrieve Put timestamp

2012-08-02 Thread Wei Tan
+1. So far I think timestamp is very useful. I would imagine if we can configure the return, say in pre/post put, it would be even nicer. Thanks, Wei From

RE: HBase MapReduce - Using multiple tables as source

2012-08-06 Thread Wei Tan
. Thanks, Best Regards, Wei From: "Amlan Roy" To: , Date: 08/06/2012 09:05 AM Subject: RE: HBase MapReduce - Using multiple tables as so

Re: consistency, availability and partition pattern of HBase

2012-08-07 Thread Wei Tan
oposed by Prof. Eric Brewer... http://en.wikipedia.org/wiki/Eric_Brewer_%28scientist%29 Best Regards, Wei From: Lin Ma To: user@hbase.apache.org, Date:

Re: Coprocessor tests under busy insertions

2012-08-13 Thread Wei Tan
for the chained put (seems to be an index put) to be completed -- this is causing the timeout. Best Regards, Wei From: Henry JunYoung KIM To: user

larger HFile block size for very wide row?

2014-01-29 Thread Wei Tan
Hi, I have an HBase table where each row has ~1000k columns, ~2K each. My table scan pattern is to use a row key filter, but I need to fetch the whole row (~1000k columns) back. Shall I set the HFile block size to be larger than the default 64K? Thanks, Wei

Re: larger HFile block size for very wide row?

2014-01-29 Thread Wei Tan
To be clearer, each KV (cell) is a couple of KB but each row is a couple of MB. If I need to search by row key, but always fetch rows as a whole, shall I use a block size larger than the default 64KB? Thanks, Wei
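
For a rough sense of scale behind this question, here is a back-of-the-envelope calculation in Java. The sizes are assumptions taken from "a couple of KB" per cell and "a couple of MB" per row; key and metadata overhead is ignored, and real HFile blocks are not cut at exactly the configured size, so the results are only approximate. BlockMath and blocksPerRow are hypothetical names.

```java
// Back-of-the-envelope arithmetic only; not an HBase API.
final class BlockMath {
    private BlockMath() {}

    // Number of blocks needed to hold rowBytes when each block holds blockBytes (ceiling division).
    static long blocksPerRow(long rowBytes, long blockBytes) {
        return (rowBytes + blockBytes - 1) / blockBytes;
    }

    // Assumed row size: cellCount cells of cellBytes each, ignoring per-cell key overhead.
    static long rowBytes(long cellCount, long cellBytes) {
        return cellCount * cellBytes;
    }
}
```

With an assumed 1000 cells of 2 KB each, BlockMath.blocksPerRow(BlockMath.rowBytes(1000, 2048), 64 * 1024) gives 32 blocks per row at the default 64 KB, while a hypothetical 1 MB block size gives 2.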

Re: larger HFile block size for very wide row?

2014-01-29 Thread Wei Tan
criteria. Best regards, Wei From: lars hofhansl To: "user@hbase.apache.org", Date: 01/29/2014 03:49 PM Subject: Re: larger H

Re: larger HFile block size for very wide row?

2014-01-29 Thread Wei Tan
Thanks for the help from you all. Best regards, Wei From: Ted Yu To: "user@hbase.apache.org", Date: 01/29/2014 04:37 PM Subject:

Re: Is HBase feasible for storing 4-5 MB of data as cell value

2014-02-26 Thread Wei Tan
I am thinking of storing medium-sized objects (~1 MB) using HBase. The advantage of using HBase rather than HBase (storing pointers) + HDFS, in my mind, is data locality. When I want to run analytics, I will access these objects using HBase scan, and HBase stores KVs in a sequential manner. If I

RE: Is HBase feasible for storing 4-5 MB of data as cell value

2014-02-27 Thread Wei Tan
Image :) Best regards, Wei From: Vladimir Rodionov To: "user@hbase.apache.org", Date: 02/27/2014 01:22 AM Subject: RE: I

mutable index in phoenix

2014-02-27 Thread Wei Tan

Occasional GSSException that brings down region server

2014-03-10 Thread Wei Tan
, Wei

Re: Occasional GSSException that brings down region server

2014-03-11 Thread Wei Tan
ese errors on this page seem to be for "does not work at all" conditions... not a "fails every randomly long amount of time" 3) we don't have this "problematic combination of components" listed... but again - this is a work / no work dichotomy... Thanks, W

Re: Occasional GSSException that brings down region server

2014-03-14 Thread Wei Tan
! Best regards, Wei From: Zesheng Wu To: user@hbase.apache.org, Date: 03/13/2014 09:33 PM Subject: Re: Occasional GSSException that brings down region server Thank you al