Re: ValueFilter returning earlier values

2017-04-20 Thread Ted Yu
The JIRA number is HBASE-17125 On Thu, Apr 20, 2017 at 6:40 AM, Anoop John wrote: > This is because of the way with which filters and versions are checked > in RS. We first do filter op and then apply version. The value filter > might have filtered out latest cell but it

Re: HBase table Size

2017-04-19 Thread Ted Yu
For Apache hadoop, you can use 'hdfs dfs -du' command on the table directory. Not sure of the equivalent for MapR hadoop. Consider polling their mailing list. On Wed, Apr 19, 2017 at 5:20 AM, Chetan Khatri wrote: > Hello Dev, > > How to get HBase Table Size from

Re: Writing to HBase from Spark

2017-04-18 Thread Ted Yu
Not that I know of. On Tue, Apr 18, 2017 at 12:02 PM, Eric Wilson <eric.wil...@aver.io> wrote: > So if I'm using PySpark 2.1 and HBase 1.2.5 there is no way for them to > communicate with each other? > > Eric > > On Tue, Apr 18, 2017 at 2:03 PM, Ted Yu <

Re: Scan returns incomplete results

2017-04-14 Thread Ted Yu
ad. > Thanks > -Kohki > > On Apr 14, 2017, at 12:30 PM, Ted Yu <yuzhih...@gmail.com> wrote: > > Was the trace obtained with 1.3 client ? > > Thanks > > > On Apr 14, 2017, at 11:43 AM, Kohki Nishio <tarop...@icloud.com> wrote: > > > Here I

Re: Scan returns incomplete results

2017-04-14 Thread Ted Yu
but it didn’t fix the problem .. >> >> >>> On Apr 12, 2017, at 8:14 PM, Kohki Nishio <tarop...@icloud.com> wrote: >>> >>> As far as I can see, the change is in ClientScanner class, which is a part >>> of Hbase-client, does this work inside

Re: Can't build hbase with hadoop 2.8.0

2017-04-14 Thread Ted Yu
Please see: https://issues.apache.org/jira/browse/HBASE-17893 > On Apr 14, 2017, at 10:43 AM, Dima Fadeyev wrote: > > Hello, everyone, > > I'm trying to build HBase 1.3.0 with Hadoop 2.8.0. > > mvn clean package assembly:single -DskipTests -Dhadoop-two.version=2.8.0 >

Re: Hbase exportSnapshot fails in HA missing hbase-prefix-tree.jar

2017-04-14 Thread Ted Yu
hdfs://clusterID/root/.m2/repository/org/apache/htrace/htrace-core/3.1.0-incubating/htrace-core-3.1.0-incubating.jar > > So it looks like something else is wrong with the installation or there is > a bug that is making these searches attempt to look in "hdfs://" instead of >

Re: ANNOUNCE: Yu Li joins the Apache HBase PMC

2017-04-14 Thread Ted Yu
Congratulations, Yu ! On Fri, Apr 14, 2017 at 7:22 AM, Anoop John wrote: > On behalf of the Apache HBase PMC I"m pleased to announce that Yu Li > has accepted our invitation to become a PMC member on the Apache HBase > project. He has been an active contributor to HBase

Re: Hbase exportSnapshot fails in HA missing hbase-prefix-tree.jar

2017-04-14 Thread Ted Yu
his is a working Hbase HA setup which has been happily chugging along. 3 > node HDFS JN, 3 node zk, 2 of them NN+backup NN. Also two of them > HMaster+HMaster backup. Then several data nodes. > > Thanks for taking a look! > > Vasco > > On Fri, Apr 14, 2017 at 1:41 PM, T

Re: Hbase exportSnapshot fails in HA missing hbase-prefix-tree.jar

2017-04-14 Thread Ted Yu
Can you show the complete stack trace ? Please pastebin contents of hbase/site.xml Thanks > On Apr 14, 2017, at 3:51 AM, Vasco Pinho wrote: > > When running: > > bin/hbase org.apache.hadoop.hbase.snapshot.ExportSnapshot > -Dfs.s3a.buffer.dir=/tmp/hbase_snap_tmp -snapshot

Re: RequestsPerSecond on master status page

2017-04-13 Thread Ted Yu
Take a look at MetricsRegionServerWrapperImpl (which implements MetricsRegionServerWrapper): private volatile double requestsPerSecond = 0.0; FYI On Thu, Apr 13, 2017 at 11:50 AM, jeff saremi wrote: > there is a metric that we can't find in hadoop metric2-compatible

Re: Efficient time based queries - TIMERANGE or STARTROW/STOPROW?

2017-04-12 Thread Ted Yu
Since STARTROW is specified (with uuid) in both of your examples, I think their efficiency should be tantamount. Cheers On Wed, Apr 12, 2017 at 10:33 AM, Josh wrote: > Hi, > > I am just getting started with HBase, and have a question about the > efficiency of timestamp based

Re: [DISCUSS] Status of the 0.98 release line

2017-04-10 Thread Ted Yu
+1 Andrew has done tremendous work. On Mon, Apr 10, 2017 at 12:17 PM, Mikhail Antonov wrote: > +1 to EOL 0.98. > > Thanks Andrew for all the work maintaining it! > > -Mikhail > > On Mon, Apr 10, 2017 at 12:10 PM, Dima Spivak > wrote: > > > +1 > > >

Re: Is it because the JVM memory is low?

2017-04-07 Thread Ted Yu
Hi, Tsutomu: Do you keep memstore parameters as default ? Can you pastebin the remaining log from master ? See if you can give master more than 1GB of memory. Cheers On Thu, Apr 6, 2017 at 9:15 PM, wrote: > Hello. > > I am testing HBASE now. > > The following

Re: HBase as a file repository

2017-04-03 Thread Ted Yu
NAS replacement and move on. But then, the HBase API currently > > > seems > > > > to > > > > > be the only thing getting in my way. > > > > > > > > > > I checked async HBase projects, but apparently they're focused on > > &

Re: HBase as a file repository

2017-03-30 Thread Ted Yu
Have you read: http://hbase.apache.org/book.html#hbase_mob In particular: When using MOBs, ideally your objects will be between 100KB and 10MB Cheers On Thu, Mar 30, 2017 at 1:01 PM, Daniel Jeliński wrote: > Hello, > I'm evaluating HBase as a cheaper replacement for NAS

Re: How to healthcheck a regionserver

2017-03-30 Thread Ted Yu
the name of the region server > > - somehow find a record belonging to that region (starting hash?) > > - do a GET on one record of each region and report back > > > From: Ted Yu <yuzhih...@gmail.com> > Sent: Wednesday, March 29, 201

Re: How to healthcheck a regionserver

2017-03-30 Thread Ted Yu
Have you heard of http://slider.incubator.apache.org/ (since you mentioned Yarn) ? Slider provides several methods of monitoring region server health. FYI On Wed, Mar 29, 2017 at 9:57 PM, jeff saremi wrote: > We have our region servers assigned by Yarn and occasionally

Re: [DISCUSS] Status of 0.94 release line

2017-03-29 Thread Ted Yu
Yes, I think so. On Wed, Mar 29, 2017 at 6:54 AM, Sean Busbey wrote: > Hi Folks! > > It's been 2 years since there's been a release on the 0.94 line. Shall > we consider it end-of-maintenance? > > -busbey >

Re: [ANNOUNCE] - Welcome our new HBase committer Anastasia Braginsky

2017-03-27 Thread Ted Yu
Congratulations, Anastasia. On Mon, Mar 27, 2017 at 6:34 AM, Yu Li wrote: > Congrats and Welcome! > > Best Regards, > Yu > > On 27 March 2017 at 20:57, ashish singhi wrote: > > > Congrats and Welcome! > > > > -Original Message- > > From:

Re: All memstores flushed to be quite small files

2017-03-27 Thread Ted Yu
How many column families does your table have ? Which hbase release are you using ? Can you pastebin more of the server log around the time of flush ? Thanks On Mon, Mar 27, 2017 at 6:26 AM, Hef wrote: > Hi, > Does anyone have an idea why most of my 128MB memstore

Re: HBase master dies (1.1.2) often

2017-03-24 Thread Ted Yu
t / > 192.168.80.51:44456 which had sessionid 0x35af577e0ac > > > Margus (margusja) Roo > http://margus.roo.ee > skype: margusja > https://www.facebook.com/allan.tuuring > +372 51 48 780 > > On 23/03/2017 08:43, Ted Yu wrote: > >> Have you checked zookeeper logs to

Re: HBase master dies (1.1.2) often

2017-03-23 Thread Ted Yu
Have you checked zookeeper logs to see if there was some clue ? Cheers > On Mar 22, 2017, at 11:30 PM, Margus Roo wrote: > > Hi > > Almost every night hbase master is closed. In error log I can see: > gc.log: > 2017-03-23T01:59:27.239+0200: 41752.366: [GC (Allocation Failure)

Re: Need guidance on Custom Compaction Policy

2017-03-22 Thread Ted Yu
Have you taken look at http://hbase.apache.org/book.html#ops.date.tiered ? Cheers On Wed, Mar 22, 2017 at 12:29 PM, jeff saremi wrote: > I mentioned some of this in another thread. We have a readonly database > which get bulk loaded using HFiles. > We want to keep only

Re: Question in WALEdit

2017-03-22 Thread Ted Yu
Sreeram: For #2, did you mean this method ? default void postWALRestore(final ObserverContext ctx, HRegionInfo info, WALKey logKey, WALEdit logEdit) throws IOException {} On Wed, Mar 22, 2017 at 12:56 PM, Vladimir Rodionov wrote: > a) HBase does not support

Re: Unable to get coprocessor debug logs in regionserver.

2017-03-20 Thread Ted Yu
What were your log4j settings ? Could it be due to org.apache.hadoop.hbase.regionserver.wal (where FSHLog resides) not being at DEBUG level ? Cheers On Mon, Mar 20, 2017 at 7:16 AM, Sreeram wrote: > Hi, > > I am writing a coprocessor for postWALWrite event. > > I do not

Re: Why IOException occur when region server is closing (CloseRegionHandler.java#L110)?

2017-03-19 Thread Ted Yu
er: > Region server reported a fatal error: > > ABORTING region server : Unrecoverable exception while closing region , > still finishing close > > 2017-03-20 11:02:22,261 INFO org.apache.hadoop.hbase.master.RegionStates: > Offlined from > > 2017-03-20 11:02:22,723 INFO

Re: Why IOException occur when region server is closing (CloseRegionHandler.java#L110)?

2017-03-19 Thread Ted Yu
See HBASE-4270 Did you see this happen in your cluster ? If so, mind sharing related log snippets ? Cheers On Sun, Mar 19, 2017 at 7:50 PM, Kang Minwoo wrote: > Hello! > > In this code (https://github.com/apache/hbase/blob/master/hbase- >

Re: Parallel range scanners, not getting all rows.

2017-03-17 Thread Ted Yu
Please also notify AsyncHBase mailing list, if you haven't done so. 2017-03-17 9:01 GMT-07:00 Kristoffer Sjögren <sto...@gmail.com>: > Thanks Ted, that was indeed the problem. > > 2017-03-17 4:29 GMT+01:00 Ted Yu <yuzhih...@gmail.com>: > > Have you considered the emp

Re: Parallel range scanners, not getting all rows.

2017-03-16 Thread Ted Yu
with withStartRow (inclusive) and > withStopRow (exclusive) semantics? > > 2017-03-16 15:08 GMT+01:00 Ted Yu <yuzhih...@gmail.com>: > > Since you're using AsyncHBase, please consider posting on their mailing > > list. > > > > Thanks > > > > 2017-03

Re: Parallel range scanners, not getting all rows.

2017-03-16 Thread Ted Yu
Since you're using AsyncHBase, please consider posting on their mailing list. Thanks 2017-03-16 7:05 GMT-07:00 Kristoffer Sjögren : > Hi > > I'm trying to scan a table using start and stop key ranges based on a > single byte. > > I'm using AsyncHBase where scanners are start

Re: hbase table creation

2017-03-16 Thread Ted Yu
karthi: The link you posted was for 0.94 We'd better use up-to-date link from refguide (see my previous reply). Cheers On Thu, Mar 16, 2017 at 3:26 AM, karthi keyan wrote: > Rajesh, > > Use HBase snapshots for backup and move the data from your " >

Re: hbase table creation

2017-03-16 Thread Ted Yu
Have you looked at http://hbase.apache.org/book.html#ops.snapshots.export ? On Thu, Mar 16, 2017 at 3:16 AM, Rajeshkumar J wrote: > I have copied hbase data of a table from one cluster to another. For > instance I have a table testing and its data will be in the

Re: limiting user threads on client

2017-03-13 Thread Ted Yu
Are you using Java client ? See the following in HTable : public static ThreadPoolExecutor getDefaultExecutor(Configuration conf) { int maxThreads = conf.getInt("hbase.htable.threads.max", Integer. MAX_VALUE); FYI On Mon, Mar 13, 2017 at 9:14 AM, Henning Blohm

Re: hbase CMS gc pause serious program

2017-03-10 Thread Ted Yu
Attachment didn't go through. Do you use bucket cache ? It would reduce GC pause. > On Mar 9, 2017, at 9:24 PM, gehaijiang wrote: > > CMS GC program: > > 2017-03-10T10:15:25.741+0800: 4555916.378: [GC2017-03-10T10:15:25.741+0800: > 4555916.378: [ParNew:

Re: last exception: java.io.IOException: Call to e26-node.fqdn.com/10.12.1.209:60020 failed on local exception

2017-03-08 Thread Ted Yu
How many Thrift servers did you start ? The log snippet was INFO logs which didn't give us more clue. Can you pastebin more of the logs (from Thrift servers) ? Thanks On Wed, Mar 8, 2017 at 11:20 AM, Motty Cruz wrote: > Hello, in the past three weeks I have seen the

Re: Hbase Locality/table balancing

2017-03-08 Thread Ted Yu
I guess Pradheep's intention is to save some cost for the duration when the hbase cluster is not used. On Wed, Mar 8, 2017 at 1:37 AM, sudhakara st wrote: > I believe the problem is bring down the cluster is everyday. On Hbase > start probability of re-assigns regions

Re: Is deploying Region server as a YARN job a customary thing to do?

2017-03-06 Thread Ted Yu
Related: https://slider.incubator.apache.org/ Consider polling Slider mailing list. FYI On Mon, Mar 6, 2017 at 10:08 AM, jeff saremi wrote: > We have the option of running our region server dynamically as a YARN job. > I'd like to know if this is what everyone else

Re: Why doesn't HBase use github pull requests

2017-03-03 Thread Ted Yu
For github pull requests to go through Jenkins QA, there is certain infrastructure setup needed. Due to lack of such setup, we still go with attaching patches. FYI > On Mar 3, 2017, at 5:11 AM, Kahlil Oppenheimer > wrote: > > Hi all, > > I just had my first

Re: HBase scan returns inconsistent results on multiple runs for same dataset

2017-03-02 Thread Ted Yu
check for include JIRAs on top of those in the ASF release > is > > here: > > > > http://archive.cloudera.com/cdh5/cdh/5/hbase-1.2.0-cdh5.9. > > 1.releasenotes.html > > > > HBASE-15378 is not in CDH5.9.1. > > > > On Wed, Mar 1, 2017 at 9:58 AM, Ted Yu &l

Re: last exception: java.io.IOException: Call to e26-node.fqdn.com/10.12.1.209:60020 failed on local exception

2017-03-02 Thread Ted Yu
Dropping hadoop ML. This should be directed to user@hbase. Which hbase release are you using ? On Thu, Mar 2, 2017 at 10:02 AM, Motty Cruz wrote: > Hello, in the past two weeks, I see the following error on HBase Thrift > servers, we have total of about 10 Thrift servers

Re: Scan and Get - different results

2017-03-02 Thread Ted Yu
Actually this is manifestation of HBASE-17125 On Thu, Mar 2, 2017 at 1:48 AM, Devi Sunil Kumar Shegu < sunil143d...@gmail.com> wrote: > Hi, > > Scan and get are yielding different results for the same column even when > versioning is disabled. > > Thanks in advance and please check the details

Re: parallel processing of regions in coprocessor

2017-03-02 Thread Ted Yu
of coprocessors of all relevant regions on individual server places burden on the server. FYI On Thu, Mar 2, 2017 at 8:22 AM, Daniel Połaczański <dpolaczan...@gmail.com> wrote: > thx, this is what I needed > > 2017-03-02 11:07 GMT+01:00 Ted Yu <yuzhih...@gmail.com>: > >

Re: Scan and Get - different results

2017-03-02 Thread Ted Yu
When you issue raw scan, what output do you get ? hbase> scan 't1', {RAW => true} BTW looks like you have row key 'status', I am bit curious what your schema is - 'status' is such general term. On Thu, Mar 2, 2017 at 1:48 AM, Devi Sunil Kumar Shegu < sunil143d...@gmail.com> wrote: > Hi, > >

Re: Pre-emptive Fast Fail not used for scanners?

2017-03-02 Thread Ted Yu
source of the NPE as the ScannerCallableWithReplicas not extending > RegionServerCallable as ScannerCallable does. I'm running some additional > tests now but should have an update shortly. > > On Tue, Feb 28, 2017 at 2:29 PM, Ted Yu <yuzhih...@gmail.com> wrote: > > > Seems to be a bug. >

Re: client side configurations

2017-03-02 Thread Ted Yu
added values greater than default i.e.,10. And I am > running the java class in hbase master > > On Thu, Mar 2, 2017 at 3:53 PM, Ted Yu <yuzhih...@gmail.com> wrote: > > > bq. I have added this property > > > > On which machine did you add the property ? >

Re: client side configurations

2017-03-02 Thread Ted Yu
bq. I have added this property On which machine did you add the property ? If you added on client machine (normally outside the cluster), region servers wouldn't see them. Did you pass timeout values greater than the default ? Cheers On Wed, Mar 1, 2017 at 11:00 PM, Rajeshkumar J

Re: parallel processing of regions in coprocessor

2017-03-02 Thread Ted Yu
> I'm thinking about some hack like invoking 4 times rpc from client when i > > have 4 regions > > > > 02.03.2017 00:43 "Ted Yu" <yuzhih...@gmail.com> napisał(a): > > > > > To my knowledge, there is no support for this type of combination of

Re: parallel processing of regions in coprocessor

2017-03-01 Thread Ted Yu
coprocessor for every > region separately, return partial results to the client. On the client > invoke reduce phase and store result in another hbase table. > > 2017-03-01 23:26 GMT+01:00 Ted Yu <yuzhih...@gmail.com>: > > > Can you describe your use case in more detail ? > >

Re: parallel processing of regions in coprocessor

2017-03-01 Thread Ted Yu
Can you describe your use case in more detail ? What type of custom coprocessor are you loading to the region server ? Thanks On Wed, Mar 1, 2017 at 2:24 PM, Daniel Połaczański wrote: > Hi, > Let's assume that we have cluster consisting from one RegionServer and the >

Re: HBase scan returns inconsistent results on multiple runs for same dataset

2017-03-01 Thread Ted Yu
. ( > https://www.cloudera.com/documentation/enterprise/ > release-notes/topics/cdh_vd_cdh_package_tarball_59.html > ) > I have no idea if HBASE-15378 is included. > > On Wed, Mar 1, 2017 at 9:33 PM, Ted Yu <yuzhih...@gmail.com> wrote: > > > Which hbase version

Re: HBase scan returns inconsistent results on multiple runs for same dataset

2017-03-01 Thread Ted Yu
Which hbase version are you using ? Does it include HBASE-15378 ? > On Mar 1, 2017, at 5:02 AM, Hef wrote: > > Hi, > I'm encountering a strange behavior on MapReduce when using HBase as input > format. I run my MR tasks on a same table, same dataset, with a same > pattern

Re: Need guidance on getting detailed elapsed times in every stage of processing a request

2017-02-28 Thread Ted Yu
Have you looked at: http://hbase.apache.org/book.html#tracing On Tue, Feb 28, 2017 at 12:37 PM, jeff saremi wrote: > I think we need to get detailed information from HBase RegionServer logs > on how a request (read or write) is processed. Specifically speaking, i > need

Re: Pre-emptive Fast Fail not used for scanners?

2017-02-28 Thread Ted Yu
0"),Bytes.toBytes("cq1")); > > //multi.add(s); > > ResultScanner scanner = table.getScanner(s); > > scanner.next(); > > scanner.close(); > > > > > On Tue, Feb 28, 2017 at 12:41 PM, Ted Yu <yuzhih...@gmail.com> wrote: > > > What's the value f

Re: Pre-emptive Fast Fail not used for scanners?

2017-02-28 Thread Ted Yu
What's the value for "hbase.client.fast.fail.mode.enabled" ? The default value is false. Cheers On Tue, Feb 28, 2017 at 9:36 AM, James Moore wrote: > We're considering rolling out Pre-Emptive Fast fail to replace a homegrown > solution based on Hystrix for fast failing

Re: Mini announcement HBase connector for Apache Beam

2017-02-25 Thread Ted Yu
I saw that HBaseIOTest was disabled. Just curious, was the test flaky ? On Sat, Feb 25, 2017 at 8:12 AM, Ismaël Mejía <ieme...@gmail.com> wrote: > I will for sure, Thanks Ted. > > > On Thu, Feb 23, 2017 at 4:00 PM, Ted Yu <yuzhih...@gmail.com> wrote: > > > Ism

Re: Mini announcement HBase connector for Apache Beam

2017-02-23 Thread Ted Yu
Ismaël: Can you post your future questions on the mailing list ? Thanks On Thu, Feb 23, 2017 at 5:49 AM, Ismaël Mejía wrote: > Solomon, it is so great you answered, thanks a lot, you saved me with the > idea of using the Protobuf serialization for the Coders, it was quite >

Re: attachment permissions for HBASE jira?

2017-02-21 Thread Ted Yu
I have added you as contributor. You should be able to attach now. On Tue, Feb 21, 2017 at 10:31 AM, James Moore wrote: > Hi All, > > I'm trying to attach a patch file to > https://issues.apache.org/jira/browse/HBASE-17501 but it appears that my > account doesn't have

Re: Parallel Scanner

2017-02-20 Thread Ted Yu
nt record is child for > itself ) > > we set the max hregion file size as 10gb. I don't think we have any control > on region size :( > > Thanks > > > On 20 February 2017 at 21:24, Ted Yu <yuzhih...@gmail.com> wrote: > > > Among the 5 columns, do you know roughl

Re: Parallel Scanner

2017-02-20 Thread Ted Yu
it is like 5 > column composite key. Prefix for a common set of data would have same first > prefix. I am not sure how to convey the data distribution. > > Thanks. > > On 20 February 2017 at 20:48, Ted Yu <yuzhih...@gmail.com> wrote: > > > Anil: > > What's the

Re: Parallel Scanner

2017-02-20 Thread Ted Yu
Anil: What's the current region size you use ? Given a region, do you have some idea how the data is distributed within the region ? Cheers On Mon, Feb 20, 2017 at 7:14 AM, Anil wrote: > i understand my original post now :) Sorry about that. > > now the challenge is to

Re: Don't Settle for Eventual Consistency

2017-02-17 Thread Ted Yu
Reference #8 at the end of the post is interesting. On Fri, Feb 17, 2017 at 9:23 AM, Robert Yokota wrote: > Hi, > > This may be helpful to those who are considering the use of HBase. > > https://yokota.blog/2017/02/17/dont-settle-for-eventual-consistency/ >

Re: Publish / Subscribe with HBase

2017-02-15 Thread Ted Yu
There is no such functionality in scan currently. > On Feb 15, 2017, at 5:47 AM, Rohit Jain wrote: > > Hi folks, > > Is there a way to hang a scan read against an HBase table that does not > complete until the next row arrives (is inserted into the table) instead of >

Re: Doubt

2017-02-14 Thread Ted Yu
ngodb mainly so to keep myself updated I follow these > discussions to learn about issues so I post it on my profile so that I can > have easy access to those issues. > > Thanks > Gaurhari > > On Tue, Feb 14, 2017 at 3:15 PM, Ted Yu <yuzhih...@gmail.com> wrote: > >

Re: Doubt

2017-02-14 Thread Ted Yu
y so I am confused. > > As I don't want to get into legal trouble. > > Thanks > Gaurhari > > On Tue, Feb 14, 2017 at 1:41 PM, Ted Yu <yuzhih...@gmail.com> wrote: > > > The previous thread from mailing list is on-going. > > > > I would expect proper edit

Re: Doubt

2017-02-14 Thread Ted Yu
en let me know or should I include link. > > > > Thanks > Gaurhari > > > >> On Tue, Feb 14, 2017 at 12:30 PM, Ted Yu <yuzhih...@gmail.com> wrote: >> >> [image: Boxbe] <https://www.boxbe.com/overview> Automatic Cleanup: keep >> last 1 ema

Re: Doubt

2017-02-14 Thread Ted Yu
Clicking on both links directed me to: https://www.linkedin.com/post/new Do the pages require read permission ? On Tue, Feb 14, 2017 at 1:46 AM, gaurhari dass wrote: > Hi > > I want to post like this > > https://www.linkedin.com/post/edit/hbase-read-write- >

Re: On HBase Read Replicas

2017-02-11 Thread Ted Yu
Please take a look at the design doc attached to https://issues.apache.org/jira/browse/HBASE-10070. Your first question would be answered by that document. Cheers On Sat, Feb 11, 2017 at 2:06 PM, jeff saremi wrote: > The first time I heard replicas in HBase the

Re: How to avoid update blocking

2017-02-11 Thread Ted Yu
Which hbase release are you using ? You have given estimate on write load. How about read load ? Do you observe blocking / slow down on write path ? Consider increasing memstore, decreasing block cache if read load is light. I assume the 4 nodes with 125 GB Ram have HDD (instead of SSD). If you

Re: HBase Performance

2017-02-10 Thread Ted Yu
d n write performance. > > Sure will add more nodes. > > > > On Feb 10, 2017 19:44, "Ted Yu" <yuzhih...@gmail.com> wrote: > > Do you want to increase performance for reads, writes or both ? > > Your table is presplit. Can you add more node to leverage the

Re: HBase Performance

2017-02-10 Thread Ted Yu
Do you want to increase performance for reads, writes or both ? Your table is presplit. Can you add more node to leverage the multiple regions ? If you need better performance for reads, consider using BucketCache. Cheers > On Feb 10, 2017, at 2:22 AM, Devi Sunil Kumar Shegu

Re: Region server dies at regular intervals for unknown reasons.

2017-02-09 Thread Ted Yu
he.hadoop.hbase.master.SplitLogManager: > dead splitlog workers [{HOST1}] > 2017-02-08 11:08:12,445 INFO org.apache.hadoop.hbase.master.SplitLogManager: > started splitting {N} logs in [hdfs://a/b/WALs/{HOST1}] > ... > (SplitLogManager log) > ... > > > Yes,

Re: Dropping a very large table - 75million rows

2017-02-09 Thread Ted Yu
> "good" > > locality regions were no longer included). This wouldn't have affected > your > > system's performance because the locality for the table didn't change -- > > just the system-wide locality. > > > > > > Ted Yu wrote: > >

Re: Dropping a very large table - 75million rows

2017-02-09 Thread Ted Yu
bq. The locality of regions for OTHER tables on the same regionserver also fell drastically Can you be a bit more specific on how you came to the above conclusion ? Dropping one table shouldn't affect locality of other tables - unless number of regions on each server becomes unbalanced which

Re: HBase Down Scaling

2017-02-09 Thread Ted Yu
Can you be specific about how the table didn't work ? Were some of its regions in transition or offline ? Which hbase release are you using ? Please pastebin relevant master log / region server log. Thanks > On Feb 9, 2017, at 3:50 AM, Devi Sunil Kumar Shegu > wrote:

Re: Region server dies at regular intervals for unknown reasons.

2017-02-09 Thread Ted Yu
r. > > This error seems to have killed the region server. > > One second after this error > > I found Error log on the master server. > > RegionServerTracker: RegionServer ephemeral node deleted > > Thanks > > > > 보낸 사람:

Re: Region server dies at regular intervals for unknown reasons.

2017-02-07 Thread Ted Yu
eceives a > region? > > Thanks. > ________ > 보낸 사람: Ted Yu <yuzhih...@gmail.com> > 보낸 날짜: 2017년 2월 7일 화요일 오전 11:38:23 > 받는 사람: user@hbase.apache.org > 제목: Re: Region server dies at regular intervals for unknown reasons. > > 0.96 was so old. >

Re: Seeking advice on skipped/lost data during data migration from and to a hbase table

2017-02-07 Thread Ted Yu
0.0-cdh5.5.4. > > > > > > I guess that puts us back to square one. Any other ideas? > > > > > > On Sun, Feb 5, 2017 at 1:10 PM Alexandre Normand < > > > alexandre.norm...@gmail.com> wrote: > > > > > >> That's a good s

Re: Region server dies at regular intervals for unknown reasons.

2017-02-06 Thread Ted Yu
> I did not run hbck. > > Thanks > ________ > 보낸 사람: Ted Yu <yuzhih...@gmail.com> > 보낸 날짜: 2017년 2월 7일 화요일 오전 10:40:28 > 받는 사람: user@hbase.apache.org > 제목: Re: Region server dies at regular intervals for unknown reasons. > > Kang: > Please let

Re: Region server dies at regular intervals for unknown reasons.

2017-02-06 Thread Ted Yu
Kang: Please let us know the release of hbase and hadoop you use. Did you run hbck around the time region server crashed ? If there was inconsistency, please pastebin as well. Thanks On Mon, Feb 6, 2017 at 5:36 PM, Ganesh Viswanathan wrote: > Check the GC logs for HBase and

Re: How should I pool Connection (HConnectionImplementation)?

2017-02-05 Thread Ted Yu
, does HBase share the connection? > > ________ > 보낸 사람: Ted Yu <yuzhih...@gmail.com> > 보낸 날짜: 2017년 2월 4일 토요일 오후 9:04:52 > 받는 사람: user@hbase.apache.org > 제목: Re: How should I pool Connection (HConnectionImplementation)? > > Did you observe some bott

Re: Seeking advice on skipped/lost data during data migration from and to a hbase table

2017-02-05 Thread Ted Yu
it sounds like it would be difficult to > confirm that we've been affected by this bug. Am I right? > > On Sun, Feb 5, 2017 at 12:36 PM Ted Yu <yuzhih...@gmail.com> wrote: > > > Which release of hbase are you using ? > > > > To be specific, does the release h

Re: Seeking advice on skipped/lost data during data migration from and to a hbase table

2017-02-05 Thread Ted Yu
Which release of hbase are you using ? To be specific, does the release have HBASE-15378 ? Cheers On Sun, Feb 5, 2017 at 11:32 AM, Alexandre Normand < alexandre.norm...@gmail.com> wrote: > We're migrating data from a previous iteration of a table to a new one and > this process involved a MR

Re: Hbase Architecture Questions

2017-02-03 Thread Ted Yu
data has been scanned. To drop the data before next round of writes come in, you can set TTL for the table properly. Cheers On Fri, Feb 3, 2017 at 1:46 PM, Ted Yu <yuzhih...@gmail.com> wrote: > bq. We use Hbase 1.0.0 > > 1.0.0 was quite old. > > Can you try more recent r

Re: help for a fee -- using filters to search data in hbase for correct results

2017-02-03 Thread Ted Yu
Dropping dev@ again. Your code was looking for column family "name". if (new String(kv.getFamily()).equals("name")) { But from the listing, it was not obvious which rows were in "name" family. You should keep track of the column qualifier(s) (in the nested loop) through the following

Re: help for a fee -- using filters to search data in hbase for correct results

2017-02-03 Thread Ted Yu
Dropping dev@ Can you format your code in more readable style such as the following ? http://pastebin.com/CQMdf4pc Also separate code and description instead of mixing them together. Which release of hbase are you using ? Thanks On Fri, Feb 3, 2017 at 2:14 PM, Yoom Nguyen

Re: Hbase Architecture Questions

2017-02-03 Thread Ted Yu
bq. We use Hbase 1.0.0 1.0.0 was quite old. Can you try more recent releases such as 1.3.0 (the hbase-thrift module should be more robust) ? If your nodes have enough memory, have you thought of using bucket cache to improve read performance ? Cheers On Fri, Feb 3, 2017 at 1:34 PM, Akshat

Re: HBase server aborts on coprocessor exceptions even though hbase.coprocessor.abortonerror is set to false

2017-02-01 Thread Ted Yu
Can you take a look at TestMasterCoprocessorExceptionWithRemove to see if it covers your case ? If not, can it be modified to exhibit the behavior you described ? Cheers On Wed, Feb 1, 2017 at 5:45 AM, Steen Manniche wrote: > I'm trying to specify some sanity checks in my

Re: Hbase data version performance impact

2017-01-31 Thread Ted Yu
com> wrote: > Hi Ted > > Its mean I have no other way to load specific version value the only way to > load all value at client side and get version of my choice. > > Manjeet > On 1 Feb 2017 00:25, "Ted Yu" <yuzhih...@gmail.com> wrote: > > > For #3,

Re: replication concepts - enabling peer vs enabling table replication

2017-01-31 Thread Ted Yu
The lag would come down after the port opens. On Tue, Jan 31, 2017 at 2:53 PM, marjana wrote: > Yes the status command was run on source cluster. These are my peers: > > PEER_ID CLUSTER_KEY STATE TABLE_CFS > 3 >

Re: HBase Snapshot

2017-01-31 Thread Ted Yu
have 2 replcas in target cluster after restore > eventhough the config has 3 as replication factor. > Since it is a file level copy I guess the WAL will not have the edits and > hence cannot change the number of copies based on target config. > > Thanks, > Pradheep > > >

Re: HBase Snapshot

2017-01-31 Thread Ted Yu
Yes. It should work. On Tue, Jan 31, 2017 at 1:28 PM, Pradheep Shanmugam < pradheep.shanmu...@infor.com> wrote: > Hi, > > Can the Hbase Snapshot work when I snap shot a table from a cluster with > replication factor as 2 and restore it on a > Cluster with replication factor as 3? > > Thanks, >

Re: Hbase data version performance impact

2017-01-31 Thread Ted Yu
For #3, you need to retrieve multiple versions (to get to V2). Take a look at TestVisibilityLabelsWithDeletes#testDeleteColumnWithLatestTimeStampUsingMultipleVersions around line 1368. FYI On Tue, Jan 31, 2017 at 9:58 AM, Manjeet Singh wrote: > Hi All > > can

Re: replication concepts - enabling peer vs enabling table replication

2017-01-31 Thread Ted Yu
I assume both clusters run hbase 1.2.0 How many servers are there in each cluster ? Have you checked region server logs in the slave cluster to see if there is some clue ? Thanks On Tue, Jan 31, 2017 at 9:14 AM, marjana wrote: > It is 1.2.0 hbase version. > > > > -- >

Re: Question on WALEdit

2017-01-28 Thread Ted Yu
FYI On Sat, Jan 28, 2017 at 8:29 AM, Ted Yu <yuzhih...@gmail.com> wrote: > I haven't found the API you were looking for. > > Which release of hbase are you using ? > I assume it supports tags. > > If you use tag to pass event-id, you can retrieve thru this method of > W

Re: Question on WALEdit

2017-01-28 Thread Ted Yu
I haven't found the API you were looking for. Which release of hbase are you using ? I assume it supports tags. If you use tag to pass event-id, you can retrieve thru this method of WALEdit: public ArrayList getCells() { >From Cell, there're 3 methods for retrieving tag starting with:

Re: Writing/Importing large number of records into HBase

2017-01-27 Thread Ted Yu
ding to @Ted Check Bulk Put Example - > https://github.com/tmalaska/SparkOnHBase/blob/master/src/main/scala/org/apache/hadoop/hbase/spark/example/hbasecontext/HBaseBulkPutExampleFromFile.scala > >> On Sat, Jan 28, 2017 at 9:11 AM, Ted Yu <yuzhih...@gmail.com> wrote: >>

Re: Writing/Importing large number of records into HBase

2017-01-27 Thread Ted Yu
Have you looked at hbase-spark module (currently in master branch) ? See hbase-spark/src/main/scala/org/apache/hadoop/hbase/spark/example/datasources/AvroSource.scala and hbase-spark/src/test/scala/org/apache/hadoop/hbase/spark/DefaultSourceSuite.scala for examples. There may be other options.

Re: table schema - row with many column vs many rows

2017-01-26 Thread Ted Yu
Daniel: For the underlying column family, do you use any data block encoding / compression ? Which hbase release do you use ? Thanks On Thu, Jan 26, 2017 at 2:12 PM, Dave Birdsall wrote: > My guess (and it is only a guess) is that you are traversing much less of > the

Re: RpcRetryingCaller error accessing HBase from MapReduce job

2017-01-25 Thread Ted Yu
948532_0001/container_ > 1485362948532_0001_01_02/job.jar:/tmp/hadoop-hdadmin/ > nm-local-dir/usercache/idstest/appcache/application_ > 1485362948532_0001/container_1485362948532_0001_01_02/ > hbase-common-1.2.4.jar:/tmp/hadoop-hdadmin/nm-local-dir/ > usercache/idstest/appcac

<    1   2   3   4   5   6   7   8   9   10   >