Re: MR job randomly scans up thousands of rows less than the it should.

2012-03-02 Thread Cosmin Lehene
...@adobe.com] Sent: Tuesday, February 14, 2012 12:02 PM To: dev@hbase.apache.org Subject: Re: MR job randomly scans up thousands of rows less than the it should. I just got back on this issue. Initially the behavior we've seen (missing rows) wouldn't reproduce on 0.90 using TestAcidGuarantees. However

Re: MR job randomly scans up thousands of rows less than the it should.

2012-02-15 Thread Cosmin Lehene
@hbase.apache.org Subject: Re: MR job randomly scans up thousands of rows less than the it should. I just got back on this issue. Initially the behavior we've seen (missing rows) wouldn't reproduce on 0.90 using TestAcidGuarantees. However, if the puts in the writer threads include additional rows the scanners

Re: MR job randomly scans up thousands of rows less than the it should.

2012-02-14 Thread Cosmin Lehene
I just got back on this issue. Initially the behavior we've seen (missing rows) wouldn't reproduce on 0.90 using TestAcidGuarantees. However, if the puts in the writer threads include additional rows the scanners will start reading less rows. This reproduces consistently on 0.90 and seems to be

RE: MR job randomly scans up thousands of rows less than the it should.

2012-02-14 Thread Amitanand Aiyer
Subject: Re: MR job randomly scans up thousands of rows less than the it should. I just got back on this issue. Initially the behavior we've seen (missing rows) wouldn't reproduce on 0.90 using TestAcidGuarantees. However, if the puts in the writer threads include additional rows the scanners

Re: MR job randomly scans up thousands of rows less than the it should.

2012-02-06 Thread Cosmin Lehene
Thanks Ted! I wonder if it would make more sense to port it to 0.90.X or upgrade to 0.92. Cosmin On 2/2/12 5:03 PM, Ted Yu yuzhih...@gmail.com wrote: HBASE-4838 ports HBASE-2856 to 0.92 FYI On Thu, Feb 2, 2012 at 4:46 PM, Cosmin Lehene cleh...@adobe.com wrote: (sorry for the damaged

Re: MR job randomly scans up thousands of rows less than the it should.

2012-02-02 Thread Cosmin Lehene
(sorry for the damaged subject :)) Hey Jon, We have two column families. There are no filters and there's a full table scan. We're not skipping rows. I did see however a single time that we had one qualifier fault in the job counters (it was missing, and it wasn't supposed to be missing).

Re: MR job randomly scans up thousands of rows less than the it should.

2012-02-02 Thread Ted Yu
HBASE-4838 ports HBASE-2856 to 0.92 FYI On Thu, Feb 2, 2012 at 4:46 PM, Cosmin Lehene cleh...@adobe.com wrote: (sorry for the damaged subject :)) Hey Jon, We have two column families. There are no filters and there's a full table scan. We're not skipping rows. I did see however a single

Re: MR job randomly scans up thousands of rows less than the it should.

2012-02-01 Thread Jonathan Hsieh
Cosmin, How many column families to you have in this table? Are you using any filters in you HBase scans? Are you using skip rows that may not have qualifiers present? There are a few known issues with multi-CF atomicity and a recent one about flushes that may be related to this problem.