Re: A region full of data is missing

2009-11-13 Thread Vaibhav Puranik
Stack, We are on 0.20.0. I will file an issue only if this is not fixed in 0.20.1. Let me know. Regards, Vaibhav On Fri, Nov 13, 2009 at 4:17 PM, stack wrote: > On Fri, Nov 13, 2009 at 11:20 AM, Vaibhav Puranik wrote: > > > Now that we have resolved this problem and figured out

Re: A region full of data is missing

2009-11-13 Thread stack
On Fri, Nov 13, 2009 at 11:20 AM, Vaibhav Puranik wrote: > Now that we have resolved this problem and figured out that some data could > be missing because of a region having a small empty file, we were wondering > if there is any automated way we can check all of our regions for this kind > of pr

Re: A region full of data is missing

2009-11-13 Thread Vaibhav Puranik
Now that we have resolved this problem and figured out that some data could be missing because of a region having a small empty file, we were wondering if there is any automated way we can check all of our regions for this kind of problem. One obvious way would be to check all the regions for a sm

Re: A region full of data is missing

2009-11-10 Thread Vaibhav Puranik
This problem is resolved, courtesy of Ryan, JD and Stack. Thank you very much! The culprit region had two data files instead of one. The first data file was around 130 MB; the second was just 228 bytes. Because of a bug this second file gets created during major
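The symptom described here (a large data file sitting next to a tiny one in the same directory) can be screened for mechanically. The following is only a rough sketch, not a tool from this thread: it assumes you have saved the output of a recursive HDFS listing (for example from `hadoop fs -lsr`) and that each line has the usual eight columns (permissions, replication, owner, group, size, date, time, path). The function name, the 1024-byte threshold, and the sample paths are all made up for illustration.

```python
from collections import defaultdict


def find_suspect_dirs(listing_lines, max_bytes=1024):
    """Return {directory: [(file_name, size), ...]} for directories that
    hold more than one file, at least one of which is <= max_bytes.

    Assumes an eight-column listing: perms, replication, owner, group,
    size, date, time, path. Lines that do not fit are skipped.
    """
    files_by_dir = defaultdict(list)
    for line in listing_lines:
        fields = line.split(None, 7)
        if len(fields) < 8 or fields[0].startswith("d"):
            continue  # directory entry or a line we cannot parse
        try:
            size = int(fields[4])
        except ValueError:
            continue
        parent, _, name = fields[7].strip().rpartition("/")
        files_by_dir[parent].append((name, size))

    suspects = {}
    for parent, files in files_by_dir.items():
        tiny = sorted((n, s) for n, s in files if s <= max_bytes)
        if len(files) > 1 and tiny:
            suspects[parent] = tiny
    return suspects


if __name__ == "__main__":
    # Hypothetical listing; the paths and owners are invented for the example.
    sample = [
        "drwxr-xr-x   - hbase supergroup          0 2009-11-10 12:00 /hbase/t1/1001/cf",
        "-rw-r--r--   3 hbase supergroup  136314880 2009-11-10 12:00 /hbase/t1/1001/cf/aaa",
        "-rw-r--r--   3 hbase supergroup        228 2009-11-10 12:01 /hbase/t1/1001/cf/bbb",
        "-rw-r--r--   3 hbase supergroup   50000000 2009-11-10 12:00 /hbase/t1/1002/cf/ccc",
    ]
    for directory, tiny_files in find_suspect_dirs(sample).items():
        print(directory, tiny_files)
```

This is a heuristic only. A small file next to a large one is not necessarily a problem, so the result is a list of candidates to inspect by hand, not a verdict.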

Re: A region full of data is missing

2009-11-10 Thread Vaibhav Puranik
The region name contains the table name, the start key and an id. The start key is binary. In our case it was a mixture of a few longs. Whenever it is printed, it shows up as Unicode characters that look like junk or garbled text. I am not sure whether the shell can interpret it correctly. I don't know how to giv
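To illustrate why a start key built from longs prints as junk, and one way to make it readable, here is a small sketch. It is an illustration only: the table name, the key values, the id, and the comma-separated layout are assumptions for the example, and the `\xNN` escaping is a generic convention chosen here, not something this thread confirms the shell accepts.

```python
import struct


def make_start_key(*longs):
    """Pack the given integers as big-endian 8-byte signed longs."""
    return struct.pack(">%dq" % len(longs), *longs)


def to_printable(raw):
    """Render bytes as text, escaping anything outside printable ASCII
    (and the backslash itself) as \\xNN so the result is unambiguous."""
    out = []
    for b in raw:
        if 0x20 <= b < 0x7F and b != 0x5C:
            out.append(chr(b))
        else:
            out.append("\\x%02X" % b)
    return "".join(out)


if __name__ == "__main__":
    # Hypothetical key made of two longs, e.g. an account id and a timestamp.
    key = make_start_key(1, 1249084800000)
    # Hypothetical region name: table name, start key and an id.
    region_name = b"mytable," + key + b",1257812345678"
    print(to_printable(region_name))
```

Most bytes of a packed long fall outside the printable range, which is why the raw name looks garbled on a terminal; the escaped form can at least be copied and compared reliably.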

Re: A region full of data is missing

2009-11-10 Thread stack
You couldn't run the shell? So, region closed and opened somewhere else? Open on another regionserver and you still can't get data out of it? St.Ack On Tue, Nov 10, 2009 at 2:11 PM, Vaibhav Puranik wrote: > Stack, > > I tried doing HBaseAdmin.closeRegion with the binary region name. > > It c

Re: A region full of data is missing

2009-11-10 Thread Vaibhav Puranik
Stack, I tried doing HBaseAdmin.closeRegion with the binary region name. It closed the region and reopened it. But we still cannot access the data. I guess trying to read it back from the data file is the only option left, right? Regards, Vaibhav On Tue, Nov 10, 2009 at 12:56 PM, stack wrote

Re: A region full of data is missing

2009-11-10 Thread stack
On Mon, Nov 9, 2009 at 6:40 PM, Vaibhav Puranik wrote: > Does that mean the region is > open and needs to be closed? > > It means the region should be open... especially if it's the message the regionserver is passing back to the Master reporting a successful open. Maybe check the regionserver log to

Re: A region full of data is missing

2009-11-10 Thread stack
On Tue, Nov 10, 2009 at 11:41 AM, Ken Weiner wrote: > Other than http://wiki.apache.org/hadoop/Hbase/HbaseArchitecture, is there > any documentation on the lifecycle of a region or the meaning of the > following terms with respect to regions: > > No. You'll have to wait for the book Ken (smile).

Re: A region full of data is missing

2009-11-10 Thread Ken Weiner
Other than http://wiki.apache.org/hadoop/Hbase/HbaseArchitecture, is there any documentation on the lifecycle of a region or the meaning of the following terms with respect to regions: open vs closed online vs offline enabled vs disabled If not, it would be great if someone could explain these co

Re: A region full of data is missing

2009-11-10 Thread Lars George
Hi Vaibhav, You can close a region from the HBase shell. When in the shell type "tools" to get a list of options. One of them will be "close_region", see its help text next to it. HTH, Lars Vaibhav Puranik schrieb: Stack, Thanks a lot for your prompt answer. I could locate the file on the

Re: A region full of data is missing

2009-11-09 Thread Vaibhav Puranik
Stack, Thanks a lot for your prompt answer. I could locate the file on the filesystem. I did not find any unusual activity with the region name in the master log. Looks like the region was opened for updating rows. There are messages such as MSG_REPORT_PROCESS_OPEN, MSG_REPORT_OPEN, MSG_REPORT_C

Re: A region full of data is missing

2009-11-09 Thread stack
On Mon, Nov 9, 2009 at 5:11 PM, Vaibhav Puranik wrote: > > When we looked at the region distribution, we found that all of the missing > data (for a week in August) belongs to a region (based on start and end > keys). > > Do you know the region's name? Can you grep it in your master log and try to f
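If plain `grep` is awkward because the region name contains binary characters, the same search can be sketched in a few lines of Python. This is an assumption-laden sketch: the function name and the sample log lines are invented, and the file would be opened with `errors="replace"` so that undecodable bytes do not abort the scan.

```python
def grep_region(log_lines, region_name):
    """Yield (line_number, line) for every log line mentioning the region."""
    for number, line in enumerate(log_lines, start=1):
        if region_name in line:
            yield number, line.rstrip("\n")


if __name__ == "__main__":
    # Hypothetical log excerpt; a real run would iterate over
    # open(path_to_master_log, errors="replace") instead.
    sample_log = [
        "MSG_REPORT_PROCESS_OPEN: mytable,key1,123\n",
        "some unrelated line\n",
        "MSG_REPORT_OPEN: mytable,key1,123\n",
    ]
    for number, line in grep_region(sample_log, "mytable,key1,123"):
        print(number, line)
```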

A region full of data is missing

2009-11-09 Thread Vaibhav Puranik
Hi, We are running HBase 0.20.0 with 3 region servers (on EC2 with EBS volumes). We suddenly found that data for a week is missing in one of our tables. This table is continuously updated. When we try to query the table with that week's time stamp, we get nothing back as if the data does not exi