Timo: Other than two occurrences of 'Took too long to split the files' @ 13:54:20,194 and 13:55:10,533, I don't find much clue from the posted log.
If you have time, mind checking namenode log for 1 minute interval leading up to 13:54:20,194 and 13:55:10,533, respectively ? Thanks On Sat, Dec 14, 2013 at 5:21 AM, Timo Schaepe <t...@timoschaepe.de> wrote: > Hey, > > @JM: Thanks for the hint with hbase.regionserver.fileSplitTimeout. At the > moment (the import is actually working) and after I splittet the specific > regions manually, we do not have growing regions anymore. > > hbase hbck says, all things are going fine. > 0 inconsistencies detected. > Status: OK > > @Ted Yu: Sure, have a look here: http://pastebin.com/2ANFVZEU > The relevant tablename ist data_1091. > > Thanks for your time. > > Timo > > Am 13.12.2013 um 20:18 schrieb Ted Yu <yuzhih...@gmail.com>: > > > Timo: > > Can you pastebin regionserver log around 2013-12-12 13:54:20 so that we > can > > see what happened ? > > > > Thanks > > > > > > On Fri, Dec 13, 2013 at 11:02 AM, Jean-Marc Spaggiari < > > jean-m...@spaggiari.org> wrote: > > > >> Try to increase hbase.regionserver.fileSplitTimeout but put it back to > its > >> default value after. > >> > >> Default value is 30 seconds. I think it's not normal for a split to take > >> more than that. > >> > >> What is your hardware configuration? > >> > >> Have you run hbck to see if everything is correct? > >> > >> JM > >> > >> > >> 2013/12/13 Timo Schaepe <t...@timoschaepe.de> > >> > >>> Hello again, > >>> > >>> digging in the logs of the specific regionserver shows me that: > >>> > >>> 2013-12-12 13:54:20,194 INFO > >>> org.apache.hadoop.hbase.regionserver.SplitRequest: Running > >> rollback/cleanup > >>> of failed split of > >>> > >> > data,OR\x83\xCF\x02\x82\xAE\xF3U,1386851456415.d04cadb1b2252dafc476c138e9651ca7.; > >>> Took too long to split the files and create the references, aborting > >> split > >>> > >>> This message appears two time, so it seems, that HBase tried to split > the > >>> region but it failed. I don't know why. How is the behaviour of HBase, > >> if a > >>> region split fails? Are there more tries to split this region again? I > >>> didn't find any new tries in the log. Now I split the big regions > >> manually > >>> and this works. And also it seems, that HBase split the new regions > again > >>> to crunch they down to the given limit. > >>> > >>> But also it is a mystery for me, why the split size in Hannibal shows > me > >>> 10 GB and in base-site.xml I put 2 GB… > >>> > >>> Thanks, > >>> > >>> Timo > >>> > >>> > >>> Am 13.12.2013 um 10:22 schrieb Timo Schaepe <t...@timoschaepe.de>: > >>> > >>>> Hello, > >>>> > >>>> during the loading of data in our cluster I noticed some strange > >>> behavior of some regions, that I don't understand. > >>>> > >>>> Scenario: > >>>> We convert data from a mysql database to HBase. The data is inserted > >>> with a put to the specific HBase table. The row key is a timestamp. I > >> know > >>> the problem with timestamp keys, but in our requirement it works quiet > >>> well. The problem is now, that there are some regions, which are > growing > >>> and growing. > >>>> > >>>> For example the table on the picture [1]. First, all data was > >>> distributed over regions and node. And now, the data is written into > only > >>> one region, which is growing and I can see no splitting at all. > Actually > >>> the size of the big region is nearly 60 GB. > >>>> > >>>> HBase version is 0.94.11. I cannot understand, why the splitting is > not > >>> happening. In hbase-site.xml I limit the hbase.hregion.max.filesize to > 2 > >> GB > >>> and HBase accepted this value. > >>>> > >>>> <property> > >>>> <!--Loaded from hbase-site.xml--> > >>>> <name>hbase.hregion.max.filesize</name> > >>>> <value>2147483648</value> > >>>> </property> > >>>> > >>>> First mystery: Hannibal shows me the split size is 10 GB (see > >>> screenshot). > >>>> Second mystery: HBase is not splitting some regions neither at 2 GB > nor > >>> 10 GB. > >>>> > >>>> Any ideas? Could be the timestamp rowkey cause this problem? > >>>> > >>>> Thanks, > >>>> > >>>> Timo > >>>> > >>>> [1] https://www.dropbox.com/s/lm286xkcpglnj1t/big_region.png > >>> > >>> > >> > >