[jira] [Created] (HBASE-14060) Add FuzzyRowFilter to ParseFilter

2015-07-10 Thread Esteban Gutierrez (JIRA)
Esteban Gutierrez created HBASE-14060: - Summary: Add FuzzyRowFilter to ParseFilter Key: HBASE-14060 URL: https://issues.apache.org/jira/browse/HBASE-14060 Project: HBase Issue Type: Bug

Re: hbase hdfs snapshots

2015-07-10 Thread Vladimir Rodionov
Matteo, thanks for reminding me about verification stage - this is it. Master, verifying 60K files and regions in a live cluster ... this can explain 30 minutes. If even one region splits during a snapshot, snapshot will fail. Its amazing, that these guys are able to to finish snapshot at all. -V

Re: hbase hdfs snapshots

2015-07-10 Thread Matteo Bertozzi
yeah, something along that line. but I doubt the problem is RS side. or the communication between the master and RSs. in theory the problem may be the verification step where the master is checking the snapshot. I was just trying to figure out where he is spending the time and that "30 minutes to

Re: hbase hdfs snapshots

2015-07-10 Thread Vladimir Rodionov
Matteo, there should be some explanation for 30 min flash_skip snapshot. I think its should be somewhere in NN/Hdfs. This is a huge cluster and NN load is extreme, it is probably does not scale well with # DNs and #files per directory. I presume that NN performance on file operations degrades when

Re: hbase hdfs snapshots

2015-07-10 Thread Matteo Bertozzi
Manifest per Region, not family. we couldn't send them back to the master/table to keep compatibility. 60k region on 1200 RS are ~50 manifest per RS that alone should not take 30sec On Fri, Jul 10, 2015 at 5:21 PM, Vladimir Rodionov wrote: > OK, even with 1 manifest file per region (per column

Re: hbase hdfs snapshots

2015-07-10 Thread Vladimir Rodionov
OK, even with 1 manifest file per region (per column family?) - 60K X 4 = 240,000 new files 8000 per minute, 135 per second. That is probably NN limit. Anyway, the root cause is the same as with reference files during region split: HDFS does not do well on file create/open/close/delete. -Vlad O

Re: hbase hdfs snapshots

2015-07-10 Thread Matteo Bertozzi
@Vladimir there is no hfile link creation on snapshot. we create 1 manifest per region Matteo On Fri, Jul 10, 2015 at 5:06 PM, Vladimir Rodionov wrote: > Being not very familiar with snapshot code, I could speculate only on where > most time is spent ... > > In creating 60K x 4 x K (K is avera

Re: hbase hdfs snapshots

2015-07-10 Thread Vladimir Rodionov
Being not very familiar with snapshot code, I could speculate only on where most time is spent ... In creating 60K x 4 x K (K is average # of store files per region) small HFileLInks? This can be very large # of files. -Vlad On Fri, Jul 10, 2015 at 4:57 PM, Matteo Bertozzi wrote: > the total

Re: hbase hdfs snapshots

2015-07-10 Thread Matteo Bertozzi
the total time taken by a snapshot should be bounded by the slowest machine. we send a notification to each RS and each RS execute the snapshot operation for each region. can you track down what is slow in your case? clone has to create a reference for each file, and that is a master operation, an

Re: hbase hdfs snapshots

2015-07-10 Thread Jean-Marc Spaggiari
Hi Rahul, Have you identified with it takes those 30 minutes? Is the table balances correctly across the servers? Form the logs, are you able to identify what takes that much time? JM 2015-07-10 18:46 GMT-04:00 rahul gidwani : > Hi Matteo, > > We do SKIP_FLUSH. We have 1200+ regionservers with

Re: hbase hdfs snapshots

2015-07-10 Thread rahul gidwani
Hi Matteo, We do SKIP_FLUSH. We have 1200+ regionservers with a single table with 60k regions and 4 column families. It takes around 30 minutes to snapshot this table using manifests compared to just seconds doing this with hdfs. Cloning this table takes considerably longer. For cases where som

[jira] [Created] (HBASE-14059) We should add a RS to the dead servers list if admin calls fail more than a threshold

2015-07-10 Thread Esteban Gutierrez (JIRA)
Esteban Gutierrez created HBASE-14059: - Summary: We should add a RS to the dead servers list if admin calls fail more than a threshold Key: HBASE-14059 URL: https://issues.apache.org/jira/browse/HBASE-14059

[jira] [Created] (HBASE-14058) Stabilize heap memory tuner

2015-07-10 Thread Abhilash (JIRA)
Abhilash created HBASE-14058: Summary: Stabilize heap memory tuner Key: HBASE-14058 URL: https://issues.apache.org/jira/browse/HBASE-14058 Project: HBase Issue Type: Improvement Compone

[jira] [Resolved] (HBASE-14056) regionserver.HRegion: Failed getting lock in batch put

2015-07-10 Thread stack (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-14056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack resolved HBASE-14056. --- Resolution: Invalid Please do not file an issue and ask a question on the mailing list. Resolving as invalid

[jira] [Resolved] (HBASE-13897) OOM may occur when Import imports a row with too many KeyValues

2015-07-10 Thread Ted Yu (JIRA)
t;Reporter: Liu Junhong >Assignee: Liu Junhong > Fix For: 2.0.0, 0.98.14, 1.3.0 > > Attachments: 13897-v2.txt, > HBASE-13897-0.98-20150710-suitable_for_hadoop1.patch, > HBASE-13897-0.98-20150710.patch, HBASE-13897-0.98.patch, > HBASE-1

[jira] [Created] (HBASE-14057) HBase shell user_permission should list super users defined on hbase-site.xml

2015-07-10 Thread Wellington Chevreuil (JIRA)
Wellington Chevreuil created HBASE-14057: Summary: HBase shell user_permission should list super users defined on hbase-site.xml Key: HBASE-14057 URL: https://issues.apache.org/jira/browse/HBASE-14057

[jira] [Created] (HBASE-14056) regionserver.HRegion: Failed getting lock in batch put

2015-07-10 Thread congcong (JIRA)
congcong created HBASE-14056: Summary: regionserver.HRegion: Failed getting lock in batch put Key: HBASE-14056 URL: https://issues.apache.org/jira/browse/HBASE-14056 Project: HBase Issue Type: Bu

[jira] [Resolved] (HBASE-13387) Add ByteBufferedCell an extension to Cell

2015-07-10 Thread Anoop Sam John (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-13387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John resolved HBASE-13387. Resolution: Fixed Pushed. Thanks for the quick review Ram. > Add ByteBufferedCell an exten

[jira] [Reopened] (HBASE-13387) Add ByteBufferedCell an extension to Cell

2015-07-10 Thread Anoop Sam John (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-13387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John reopened HBASE-13387: My bad.. Need to apply an addendum. ByteBufferedCell initially was made as Interface and so a