Re: HBase Down Scaling

2017-02-09 Thread Anthony Nguyen
Nodes can be added and removed
>
> How are you able to downscale the Core Nodes?
>
> Thanks
>
> On Fri, Feb 10, 2017 at 10:44 AM, Anthony Nguyen <anthony.an.ngu...@gmail.com> wrote:
>
>> Hey Devi,
>>
>> I'm able to successfully scale my HBase

Links left in data/default/table_name directory after major compaction

2016-03-03 Thread Anthony Nguyen
Hi all, For a table that was restored from a snapshot in HBase 1.1.2, is it normal for some link files to still remain in data/default/table_name? Another major_compact clears these link files, so shouldn't they have been deleted the first time around? Thanks!

Disable table command hangs in hbase shell in HBase 1.1.2

2016-02-16 Thread Anthony Nguyen
Hi all, I'm running into an issue in HBase 1.1.2 where, after a cluster has been running for some time, disabling a table in the shell has no effect and hangs until the timeout limit is reached. Looking at the master logs, there is only one message that has anything to do with it: 2016-02-12 13:16:23,237

Duplicate HFiles during bulk load in 0.98

2015-12-09 Thread Anthony Nguyen
Hi all, Having duplicate HFiles within a region should result in no change to the data, correct? The reason I ask is because I'm seeing duplicate HFiles being created during a bulk load - they have the same row count, same size, and same firstKey and lastKey. Is this normal behavior? Will a

Upgrading HBase 0.98.4+ to 1.x Questions

2015-12-07 Thread Anthony Nguyen
Hi all, I believe I have successfully done a rolling upgrade of a small test cluster that I stood up to test the upgrade path (simply replaced the jars and rebooted). One question I have is about the now-default use of HFile v3 as opposed to v2, which was used in 0.98 - what's the

hbase.bucketcache.percentage.in.combinedcache with ioengine= on HBase 0.98

2015-10-06 Thread Anthony Nguyen
Hi all, I'm running HBase 0.98, with the complicated percentage.in.combinedcache setting. What's the best way to set the percentage.in.combinedcache setting when using a file on SSD storage as the location for ioengine? The SSD drive is dedicated for the cache. I don't want to modify the

Re: Prefetching Indexes of HFiles

2015-09-25 Thread Anthony Nguyen
7?focusedCommentId=13031489=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13031489
>
> Cheers
>
> On Thu, Sep 24, 2015 at 1:16 PM, Anthony Nguyen <anthony.an.ngu...@gmail.com> wrote:
>
> > Hi all,
> >
> > Is there a better way to warm HFile indexes other th

Prefetching Indexes of HFiles

2015-09-24 Thread Anthony Nguyen
Hi all, Is there a better way to warm HFile indexes other than scanning through my datasets? I did see PREFETCH_BLOCKS_ON_OPEN, but the warning that it "is not a good idea if the data to be preloaded will not fit into the blockcache" makes me wary. Why would this be a bad idea? Thanks!

Defining WAL location within HDFS outside of rootdir

2015-09-14 Thread Anthony Nguyen
Hi all, Is it possible to define explicitly where the WAL / oldWAL is stored, separate from where rootdir is? If not, would it break anything for me to change HBase to allow for this? The context is to optimize the WAL location for the use case of a shared filesystem. Thanks!

Re: Ramifications of minimizing use of .tmp directories / renames in HBase when using S3 as backing store

2015-09-09 Thread Anthony Nguyen
ps://issues.apache.org/jira/browse/HBASE-14090
> There is a document in it, if you are interested in reading about the internals.
>
> Matteo
>
> On Wed, Sep 9, 2015 at 4:23 PM, Anthony Nguyen <anthony.an.ngu...@gmail.com> wrote:
>
> > Hi all,

Ramifications of minimizing use of .tmp directories / renames in HBase when using S3 as backing store

2015-09-09 Thread Anthony Nguyen
Hi all, I'm investigating the use of S3 as a backing store for HBase. Would there be any major issues with modifying HBase so that, when an S3 location is set for the rootdir, writes to .tmp are removed or minimized, with data written directly to the final destination instead? The reason I'd

Re: Ramifications of minimizing use of .tmp directories / renames in HBase when using S3 as backing store

2015-09-09 Thread Anthony Nguyen