[01/11] hbase git commit: Update documentation from master for 1.0.0RC3

enis Sun, 01 Feb 2015 17:34:26 -0800

Repository: hbase
Updated Branches:
  refs/heads/branch-1.0 05ecd4e1d -> fba353df7



http://git-wip-us.apache.org/repos/asf/hbase/blob/fba353df/src/main/asciidoc/_chapters/upgrading.adoc
----------------------------------------------------------------------
diff --git a/src/main/asciidoc/_chapters/upgrading.adoc 
b/src/main/asciidoc/_chapters/upgrading.adoc
index e90b98a..ab3f154 100644
--- a/src/main/asciidoc/_chapters/upgrading.adoc
+++ b/src/main/asciidoc/_chapters/upgrading.adoc
@@ -27,9 +27,9 @@
 :icons: font
 :experimental:
 
-You cannot skip major versions upgrading. If you are upgrading from version 
0.90.x to 0.94.x, you must first go from 0.90.x to 0.92.x and then go from 
0.92.x to 0.94.x.
+You cannot skip major versions when upgrading. If you are upgrading from 
version 0.90.x to 0.94.x, you must first go from 0.90.x to 0.92.x and then go 
from 0.92.x to 0.94.x.
 
-Note:It may be possible to skip across versions -- for example go from 0.92.2 
straight to 0.98.0 just following the 0.96.x upgrade instructions -- but we 
have not tried it so cannot say whether it works or not.
+NOTE: It may be possible to skip across versions -- for example go from 0.92.2 
straight to 0.98.0 just following the 0.96.x upgrade instructions -- but these 
scenarios are untested.
 
 Review <<configuration>>, in particular <<hadoop>>.
 
@@ -41,7 +41,7 @@ HBase has two versioning schemes, pre-1.0 and post-1.0. Both 
are detailed below.
 [[hbase.versioning.post10]]
 === Post 1.0 versions
 
-Starting with 1.0.0 release, HBase uses link:http://semver.org/[Semantic 
Versioning] for its release versioning. In summary:
+Starting with the 1.0.0 release, HBase uses link:http://semver.org/[Semantic 
Versioning] for its release versioning. In summary:
 
 .Given a version number MAJOR.MINOR.PATCH, increment the:
 * MAJOR version when you make incompatible API changes,
@@ -90,7 +90,7 @@ In addition to the usual API versioning considerations HBase 
has other compatibi
 .Operational Compatibility
 * Metric changes
 * Behavioral changes of services
-*Web page APIs
+* Web page APIs
 
 .Summary
 * A patch upgrade is a drop-in replacement. Any change that is not Java binary 
compatible would not be allowed.footnote:[See 
http://docs.oracle.com/javase/specs/jls/se7/html/jls-13.html.]
@@ -149,25 +149,25 @@ Our first "Development" Series was the 0.89 set that came 
out ahead of HBase 0.9
 
 [[hbase.binary.compatibility]]
 .Binary Compatibility
-When we say two HBase versions are compatible, we mean that the versions are 
wire and binary compatible. Compatible HBase versions means that clients can 
talk to compatible but differently versioned servers. It means too that you can 
just swap out the jars of one version and replace them with the jars of 
another, compatible version and all will just work. Unless otherwise specified, 
HBase point versions are (mostly) binary compatible. You can safely do rolling 
upgrades between binary compatible versions; i.e. across point versions: e.g. 
from 0.94.5 to 0.94.6. See link:[Does compatibility between versions also mean 
binary compatibility?] discussion on the hbaes dev mailing list.
+When we say two HBase versions are compatible, we mean that the versions are 
wire and binary compatible. Compatible HBase versions means that clients can 
talk to compatible but differently versioned servers. It means too that you can 
just swap out the jars of one version and replace them with the jars of 
another, compatible version and all will just work. Unless otherwise specified, 
HBase point versions are (mostly) binary compatible. You can safely do rolling 
upgrades between binary compatible versions; i.e. across point versions: e.g. 
from 0.94.5 to 0.94.6. See link:[Does compatibility between versions also mean 
binary compatibility?] discussion on the HBase dev mailing list.
 
 [[hbase.rolling.upgrade]]
 === Rolling Upgrades
 
-A rolling upgrade is the process by which you update the servers in your 
cluster a server at a time. You can rolling upgrade across HBase versions if 
they are binary or wire compatible. See <<hbase.rolling.restart>> for more on 
what this means. Coarsely, a rolling upgrade is a graceful stop each server, 
update the software, and then restart. You do this for each server in the 
cluster. Usually you upgrade the Master first and then the regionservers. See 
<<rolling>> for tools that can help use the rolling upgrade process.
+A rolling upgrade is the process by which you update the servers in your 
cluster a server at a time. You can rolling upgrade across HBase versions if 
they are binary or wire compatible. See <<hbase.rolling.restart>> for more on 
what this means. Coarsely, a rolling upgrade is a graceful stop each server, 
update the software, and then restart. You do this for each server in the 
cluster. Usually you upgrade the Master first and then the RegionServers. See 
<<rolling>> for tools that can help use the rolling upgrade process.
 
-For example, in the below, hbase was symlinked to the actual hbase install. On 
upgrade, before running a rolling restart over the cluser, we changed the 
symlink to point at the new HBase software version and then ran
+For example, in the below, HBase was symlinked to the actual HBase install. On 
upgrade, before running a rolling restart over the cluser, we changed the 
symlink to point at the new HBase software version and then ran
 
 [source,bash]
 ----
 $ HADOOP_HOME=~/hadoop-2.6.0-CRC-SNAPSHOT ~/hbase/bin/rolling-restart.sh 
--config ~/conf_hbase
 ----
 
-The rolling-restart script will first gracefully stop and restart the master, 
and then each of the regionservers in turn. Because the symlink was changed, on 
restart the server will come up using the new hbase version. Check logs for 
errors as the rolling upgrade proceeds.
+The rolling-restart script will first gracefully stop and restart the master, 
and then each of the RegionServers in turn. Because the symlink was changed, on 
restart the server will come up using the new HBase version. Check logs for 
errors as the rolling upgrade proceeds.
 
 [[hbase.rolling.restart]]
 .Rolling Upgrade Between Versions that are Binary/Wire Compatible
-Unless otherwise specified, HBase point versions are binary compatible. You 
can do a <<hbase.rolling.upgrade>> between hbase point versions. For example, 
you can go to 0.94.6 from 0.94.5 by doing a rolling upgrade across the cluster 
replacing the 0.94.5 binary with a 0.94.6 binary.
+Unless otherwise specified, HBase point versions are binary compatible. You 
can do a <<hbase.rolling.upgrade>> between HBase point versions. For example, 
you can go to 0.94.6 from 0.94.5 by doing a rolling upgrade across the cluster 
replacing the 0.94.5 binary with a 0.94.6 binary.
 
 In the minor version-particular sections below, we call out where the versions 
are wire/protocol compatible and in this case, it is also possible to do a 
<<hbase.rolling.upgrade>>. For example, in <<upgrade1.0.rolling.upgrade>>, we 
state that it is possible to do a rolling upgrade between hbase-0.98.x and 
hbase-1.0.0.
 
@@ -176,7 +176,7 @@ In the minor version-particular sections below, we call out 
where the versions a
 [[upgrade1.0]]
 === Upgrading from 0.98.x to 1.0.x
 
-In this section we first note the significant changes that come in with 1.0.0 
HBase and then we go over the upgrade process.  Be sure to read the significant 
changes section with care so you avoid surprises.
+In this section we first note the significant changes that come in with 1.0.0 
HBase and then we go over the upgrade process. Be sure to read the significant 
changes section with care so you avoid surprises.
 
 ==== Changes of Note!
 
@@ -188,30 +188,30 @@ See <<zookeeper.requirements>>.
 
 [[default.ports.changed]]
 .HBase Default Ports Changed
-The ports used by HBase changed.  The used to be in the 600XX range.  In 
hbase-1.0.0 they have been moved up out of the ephemeral port range and are 
160XX instead (Master web UI was 60010 and is now 16010; the RegionServer web 
UI was 60030 and is now 16030, etc). If you want to keep the old port 
locations, copy the port setting configs from _hbase-default.xml_ into 
_hbase-site.xml_, change them back to the old values from hbase-0.98.x era, and 
ensure you've distributed your configurations before you restart.
+The ports used by HBase changed. They used to be in the 600XX range. In HBase 
1.0.0 they have been moved up out of the ephemeral port range and are 160XX 
instead (Master web UI was 60010 and is now 16010; the RegionServer web UI was 
60030 and is now 16030, etc.). If you want to keep the old port locations, copy 
the port setting configs from _hbase-default.xml_ into _hbase-site.xml_, change 
them back to the old values from the HBase 0.98.x era, and ensure you've 
distributed your configurations before you restart.
 
 [[upgrade1.0.hbase.bucketcache.percentage.in.combinedcache]]
 .hbase.bucketcache.percentage.in.combinedcache configuration has been REMOVED
-You may have made use of this configuration if you are using BucketCache. If 
NOT using BucketCache, this change does not effect you. Its removal means that 
your L1 LruBlockCache is now sized using `hfile.block.cache.size` -- i.e. the 
way you would size the onheap L1 LruBlockCache if you were NOT doing 
BucketCache -- and the BucketCache size is not whatever the setting for 
hbase.bucketcache.size is. You may need to adjust configs to get the 
LruBlockCache and BucketCache sizes set to what they were in 0.98.x and 
previous. If you did not set this config., its default value was 0.9. If you do 
nothing, your BucketCache will increase in size by 10%. Your L1 LruBlockCache 
will become hfile.block.cache.size times your java heap size 
(`hfile.block.cache.size` is a float between 0.0 and 1.0). To read more, see 
link:https://issues.apache.org/jira/browse/HBASE-11520[HBASE-11520 Simplify 
offheap cache config by removing the confusing 
"hbase.bucketcache.percentage.in.combinedcache"].
+You may have made use of this configuration if you are using BucketCache. If 
NOT using BucketCache, this change does not effect you. Its removal means that 
your L1 LruBlockCache is now sized using `hfile.block.cache.size` -- i.e. the 
way you would size the on-heap L1 LruBlockCache if you were NOT doing 
BucketCache -- and the BucketCache size is not whatever the setting for 
`hbase.bucketcache.size` is. You may need to adjust configs to get the 
LruBlockCache and BucketCache sizes set to what they were in 0.98.x and 
previous. If you did not set this config., its default value was 0.9. If you do 
nothing, your BucketCache will increase in size by 10%. Your L1 LruBlockCache 
will become `hfile.block.cache.size` times your java heap size 
(`hfile.block.cache.size` is a float between 0.0 and 1.0). To read more, see 
link:https://issues.apache.org/jira/browse/HBASE-11520[HBASE-11520 Simplify 
offheap cache config by removing the confusing 
"hbase.bucketcache.percentage.in.combinedcache"].
 
 [[hbase-12068]]
-.If you have your own customer filters....
+.If you have your own customer filters.
 See the release notes on the issue 
link:https://issues.apache.org/jira/browse/HBASE-12068[HBASE-12068 [Branch-1\] 
Avoid need to always do KeyValueUtil#ensureKeyValue for Filter transformCell]; 
be sure to follow the recommendations therein.
 
 [[dlr]]
 .Distributed Log Replay
-<<distributed.log.replay>> is off by default in hbase-1.0. Enabling it can 
make a big difference improving HBase MTTR. Enable this feature if you are 
doing a clean stop/start when you are upgrading. You cannot rolling upgrade on 
to this feature (caveat if you are running on a version of hbase in excess of 
hbase-0.98.4 -- see 
link:https://issues.apache.org/jira/browse/HBASE-12577[HBASE-12577 Disable 
distributed log replay by default] for more).
+<<distributed.log.replay>> is off by default in HBase 1.0.0. Enabling it can 
make a big difference improving HBase MTTR. Enable this feature if you are 
doing a clean stop/start when you are upgrading. You cannot rolling upgrade to 
this feature (caveat if you are running on a version of HBase in excess of 
HBase 0.98.4 -- see 
link:https://issues.apache.org/jira/browse/HBASE-12577[HBASE-12577 Disable 
distributed log replay by default] for more).
 
 [[upgrade1.0.rolling.upgrade]]
 ==== Rolling upgrade from 0.98.x to HBase 1.0.0
 .From 0.96.x to 1.0.0
-NOTE: You cannot do a <<hbase.rolling.upgrade,rolling upgrade>> from 0.96.x to 
1.0.0 without first doing a rolling upgrade to 0.98.x. See comment in 
link:https://issues.apache.org/jira/browse/HBASE-11164?focusedCommentId=14182330&amp;page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&#35;comment-14182330[HBASE-11164
 Document and test rolling updates from 0.98 -> 1.0] for the why. Also because 
hbase-1.0.0 enables hfilev3 by default, 
link:https://issues.apache.org/jira/browse/HBASE-9801[HBASE-9801 Change the 
default HFile version to V3], and support for hfilev3 only arrives in 0.98, 
this is another reason you cannot rolling upgrade from hbase-0.96.x; if the 
rolling upgrade stalls, the 0.96.x servers cannot open files written by the 
servers running the newer hbase-1.0.0 hfilev3 writing servers. 
+NOTE: You cannot do a <<hbase.rolling.upgrade,rolling upgrade>> from 0.96.x to 
1.0.0 without first doing a rolling upgrade to 0.98.x. See comment in 
link:https://issues.apache.org/jira/browse/HBASE-11164?focusedCommentId=14182330&amp;page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&#35;comment-14182330[HBASE-11164
 Document and test rolling updates from 0.98 -> 1.0] for the why. Also because 
HBase 1.0.0 enables HFile v3 by default, 
link:https://issues.apache.org/jira/browse/HBASE-9801[HBASE-9801 Change the 
default HFile version to V3], and support for HFile v3 only arrives in 0.98, 
this is another reason you cannot rolling upgrade from HBase 0.96.x; if the 
rolling upgrade stalls, the 0.96.x servers cannot open files written by the 
servers running the newer HBase 1.0.0 with HFile's of version 3.
 
-There are no known issues running a <<hbase.rolling.upgrade,rolling upgrade>> 
from hbase-0.98.x to hbase-1.0.0.
+There are no known issues running a <<hbase.rolling.upgrade,rolling upgrade>> 
from HBase 0.98.x to HBase 1.0.0.
 
 [[upgrade1.0.from.0.94]]
 ==== Upgrading to 1.0 from 0.94
-You cannot rolling upgrade from 0.94.x to 1.x.x.  You must stop your cluster, 
install the 1.x.x software, run the migration described at 
<<executing.the.0.96.upgrade>> (substituting 1.x.x. wherever we make mention of 
0.96.x in the section below), and then restart.  Be sure to upgrade your 
zookeeper if it is a version less than the required 3.4.x.
+You cannot rolling upgrade from 0.94.x to 1.x.x.  You must stop your cluster, 
install the 1.x.x software, run the migration described at 
<<executing.the.0.96.upgrade>> (substituting 1.x.x. wherever we make mention of 
0.96.x in the section below), and then restart. Be sure to upgrade your 
ZooKeeper if it is a version less than the required 3.4.x.
 
 [[upgrade0.98]]
 === Upgrading from 0.96.x to 0.98.x
@@ -230,7 +230,7 @@ A rolling upgrade from 0.94.x directly to 0.98.x does not 
work. The upgrade path
 ==== The "Singularity"
 
 .HBase 0.96.x was EOL'd, September 1st, 2014
-NOTE: Do not deploy 0.96.x  Deploy a 0.98.x at least. See 
link:https://issues.apache.org/jira/browse/HBASE-11642[EOL 0.96].
+NOTE: Do not deploy 0.96.x  Deploy at least 0.98.x. See 
link:https://issues.apache.org/jira/browse/HBASE-11642[EOL 0.96].
 
 You will have to stop your old 0.94.x cluster completely to upgrade. If you 
are replicating between clusters, both clusters will have to go down to 
upgrade. Make sure it is a clean shutdown. The less WAL files around, the 
faster the upgrade will run (the upgrade will split any log files it finds in 
the filesystem as part of the upgrade process). All clients must be upgraded to 
0.96 too.
 
@@ -242,7 +242,7 @@ The API has changed. You will need to recompile your code 
against 0.96 and you m
 .HDFS and ZooKeeper must be up!
 NOTE: HDFS and ZooKeeper should be up and running during the upgrade process.
 
-hbase-0.96.0 comes with an upgrade script. Run
+HBase 0.96.0 comes with an upgrade script. Run
 
 [source,bash]
 ----
@@ -251,11 +251,11 @@ $ bin/hbase upgrade
 to see its usage. The script has two main modes: `-check`, and `-execute`.
 
 .check
-The check step is run against a running 0.94 cluster. Run it from a downloaded 
0.96.x binary. The check step is looking for the presence of HFileV1 files. 
These are unsupported in hbase-0.96.0. To purge them -- have them rewritten as 
HFileV2 -- you must run a compaction.
+The check step is run against a running 0.94 cluster. Run it from a downloaded 
0.96.x binary. The check step is looking for the presence of HFile v1 files. 
These are unsupported in HBase 0.96.0. To have them rewritten as HFile v2 you 
must run a compaction.
 
-The check step prints stats at the end of its run (grep for `âResult:â` in 
the log) printing absolute path of the tables it scanned, any HFileV1 files 
found, the regions containing said files (the regions we need to major compact 
to purge the HFileV1s), and any corrupted files if any found. A corrupt file is 
unreadable, and so is undefined (neither HFileV1 nor HFileV2).
+The check step prints stats at the end of its run (grep for `âResult:â` in 
the log) printing absolute path of the tables it scanned, any HFile v1 files 
found, the regions containing said files (these regions will need a major 
compaction), and any corrupted files if found. A corrupt file is unreadable, 
and so is undefined (neither HFile v1 nor HFile v2).
 
-To run the check step, run 
+To run the check step, run
 
 [source,bash]
 ----
@@ -286,23 +286,23 @@ 
hdfs://localhost:41020/myHBase/usertable/ecdd3eaee2d2fcf8184ac025555bb2af
 There are some HFileV1, or corrupt files (files with incorrect major version)
 ----
 
-In the above sample output, there are two HFileV1 in two regions, and one 
corrupt file. Corrupt files should probably be removed. The regions that have 
HFileV1s need to be major compacted. To major compact, start up the hbase shell 
and review how to compact an individual region. After the major compaction is 
done, rerun the check step and the HFileV1s shoudl be gone, replaced by HFileV2 
instances.
+In the above sample output, there are two HFile v1 files in two regions, and 
one corrupt file. Corrupt files should probably be removed. The regions that 
have HFile v1s need to be major compacted. To major compact, start up the hbase 
shell and review how to compact an individual region. After the major 
compaction is done, rerun the check step and the HFile v1 files should be gone, 
replaced by HFile v2 instances.
 
-By default, the check step scans the hbase root directory (defined as 
hbase.rootdir in the configuration). To scan a specific directory only, pass 
the -dir option.
+By default, the check step scans the HBase root directory (defined as 
`hbase.rootdir` in the configuration). To scan a specific directory only, pass 
the `-dir` option.
 [source,bash]
 ----
 $ bin/hbase upgrade -check -dir /myHBase/testTable
 ----
-The above command would detect HFileV1s in the /myHBase/testTable directory.
+The above command would detect HFile v1 files in the _/myHBase/testTable_ 
directory.
 
-Once the check step reports all the HFileV1 files have been rewritten, it is 
safe to proceed with the upgrade.
+Once the check step reports all the HFile v1 files have been rewritten, it is 
safe to proceed with the upgrade.
 
 .execute
-After the _check_ step shows the cluster is free of HFileV1, it is safe to 
proceed with the upgrade. Next is the _execute_ step. You must *SHUTDOWN YOUR 
0.94.x CLUSTER* before you can run the execute step. The execute step will not 
run if it detects running HBase masters or regionservers.
+After the _check_ step shows the cluster is free of HFile v1, it is safe to 
proceed with the upgrade. Next is the _execute_ step. You must *SHUTDOWN YOUR 
0.94.x CLUSTER* before you can run the execute step. The execute step will not 
run if it detects running HBase masters or RegionServers.
 
 [NOTE]
 ====
-HDFS and ZooKeeper should be up and running during the upgrade process. If 
zookeeper is managed by HBase, then you can start zookeeper so it is available 
to the upgrade by running 
+HDFS and ZooKeeper should be up and running during the upgrade process. If 
zookeeper is managed by HBase, then you can start zookeeper so it is available 
to the upgrade by running
 [source,bash]
 ----
 $ ./hbase/bin/hbase-daemon.sh start zookeeper
@@ -317,7 +317,7 @@ The execute upgrade step is made of three substeps.
 
 * WAL Log Splitting: If the 0.94.x cluster shutdown was not clean, we'll split 
WAL logs as part of migration before we startup on 0.96.0. This WAL splitting 
runs slower than the native distributed WAL splitting because it is all inside 
the single upgrade process (so try and get a clean shutdown of the 0.94.0 
cluster if you can).
 
-To run the _execute_ step, make sure that first you have copied hbase-0.96.0 
binaries everywhere under servers and under clients. Make sure the 0.94.0 
cluster is down. Then do as follows:
+To run the _execute_ step, make sure that first you have copied HBase 0.96.0 
binaries everywhere under servers and under clients. Make sure the 0.94.0 
cluster is down. Then do as follows:
 [source,bash]
 ----
 $ bin/hbase upgrade -execute
@@ -339,7 +339,7 @@ Starting Log splitting
 ...
 Successfully completed Log splitting
 ----
-         
+
 If the output from the execute step looks good, stop the zookeeper instance 
you started to do the upgrade:
 [source,bash]
 ----
@@ -376,22 +376,22 @@ The migration is a one-time event. However, every time 
your cluster starts, `MET
 
 [[upgrade0.94]]
 === Upgrading from 0.92.x to 0.94.x
-We used to think that 0.92 and 0.94 were interface compatible and that you can 
do a rolling upgrade between these versions but then we figured that 
link:https://issues.apache.org/jira/browse/HBASE-5357[";>]HBASE-5357 Use builder 
pattern in HColumnDescriptor] changed method signatures so rather than return 
void they instead return HColumnDescriptor. This will 
throw`java.lang.NoSuchMethodError: 
org.apache.hadoop.hbase.HColumnDescriptor.setMaxVersions(I)V` so 0.92 and 0.94 
are NOT compatible. You cannot do a rolling upgrade between them.
+We used to think that 0.92 and 0.94 were interface compatible and that you can 
do a rolling upgrade between these versions but then we figured that 
link:https://issues.apache.org/jira/browse/HBASE-5357[HBASE-5357 Use builder 
pattern in HColumnDescriptor] changed method signatures so rather than return 
`void` they instead return `HColumnDescriptor`. This will 
throw`java.lang.NoSuchMethodError: 
org.apache.hadoop.hbase.HColumnDescriptor.setMaxVersions(I)V` so 0.92 and 0.94 
are NOT compatible. You cannot do a rolling upgrade between them.
 
 [[upgrade0.92]]
 === Upgrading from 0.90.x to 0.92.x
 ==== Upgrade Guide
-ou will find that 0.92.0 runs a little differently to 0.90.x releases. Here 
are a few things to watch out for upgrading from 0.90.x to 0.92.0.
+You will find that 0.92.0 runs a little differently to 0.90.x releases. Here 
are a few things to watch out for upgrading from 0.90.x to 0.92.0.
 
 .tl:dr
 [NOTE]
 ====
-If you've not patience, here are the important things to know upgrading.
+These are the important things to know before upgrading.
 . Once you upgrade, you canât go back.
 
 . MSLAB is on by default. Watch that heap usage if you have a lot of regions.
 
-. Distributed Log Splitting is on by default. It should make region server 
failover faster.
+. Distributed Log Splitting is on by default. It should make RegionServer 
failover faster.
 
 . Thereâs a separate tarball for security.
 
@@ -399,10 +399,10 @@ If you've not patience, here are the important things to 
know upgrading.
 ====
 
 .You canât go back!
-To move to 0.92.0, all you need to do is shutdown your cluster, replace your 
hbase 0.90.x with hbase 0.92.0 binaries (be sure you clear out all 0.90.x 
instances) and restart (You cannot do a rolling restart from 0.90.x to 0.92.x 
-- you must restart). On startup, the `.META.` table content is rewritten 
removing the table schema from the `info:regioninfo` column. Also, any flushes 
done post first startup will write out data in the new 0.92.0 file format, 
<<hfilev2>>. This means you cannot go back to 0.90.x once youâve started 
HBase 0.92.0 over your HBase data directory.
+To move to 0.92.0, all you need to do is shutdown your cluster, replace your 
HBase 0.90.x with HBase 0.92.0 binaries (be sure you clear out all 0.90.x 
instances) and restart (You cannot do a rolling restart from 0.90.x to 0.92.x 
-- you must restart). On startup, the `.META.` table content is rewritten 
removing the table schema from the `info:regioninfo` column. Also, any flushes 
done post first startup will write out data in the new 0.92.0 file format, 
<<hfilev2>>. This means you cannot go back to 0.90.x once youâve started 
HBase 0.92.0 over your HBase data directory.
 
 .MSLAB is ON by default
-In 0.92.0, the 
`<<hbase.hregion.memstore.mslab.enabled,hbase.hregion.memstore.mslab.enabled>>` 
flag is set to `true` (See <<gcpause>>). In 0.90.x it was false. When it is 
enabled, memstores will step allocate memory in MSLAB 2MB chunks even if the 
memstore has zero or just a few small elements. This is fine usually but if you 
had lots of regions per regionserver in a 0.90.x cluster (and MSLAB was off), 
you may find yourself OOME'ing on upgrade because the `thousands of regions * 
number of column families * 2MB MSLAB` (at a minimum) puts your heap over the 
top. Set `hbase.hregion.memstore.mslab.enabled` to `false` or set the MSLAB 
size down from 2MB by setting `hbase.hregion.memstore.mslab.chunksize` to 
something less.
+In 0.92.0, the 
`<<hbase.hregion.memstore.mslab.enabled,hbase.hregion.memstore.mslab.enabled>>` 
flag is set to `true` (See <<gcpause>>). In 0.90.x it was false. When it is 
enabled, memstores will step allocate memory in MSLAB 2MB chunks even if the 
memstore has zero or just a few small elements. This is fine usually but if you 
had lots of regions per RegionServer in a 0.90.x cluster (and MSLAB was off), 
you may find yourself OOME'ing on upgrade because the `thousands of regions * 
number of column families * 2MB MSLAB` (at a minimum) puts your heap over the 
top. Set `hbase.hregion.memstore.mslab.enabled` to `false` or set the MSLAB 
size down from 2MB by setting `hbase.hregion.memstore.mslab.chunksize` to 
something less.
 
 [[dls]]
 .Distributed Log Splitting is on by default
@@ -412,18 +412,18 @@ Previous, WAL logs on crash were split by the Master 
alone. In 0.92.0, log split
 In 0.92.0, <<hfilev2>> indices and bloom filters take up residence in the same 
LRU used caching blocks that come from the filesystem. In 0.90.x, the HFile v1 
indices lived outside of the LRU so they took up space even if the index was on 
a âcoldâ file, one that wasnât being actively used. With the indices now 
in the LRU, you may find you have less space for block caching. Adjust your 
block cache accordingly. See the <<block.cache>> for more detail. The block 
size default size has been changed in 0.92.0 from 0.2 (20 percent of heap) to 
0.25.
 
 .On the Hadoop version to use
-Run 0.92.0 on Hadoop 1.0.x (or CDH3u3 when it ships). The performance benefits 
are worth making the move. Otherwise, our Hadoop prescription is as it has 
been; you need an Hadoop that supports a working sync. See <<hadoop>>.
+Run 0.92.0 on Hadoop 1.0.x (or CDH3u3). The performance benefits are worth 
making the move. Otherwise, our Hadoop prescription is as it has been; you need 
an Hadoop that supports a working sync. See <<hadoop>>.
 
 If running on Hadoop 1.0.x (or CDH3u3), enable local read. See 
link:http://files.meetup.com/1350427/hug_ebay_jdcryans.pdf[Practical Caching] 
presentation for ruminations on the performance benefits âgoing localâ (and 
for how to enable local reads).
 
 .HBase 0.92.0 ships with ZooKeeper 3.4.2
-If you can, upgrade your zookeeper. If you canât, 3.4.2 clients should work 
against 3.3.X ensembles (HBase makes use of 3.4.2 API).
+If you can, upgrade your ZooKeeper. If you canât, 3.4.2 clients should work 
against 3.3.X ensembles (HBase makes use of 3.4.2 API).
 
 .Online alter is off by default
-In 0.92.0, weâve added an experimental online schema alter facility (See 
<<hbase.online.schema.update.enable,hbase.online.schema.update.enable>>). Its 
off by default. Enable it at your own risk. Online alter and splitting tables 
do not play well together so be sure your cluster quiescent using this feature 
(for now).
+In 0.92.0, weâve added an experimental online schema alter facility (See 
<<hbase.online.schema.update.enable,hbase.online.schema.update.enable>>). It's 
off by default. Enable it at your own risk. Online alter and splitting tables 
do not play well together so be sure your cluster quiescent using this feature 
(for now).
 
 .WebUI
-The webui has had a few additions made in 0.92.0. It now shows a list of the 
regions currently transitioning, recent compactions/flushes, and a process list 
of running processes (usually empty if all is well and requests are being 
handled promptly). Other additions including requests by region, a debugging 
servlet dump, etc.
+The web UI has had a few additions made in 0.92.0. It now shows a list of the 
regions currently transitioning, recent compactions/flushes, and a process list 
of running processes (usually empty if all is well and requests are being 
handled promptly). Other additions including requests by region, a debugging 
servlet dump, etc.
 
 .Security tarball
 We now ship with two tarballs; secure and insecure HBase. Documentation on how 
to setup a secure HBase is on the way.
@@ -432,10 +432,10 @@ We now ship with two tarballs; secure and insecure HBase. 
Documentation on how t
 0.92.0 adds two new features: multi-slave and multi-master replication. The 
way to enable this is the same as adding a new peer, so in order to have 
multi-master you would just run add_peer for each cluster that acts as a master 
to the other slave clusters. Collisions are handled at the timestamp level 
which may or may not be what you want, this needs to be evaluated on a per use 
case basis. Replication is still experimental in 0.92 and is disabled by 
default, run it at your own risk.
 
 .RegionServer now aborts if OOME
-If an OOME, we now have the JVM kill -9 the regionserver process so it goes 
down fast. Previous, a RegionServer might stick around after incurring an OOME 
limping along in some wounded state. To disable this facility, and recommend 
you leave it in place, youâd need to edit the bin/hbase file. Look for the 
addition of the -XX:OnOutOfMemoryError="kill -9 %p" arguments (See 
link:https://issues.apache.org/jira/browse/HBASE-4769[HBASE-4769 - âAbort 
RegionServer Immediately on OOMEâ]).
+If an OOME, we now have the JVM kill -9 the RegionServer process so it goes 
down fast. Previous, a RegionServer might stick around after incurring an OOME 
limping along in some wounded state. To disable this facility, and recommend 
you leave it in place, youâd need to edit the bin/hbase file. Look for the 
addition of the -XX:OnOutOfMemoryError="kill -9 %p" arguments (See 
link:https://issues.apache.org/jira/browse/HBASE-4769[HBASE-4769 - âAbort 
RegionServer Immediately on OOMEâ]).
 
-.HFile V2 and the âBigger, Fewerâ Tendency
-0.92.0 stores data in a new format, <<hfilev2>>. As HBase runs, it will move 
all your data from HFile v1 to HFile v2 format. This auto-migration will run in 
the background as flushes and compactions run. HFile V2 allows HBase run with 
larger regions/files. In fact, we encourage that all HBasers going forward tend 
toward Facebook axiom #1, run with larger, fewer regions. If you have lots of 
regions now -- more than 100s per host -- you should look into setting your 
region size up after you move to 0.92.0 (In 0.92.0, default size is now 1G, up 
from 256M), and then running online merge tool (See 
link:https://issues.apache.org/jira/browse/HBASE-1621[HBASE-1621 merge tool 
should work on online cluster, but disabled table]).
+.HFile v2 and the âBigger, Fewerâ Tendency
+0.92.0 stores data in a new format, <<hfilev2>>. As HBase runs, it will move 
all your data from HFile v1 to HFile v2 format. This auto-migration will run in 
the background as flushes and compactions run. HFile v2 allows HBase run with 
larger regions/files. In fact, we encourage that all HBasers going forward tend 
toward Facebook axiom #1, run with larger, fewer regions. If you have lots of 
regions now -- more than 100s per host -- you should look into setting your 
region size up after you move to 0.92.0 (In 0.92.0, default size is now 1G, up 
from 256M), and then running online merge tool (See 
link:https://issues.apache.org/jira/browse/HBASE-1621[HBASE-1621 merge tool 
should work on online cluster, but disabled table]).
 
 [[upgrade0.90]]
 === Upgrading to HBase 0.90.x from 0.20.x or 0.89.x
@@ -447,4 +447,4 @@ Finally, if upgrading from 0.20.x, check your .META. schema 
in the shell. In the
 ----
 hbase> scan '-ROOT-'
 ----
-in the shell. This will output the current `.META.` schema. Check 
`MEMSTORE_FLUSHSIZE` size. Is it 16kb (16384)? If so, you will need to change 
this (The 'normal'/default value is 64MB (67108864)). Run the script 
`bin/set_meta_memstore_size.rb`. This will make the necessary edit to your 
`.META.` schema. Failure to run this change will make for a slow cluster. See 
link:https://issues.apache.org/jira/browse/HBASE-3499[HBASE-3499 Users 
upgrading to 0.90.0 need to have their .META. table updated with the right 
MEMSTORE_SIZE].
\ No newline at end of file
+in the shell. This will output the current `.META.` schema. Check 
`MEMSTORE_FLUSHSIZE` size. Is it 16kb (16384)? If so, you will need to change 
this (The 'normal'/default value is 64MB (67108864)). Run the script 
`bin/set_meta_memstore_size.rb`. This will make the necessary edit to your 
`.META.` schema. Failure to run this change will make for a slow cluster. See 
link:https://issues.apache.org/jira/browse/HBASE-3499[HBASE-3499 Users 
upgrading to 0.90.0 need to have their .META. table updated with the right 
MEMSTORE_SIZE].

[01/11] hbase git commit: Update documentation from master for 1.0.0RC3

Reply via email to