Adam Fuchs created ACCUMULO-1621:
------------------------------------
Summary: document the use of the init install scripts in the user
manual
Key: ACCUMULO-1621
URL: https://issues.apache.org/jira/browse/ACCUMULO-1621
Project: Accumulo
Issue Type: Improvement
Components: docs
Reporter: Adam Fuchs
Priority: Minor
Accumulo comes with a nice set of scripts to set up an accumulo user and
install init scripts. It would be nice to have a concise set of installation
instructions that leverage those scripts. I put together a list of instructions
the other day for one of our installs, that is included below. It needs review,
testing, and integration into the standard docs (user manual + README in some
form).
1. Accumulo 1.5.x should be installed using the scripts in the scripts
directory under ACCUMULO_HOME, which should be /usr/lib/accumulo. Make a
symbolic link to the /usr/lib/accumulo_1.5.0 directory (or whatever version
you're using), which is where you should put the package.
2. Zookeeper must be installed on all machines, but it only needs to be running
on the zookeeper nodes.
3. Make sure the HDFS /user/accumulo exists and is owned by the accumulo user
(for the trash collection issue).
4. Make sure dfs.durable.sync (or dfs.support.append on some platforms) is
enabled. Reboot HDFS after this is set.
5. For Accumulo with encryption, use the encryption settings from
conf/examples/crypto/accumulo-site.xml, but the other settings from
conf/examples/3GB/native-standalone* for performance. You can also bump up the
memory settings for cache and memory maps according to resources available on
the cluster.
6. Make sure to set the instance.secret using a password generator.
7. Make sure the accumulo-site.xml is set to only be readable by the accumulo
user.
8. Change the accumulo-monitor user to accumulo (substitute accumulo for
accumulo_monitor in lines 28, 31, 35 of scripts/monitor-only-init.sh). This is
one way of getting past the security restrictions on the accumulo-site.xml file
and on the accumulo directory in HDFS. An alternative would be to give the
accumulo_monitor user access to those resources.
9. From the scripts directory, run ./master-only-init.sh, ./gc-only-init.sh,
and ./monitor-only-init.sh on the master node.
10. chown -R accumulo /usr/lib/accumulo*
11. scp or rsync the configured accumulo directory through the cluster
12. From the scripts directory, run ./tserver-only-init.sh on each of the
tservers.
13. Start up all the processes using "service accumulo-master start" or the
appropriate commands on each server.
14. Check that all the processes started using jps -m, and check that the right
number of tservers started on the monitor page.
15. Test! Accumulo shell first, then CI if you're ambitious.
16. Monitor the logs via the monitor page periodically over the next half hour
to see if there are any errors or warnings. Some things don't cause errors at
the API level for a while, but they show up earlier in the logs.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira