RPM/DEB package integration with Hadoop
---------------------------------------
Key: HIVE-2313
URL: https://issues.apache.org/jira/browse/HIVE-2313
Project: Hive
Issue Type: New Feature
Components: Build Infrastructure
Affects Versions: 0.8.0
Environment: Java 6, Linux
Reporter: Eric Yang
Fix For: 0.8.0
In the past, Hadoop-related projects each organized their own file layout,
which looked like this:
{noformat}
$PROJECT_HOME/bin
             /conf/$PROJECT-env.sh
             /lib/*.jar
             /docs
             /src
{noformat}
Software is deployed in an isolated directory, and dependent projects use
environment variables to reference each other. This approach works fine for
tarballs and a small software stack. To scale the Hadoop software stack,
HADOOP-6255 proposes improving the file system layout into a merged system.
Hadoop-related projects are recommended to use this new layout:
{noformat}
$PROJECT_PREFIX/bin
               /etc/$PROJECT
               /lib (native C libraries)
               /share/$PROJECT (jar files)
               /share/$PROJECT/lib (third party dependent jar files)
               /share/doc/$PROJECT
{noformat}
Here, PROJECT_PREFIX defaults to /usr. Hive can locate the Hadoop jar files by
referencing $PROJECT_PREFIX/share/hadoop/common/hadoop-*-core.jar. The system
works whether the tarball is extracted or the RPM packages are installed to
the same PREFIX location.
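
As a rough illustration of the lookup described above, the following sketch
resolves the Hadoop core jar under a given prefix. It is not part of any patch
on this issue; the function names (find_hadoop_core_jars, build_classpath) are
hypothetical, and the only details taken from the description are the default
prefix /usr and the pattern share/hadoop/common/hadoop-*-core.jar.

```python
import glob
import os


def find_hadoop_core_jars(prefix="/usr"):
    """Return sorted paths matching share/hadoop/common/hadoop-*-core.jar.

    `prefix` plays the role of PROJECT_PREFIX; /usr is the default named in
    the description. The function name itself is an assumption.
    """
    pattern = os.path.join(prefix, "share", "hadoop", "common",
                           "hadoop-*-core.jar")
    return sorted(glob.glob(pattern))


def build_classpath(prefix="/usr"):
    """Join the located jars into a classpath string (hypothetical helper)."""
    jars = find_hadoop_core_jars(prefix)
    if not jars:
        raise FileNotFoundError(
            "no hadoop-*-core.jar found under %s/share/hadoop/common" % prefix)
    return os.pathsep.join(jars)
```

Because the prefix is a parameter, the same lookup would apply whether the
files came from an extracted tarball or from an installed RPM/DEB package.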