Hello,

I'm trying to nail down a process for converting existing Apache-hadoop
clusters with significant amounts of pre-existing data to CDH3.  While I've
found documentation for upgrading between CDH versions, I haven't seen one
for standard Apache-hadoop => CDH3 with the "new" mapred and hdfs users and
groups.  I'm looking for the proper method to manually convert permissions
on existing data from a single user&group setup (hadoop/hadoop) to the 2user
& 3 group setup of hdfs/mapred users with hdfs/mapred/hadoop groups.

What I'm thinking needs to happen is something like this:

   1. Shutdown cluster.
   2. Perform full configuration and HDFS data backup.
   3. Delete existing hadoop user/group while leaving HDFS data/mapred
   folders untouched.
   4. Install CDH3 packages (which sets up new users and groups).
   5. Manually adjust permissions/groups/ownership of old data files to
   match new CDH3 security setup.
   6. Flow into standard hadoop-upgrade process.

 Basically, I'm trying to nail down setup 5, and would appreciate any
guidance on this.  I'm guessing I just haven't found the correct document
since this seems like it would be a common endeavor.

Thanks in advance,
-JR

Reply via email to