https://bugzilla.wikimedia.org/show_bug.cgi?id=68161

            Bug ID: 68161
           Summary: Current puppet does not allow to bring up a cluster in
                    labs
           Product: Analytics
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: General/Unknown
          Assignee: wikibugs-l@lists.wikimedia.org
          Reporter: christ...@quelltextlich.at
                CC: bugwatc...@sb-mail.wmflabs.org,
                    christ...@quelltextlich.at, kle...@wikimedia.org,
                    oke...@wikimedia.org, tneg...@wikimedia.org
       Web browser: ---
   Mobile Platform: ---

When trying to bring up a namenode in labs, puppet fails with

  Error: Cannot create /var/lib/hadoop/name; parent directory /var/lib/hadoop
does not exist
  Error: /Stage[main]/Cdh::Hadoop::Namenode/File[/var/lib/hadoop/name]/ensure:
change from absent to directory failed: Cannot create /var/lib/hadoop/name;
parent directory /var/lib/hadoop does not exist

With puppet at commit ebcbef50568960d424fcb95fc79ba3be945a905e,
everything is working, and setting up a cluster in labs works.

With 87bd718e678d290b80b0916d255f1bae8666e7d7 (i.e.: the child
following the above ebcdef commit) + cherry-picking
a38770013716dd39ee5df90380473b734e0cebbb on top [1], puppet fails to
set up namenode. Puppet runs fail with the above error message.

So it seems 87bd718e678d290b80b0916d255f1bae8666e7d7 is the culprit.
But as this commit is doing much reshuffling (~800 lines changed),
I'll leave it to CDH+puppet experts to dig deeper.



* Steps to Reproduce
   * Add a new instance 'demo-master'
      (m1.small, ubuntu-12.04-precise)
   * Wait for the instance to come up.
   * Configure the instance by adding role
       role::analytics::hadoop::master
     and setting
       hadoop_namenodes
     to
       demo-master.eqiad.wmflabs
   * Wait for the next puppet run

* Expected result
  Puppet passes without errors

* Actual result
  Puppet fails with
    Error: Cannot create /var/lib/hadoop/name; parent directory /var/lib/hadoop
does not exist
    Error:
/Stage[main]/Cdh::Hadoop::Namenode/File[/var/lib/hadoop/name]/ensure: change
from absent to directory failed: Cannot create /var/lib/hadoop/name; parent
directory /var/lib/hadoop does not exist




[1] Plain 87bd718e678d290b80b0916d255f1bae8666e7d7 fails with

  Error: Could not retrieve catalog from remote server: Error 400 on SERVER:
Duplicate parameter 'mapreduce_output_compression' for on Class[Cdh::Hadoop] at
/etc/puppet/manifests/role/analytics/hadoop.pp:201 on node
qchris-master-87bd718.eqiad.wmflabs

which was fixed upstream in commit
a38770013716dd39ee5df90380473b734e0cebbb.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to