[jira] [Updated] (HADOOP-13397) Add dockerfile for Hadoop
[ https://issues.apache.org/jira/browse/HADOOP-13397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated HADOOP-13397: -- Attachment: HADOOP-13397.DNC001.patch I'm uploading DNC001 ("do not commit" #1) to give folks something to play with, get some feedback, etc. While the patch is intended for trunk, you can still use it to build Dockerfiles (theoretically) for any branch-2 release. Docs are missing, but running: {code} mkhdf create --version 2.7.2 --dockerfile /tmp/Dockerfile {code} will generate a simple Xenial-based Dockerfile that downloads 2.7.2, does gpg verification, etc, with a bootstrap file in /tmp/hadoop-bootstrap.sh to get the daemons started. Lots of this is still untested, but it should be enough for folks to provide some feedback if this is useful. Some portions (such as supplying stubs outside of the share dir) aren't quite baked in yet, but that will be coming. There are lots of to do's here, too many to name, but some of the big ones are: * clean up parameter handling to be less finicky * support for RPMs, DEBs, etc, * support for non-bundled bits (e.g., supplying your own tar ball) * actually verify the daemons work. :) The current focus was to build something that would just be a raw dockerfile without any external input. > Add dockerfile for Hadoop > - > > Key: HADOOP-13397 > URL: https://issues.apache.org/jira/browse/HADOOP-13397 > Project: Hadoop Common > Issue Type: Bug >Reporter: Klaus Ma >Assignee: Allen Wittenauer > Attachments: HADOOP-13397.DNC001.patch > > > For now, there's no community version Dockerfile in Hadoop; most of docker > images are provided by vendor, e.g. > 1. Cloudera's image: https://hub.docker.com/r/cloudera/quickstart/ > 2. From HortonWorks sequenceiq: > https://hub.docker.com/r/sequenceiq/hadoop-docker/ > 3. MapR provides the mapr-sandbox-base: > https://hub.docker.com/r/maprtech/mapr-sandbox-base/ > The proposal of this JIRA is to provide a community version Dockerfile in > Hadoop, and here's some requirement: > 1. Seperated docker image for master & agents, e.g. resource manager & node > manager > 2. Default configuration to start master & agent instead of configurating > manually > 3. Start Hadoop process as no-daemon > Here's my dockerfile to start master/agent: > https://github.com/k82cn/outrider/tree/master/kubernetes/imgs/yarn > I'd like to contribute it after polishing :). > Email Thread : > http://mail-archives.apache.org/mod_mbox/hadoop-user/201607.mbox/%3CSG2PR04MB162977CFE150444FA022510FB6370%40SG2PR04MB1629.apcprd04.prod.outlook.com%3E -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-13397) Add dockerfile for Hadoop
[ https://issues.apache.org/jira/browse/HADOOP-13397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated HADOOP-13397: -- Description: For now, there's no community version Dockerfile in Hadoop; most of docker images are provided by vendor, e.g. 1. Cloudera's image: https://hub.docker.com/r/cloudera/quickstart/ 2. From HortonWorks sequenceiq: https://hub.docker.com/r/sequenceiq/hadoop-docker/ 3. MapR provides the mapr-sandbox-base: https://hub.docker.com/r/maprtech/mapr-sandbox-base/ The proposal of this JIRA is to provide a community version Dockerfile in Hadoop, and here's some requirement: 1. Seperated docker image for master & agents, e.g. resource manager & node manager 2. Default configuration to start master & agent instead of configurating manually 3. Start Hadoop process as no-daemon Here's my dockerfile to start master/agent: https://github.com/k82cn/outrider/tree/master/kubernetes/imgs/yarn I'd like to contribute it after polishing :). Email Thread : http://mail-archives.apache.org/mod_mbox/hadoop-user/201607.mbox/%3CSG2PR04MB162977CFE150444FA022510FB6370%40SG2PR04MB1629.apcprd04.prod.outlook.com%3E was: For now, there's no community version Dockerfile in Hadoop; most of docker images are provided by vendor, e.g. 1. Official image from Cloudera is the quickstart image: https://hub.docker.com/r/cloudera/quickstart/ 2. From HortonWorks sequenceiq: https://hub.docker.com/r/sequenceiq/hadoop-docker/ 3. MapR provides the mapr-sandbox-base: https://hub.docker.com/r/maprtech/mapr-sandbox-base/ The proposal of this JIRA is to provide a community version Dockerfile in Hadoop, and here's some requirement: 1. Seperated docker image for master & agents, e.g. resource manager & node manager 2. Default configuration to start master & agent instead of configurating manually 3. Start Hadoop process as no-daemon Here's my dockerfile to start master/agent: https://github.com/k82cn/outrider/tree/master/kubernetes/imgs/yarn I'd like to contribute it after polishing :). Email Thread : http://mail-archives.apache.org/mod_mbox/hadoop-user/201607.mbox/%3CSG2PR04MB162977CFE150444FA022510FB6370%40SG2PR04MB1629.apcprd04.prod.outlook.com%3E > Add dockerfile for Hadoop > - > > Key: HADOOP-13397 > URL: https://issues.apache.org/jira/browse/HADOOP-13397 > Project: Hadoop Common > Issue Type: Bug >Reporter: Klaus Ma > > For now, there's no community version Dockerfile in Hadoop; most of docker > images are provided by vendor, e.g. > 1. Cloudera's image: https://hub.docker.com/r/cloudera/quickstart/ > 2. From HortonWorks sequenceiq: > https://hub.docker.com/r/sequenceiq/hadoop-docker/ > 3. MapR provides the mapr-sandbox-base: > https://hub.docker.com/r/maprtech/mapr-sandbox-base/ > The proposal of this JIRA is to provide a community version Dockerfile in > Hadoop, and here's some requirement: > 1. Seperated docker image for master & agents, e.g. resource manager & node > manager > 2. Default configuration to start master & agent instead of configurating > manually > 3. Start Hadoop process as no-daemon > Here's my dockerfile to start master/agent: > https://github.com/k82cn/outrider/tree/master/kubernetes/imgs/yarn > I'd like to contribute it after polishing :). > Email Thread : > http://mail-archives.apache.org/mod_mbox/hadoop-user/201607.mbox/%3CSG2PR04MB162977CFE150444FA022510FB6370%40SG2PR04MB1629.apcprd04.prod.outlook.com%3E -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org