JIRA ticket # 239

On Wed, Aug 17, 2016 at 2:21 PM, John Yost <hokiege...@gmail.com> wrote:

> Hi Guys,
>
> Sorry, just checked email, setting role to * does indeed cause this
> error.  Matt -> if you set it to nothing or whatever role you have your
> mesos slaves configured as, that will fix this. My apologies for failing to
> log and fix this. Darin -> I will enter a JIRA ticket for this.
>
> To echo Darin's sentiments--thanks a bunch for checking Myriad out! :)
>
> --John
>
> On Wed, Aug 17, 2016 at 1:25 PM, Darin Johnson <dbjohnson1...@gmail.com>
> wrote:
>
>> Hey Matt,
>>
>> Looking through the code, I think setting myriadFrameworkRole to "*" might
>> be the problem.  Can you try commenting out that line in your config?
>> I'll
>> double check this in a little while too.  If that works I'll submit a
>> patch
>> that checks that.
>>
>> Sorry - Myriad is still a pretty young project!  Thanks for checking it
>> out
>> though!
>>
>> Darin
>>
>> On Wed, Aug 17, 2016 at 11:25 AM, Matthew J. Loppatto <
>> mloppa...@keywcorp.com> wrote:
>>
>> > Hey Darin,
>> >
>> > Pulling from master got rid of the errors I was seeing, however I'm
>> > running into a new issue.  After starting the resource manager, I see
>> this
>> > in the logs:
>> >
>> > 2016-08-17 10:56:40,709 INFO org.apache.myriad.Main: Launching 1 NM(s)
>> > with profile medium
>> > 2016-08-17 10:56:40,710 INFO org.apache.myriad.scheduler.My
>> riadOperations:
>> > Adding 1 NM instances to cluster
>> > 2016-08-17 10:56:40,733 ERROR org.apache.myriad.scheduler.ev
>> ent.handlers.ErrorEventHandler:
>> > Role '' is not present in the master's --roles
>> >
>> > My Mesos cluster has the default "*" role so I tried setting
>> > frameworkRole: "*" in myriad-config-default.yml, restarted the resource
>> > manager and got this error:
>> >
>> > 2016-08-17 11:06:28,244 ERROR org.apache.myriad.scheduler.ev
>> ent.handlers.ResourceOffersEventHandler:
>> > Exception thrown while trying to create a task for nm
>> > java.lang.IllegalArgumentException: n must be positive
>> >     at java.util.Random.nextInt(Random.java:300)
>> >     at org.apache.myriad.scheduler.resource.RangeResource.
>> > getRandomValues(RangeResource.java:128)
>> >     at org.apache.myriad.scheduler.resource.RangeResource.
>> > consumeResource(RangeResource.java:99)
>> >     at org.apache.myriad.scheduler.resource.ResourceOfferContainer.
>> > consumePorts(ResourceOfferContainer.java:171)
>> >     at org.apache.myriad.scheduler.NMTaskFactory.createTask(
>> > NMTaskFactory.java:45)
>> >     at org.apache.myriad.scheduler.event.handlers.
>> > ResourceOffersEventHandler.onEvent(ResourceOffersEventHandler.java:119)
>> >     at org.apache.myriad.scheduler.event.handlers.
>> > ResourceOffersEventHandler.onEvent(ResourceOffersEventHandler.java:49)
>> >     at com.lmax.disruptor.BatchEventProcessor.run(
>> > BatchEventProcessor.java:128)
>> >     at java.util.concurrent.ThreadPoolExecutor.runWorker(
>> > ThreadPoolExecutor.java:1145)
>> >     at java.util.concurrent.ThreadPoolExecutor$Worker.run(
>> > ThreadPoolExecutor.java:615)
>> >     at java.lang.Thread.run(Thread.java:745)
>> >
>> > Does Myriad require its own role in Mesos?
>> >
>> > Thanks,
>> > Matt
>> >
>> >
>> > -----Original Message-----
>> > From: Darin Johnson [mailto:dbjohnson1...@gmail.com]
>> > Sent: Tuesday, August 16, 2016 6:18 PM
>> > To: Dev
>> > Subject: Re: Resource manager error
>> >
>> > Hey Mathew, my coworker found the same issue recently, I fixed it on my
>> > last pull request, if you'd like to pull from master.
>> >
>> > Alternatively, you could comment out the appendCgroups line in
>> > myriad-scheduler <https://urldefense.proofpoint.com/v2/url?u=https-
>> > 3A__github.com_apache_incubator-2Dmyriad_tree_0.2.x_
>> > myriad-2Dscheduler&d=CwIFaQ&c=31nHN1tvZeuWBT6LwDN4Ngk1qezfsY
>> > HyolgGeY2ZhlU&r=D2bc6ANY3sIFSxaGDnPt52V5PqDlabKIPtzNhAIfJCs&m=
>> > ibxhOZQSsKtyVi5iruY8ImkW7bQ8zOrHcuDTLL7GBwA&s=5zc_OUK_
>> > qwnVQoC2kVCcAgvb4ZmZrVKF-iHca_dif4Y&e= >/src <https://urldefense.
>> > proofpoint.com/v2/url?u=https-3A__github.com_apache_
>> > incubator-2Dmyriad_tree_0.2.x_myriad-2Dscheduler_src&d=CwIFaQ&c=
>> > 31nHN1tvZeuWBT6LwDN4Ngk1qezfsYHyolgGeY2ZhlU&r=
>> > D2bc6ANY3sIFSxaGDnPt52V5PqDlabKIPtzNhAIfJCs&m=
>> > ibxhOZQSsKtyVi5iruY8ImkW7bQ8zOrHcuDTLL7GBwA&s=
>> > 6CJWEHP2t7cY2oTmNz9Aq9AV39VEkUOKpMoRuz1q9nY&e= > /main <
>> > https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_apache_
>> > incubator-2Dmyriad_tree_0.2.x_myriad-2Dscheduler_src_main&d=CwIFaQ&c=
>> > 31nHN1tvZeuWBT6LwDN4Ngk1qezfsYHyolgGeY2ZhlU&r=
>> > D2bc6ANY3sIFSxaGDnPt52V5PqDlabKIPtzNhAIfJCs&m=
>> > ibxhOZQSsKtyVi5iruY8ImkW7bQ8zOrHcuDTLL7GBwA&s=
>> > BtYphvuvvNZ5owUTfiRd4hW90jq0Ib8GGtKiHU0fTB4&e= > /java <
>> > https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_apache_
>> > incubator-2Dmyriad_tree_0.2.x_myriad-2Dscheduler_src_main_ja
>> va&d=CwIFaQ&c=
>> > 31nHN1tvZeuWBT6LwDN4Ngk1qezfsYHyolgGeY2ZhlU&r=
>> > D2bc6ANY3sIFSxaGDnPt52V5PqDlabKIPtzNhAIfJCs&m=
>> > ibxhOZQSsKtyVi5iruY8ImkW7bQ8zOrHcuDTLL7GBwA&s=5o45_
>> > 8zSN96rSaQJ8oCWfhCvmqhSbLpz9fMnV9Fk4WI&e= > /org <https://urldefense.
>> > proofpoint.com/v2/url?u=https-3A__github.com_apache_
>> > incubator-2Dmyriad_tree_0.2.x_myriad-2Dscheduler_src_main_
>> > java_org&d=CwIFaQ&c=31nHN1tvZeuWBT6LwDN4Ngk1qezfsYHyolgGeY2ZhlU&r=
>> > D2bc6ANY3sIFSxaGDnPt52V5PqDlabKIPtzNhAIfJCs&m=
>> > ibxhOZQSsKtyVi5iruY8ImkW7bQ8zOrHcuDTLL7GBwA&s=
>> > KUTlDXsl6Okj5nCNNyCqnHcE2ePwEEOsYkf2ASzQP2Y&e= > /apache <
>> > https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_apache_
>> > incubator-2Dmyriad_tree_0.2.x_myriad-2Dscheduler_src_main_
>> > java_org_apache&d=CwIFaQ&c=31nHN1tvZeuWBT6LwDN4Ngk1qezfsYHyo
>> lgGeY2ZhlU&r=
>> > D2bc6ANY3sIFSxaGDnPt52V5PqDlabKIPtzNhAIfJCs&m=
>> > ibxhOZQSsKtyVi5iruY8ImkW7bQ8zOrHcuDTLL7GBwA&s=465af32H1JItcea_
>> > tp5hz7zxwpqWgAqbVA8APaWmSUE&e= > /myriad <https://urldefense.
>> > proofpoint.com/v2/url?u=https-3A__github.com_apache_
>> > incubator-2Dmyriad_tree_0.2.x_myriad-2Dscheduler_src_main_
>> > java_org_apache_myriad&d=CwIFaQ&c=31nHN1tvZeuWBT6LwDN4Ngk1qezfsY
>> > HyolgGeY2ZhlU&r=D2bc6ANY3sIFSxaGDnPt52V5PqDlabKIPtzNhAIfJCs&m=
>> > ibxhOZQSsKtyVi5iruY8ImkW7bQ8zOrHcuDTLL7GBwA&s=QEZka3G6qwIsYzvPtXx-
>> > w4uVek0Bt2D3bD4M4160Dnk&e= > /scheduler <https://urldefense.
>> > proofpoint.com/v2/url?u=https-3A__github.com_apache_
>> > incubator-2Dmyriad_tree_0.2.x_myriad-2Dscheduler_src_main_
>> > java_org_apache_myriad_scheduler&d=CwIFaQ&c=31nHN1tvZeuWBT6L
>> wDN4Ngk1qezfsY
>> > HyolgGeY2ZhlU&r=D2bc6ANY3sIFSxaGDnPt52V5PqDlabKIPtzNhAIfJCs&m=
>> > ibxhOZQSsKtyVi5iruY8ImkW7bQ8zOrHcuDTLL7GBwA&s=
>> > 2EubhJ2JLwuGjY6DBZXpauvyuXJ0xefgOFHC8lEo5JE&e= >
>> > /*NMExecutorCLGenImpl* and rebuild.
>> >
>> > Sorry that missed my QA unfortunately I'm always using cgroups and
>> didn't
>> > test that.  We may do a 0.2.1 release but I can say when.
>> >
>> > Darin
>> >
>> > On Aug 16, 2016 8:49 AM, "Matthew J. Loppatto" <mloppa...@keywcorp.com>
>> > wrote:
>> >
>> > > Hi,
>> > >
>> > >
>> > >
>> > > I’m setting up Myriad 0.2.0 on my Mesos cluster following this guide:
>> > > https://urldefense.proofpoint.com/v2/url?u=https-3A__cwiki.a
>> pache.org_
>> > > confluence_display_MYRIAD_&d=CwIFaQ&c=31nHN1tvZeuWBT6LwDN4Ng
>> k1qezfsYHy
>> > > olgGeY2ZhlU&r=D2bc6ANY3sIFSxaGDnPt52V5PqDlabKIPtzNhAIfJCs&m=
>> ibxhOZQSsK
>> > > tyVi5iruY8ImkW7bQ8zOrHcuDTLL7GBwA&s=LaQad9p3ZI3Rt5cTn3kHAb58
>> BuSD5etwIm
>> > > WZHzFz6Sk&e=
>> > > Installing+for+Developers
>> > >
>> > >
>> > >
>> > > And I get the following error in the resource manager executor log in
>> > > mesos after starting it with `/opt/hadoop-2.7.2/bin/yarn
>> > resourcemanager`:
>> > >
>> > >
>> > >
>> > > chown: cannot access ‘/sys/fs/cgroup/cpu/mesos/f5d6
>> c530-c13d-4b1d-bc30-
>> > f298affb6442’:
>> > > No such file or directory
>> > >
>> > > env: /bin/yarn: No such file or directory
>> > >
>> > > ory
>> > >
>> > >
>> > >
>> > > It appears the ‘mesos’ directory doesn’t exist under
>> /sys/fs/cgroup/cpu.
>> > > Any ideas what the issue could be?
>> > >
>> > >
>> > >
>> > > This is my yarn-site.xml:
>> > >
>> > >
>> > >
>> > > <configuration>
>> > >
>> > > <!-- Site-specific YARN configuration properties -->
>> > >
>> > >    <property>
>> > >
>> > >        <name>yarn.nodemanager.aux-services</name>
>> > >
>> > >        <value>mapreduce_shuffle,myriad_executor</value>
>> > >
>> > >        <!-- If using MapR distro, please use the following value:
>> > >
>> > >
>> > > <value>mapreduce_shuffle,mapr_direct_shuffle,myriad_executor</value>
>> > > -->
>> > >
>> > >    </property>
>> > >
>> > >    <property>
>> > >
>> > >
>> > > <name>yarn.nodemanager.aux-services.mapreduce_shuffle.class</name>
>> > >
>> > >        <value>org.apache.hadoop.mapred.ShuffleHandler</value>
>> > >
>> > >    </property>
>> > >
>> > >    <property>
>> > >
>> > >
>> > > <name>yarn.nodemanager.aux-services.myriad_executor.class</name>
>> > >
>> > >
>> > > <value>org.apache.myriad.executor.MyriadExecutorAuxService</value>
>> > >
>> > >    </property>
>> > >
>> > >    <property>
>> > >
>> > >        <name>yarn.nm.liveness-monitor.expiry-interval-ms</name>
>> > >
>> > >        <value>2000</value>
>> > >
>> > >    </property>
>> > >
>> > >    <property>
>> > >
>> > >        <name>yarn.am.liveness-monitor.expiry-interval-ms</name>
>> > >
>> > >        <value>10000</value>
>> > >
>> > >    </property>
>> > >
>> > >    <property>
>> > >
>> > >
>> > > <name>yarn.resourcemanager.nm.liveness-monitor.interval-ms</name>
>> > >
>> > >        <value>1000</value>
>> > >
>> > >    </property>
>> > >
>> > > <!-- Needed for Fine Grain Scaling -->
>> > >
>> > >    <property>
>> > >
>> > >        <name>yarn.scheduler.minimum-allocation-vcores</name>
>> > >
>> > >        <value>0</value>
>> > >
>> > >    </property>
>> > >
>> > >    <property>
>> > >
>> > >        <name>yarn.scheduler.minimum-allocation-mb</name>
>> > >
>> > >        <value>0</value>
>> > >
>> > >    </property>
>> > >
>> > > <!-- Site specific YARN configuration properties -->
>> > >
>> > > <property>
>> > >
>> > >    <name>yarn.nodemanager.resource.cpu-vcores</name>
>> > >
>> > >    <value>${nodemanager.resource.cpu-vcores}</value>
>> > >
>> > > </property>
>> > >
>> > > <property>
>> > >
>> > >    <name>yarn.nodemanager.resource.memory-mb</name>
>> > >
>> > >    <value>${nodemanager.resource.memory-mb}</value>
>> > >
>> > > </property>
>> > >
>> > > <!--These options enable dynamic port assignment by mesos -->
>> > >
>> > > <property>
>> > >
>> > >    <name>yarn.nodemanager.address</name>
>> > >
>> > >    <value>${myriad.yarn.nodemanager.address}</value>
>> > >
>> > > </property>
>> > >
>> > > <property>
>> > >
>> > >    <name>yarn.nodemanager.webapp.address</name>
>> > >
>> > >    <value>${myriad.yarn.nodemanager.webapp.address}</value>
>> > >
>> > > </property>
>> > >
>> > > <property>
>> > >
>> > >    <name>yarn.nodemanager.webapp.https.address</name>
>> > >
>> > >    <value>${myriad.yarn.nodemanager.webapp.address}</value>
>> > >
>> > > </property>
>> > >
>> > > <property>
>> > >
>> > >    <name>yarn.nodemanager.localizer.address</name>
>> > >
>> > >    <value>${myriad.yarn.nodemanager.localizer.address}</value>
>> > >
>> > > </property>
>> > >
>> > > <!-- Configure Myriad Scheduler here -->
>> > >
>> > > <property>
>> > >
>> > >    <name>yarn.resourcemanager.scheduler.class</name>
>> > >
>> > >    <value>org.apache.myriad.scheduler.yarn.MyriadFairScheduler<
>> /value>
>> > >
>> > >    <description>One can configure other scehdulers as well from
>> > > following
>> > > list: org.apache.myriad.scheduler.yarn.MyriadCapacityScheduler,
>> > > org.apache.myriad.scheduler.yarn.MyriadFifoScheduler</description>
>> > >
>> > > </property>
>> > >
>> > > <!-- Disable PMem/VMem checks for Hadoop 2.7.2 -->
>> > >
>> > > <property>
>> > >
>> > >    <name>yarn.nodemanager.pmem-check-enabled</name>
>> > >
>> > >    <value>false</value>
>> > >
>> > > </property>
>> > >
>> > > <property>
>> > >
>> > >    <name>yarn.nodemanager.vmem-check-enabled</name>
>> > >
>> > >    <value>false</value>
>> > >
>> > > </property>
>> > >
>> > > </configuration>
>> > >
>> > >
>> > >
>> > >
>> > >
>> > > My myriad-config-default.yml:
>> > >
>> > >
>> > >
>> > > mesosMaster: zk://myip:2181/mesos
>> > >
>> > > checkpoint: false
>> > >
>> > > frameworkFailoverTimeout: 43200000
>> > >
>> > > frameworkName: MyriadAlpha
>> > >
>> > > frameworkRole:
>> > >
>> > > frameworkUser: root     # User the Node Manager runs as, required if
>> > > nodeManagerURI set, otherwise defaults to the user
>> > >
>> > >                          # running the resource manager.
>> > >
>> > > frameworkSuperUser: root  # To be depricated, currently permissions
>> > > need set by a superuser due to Mesos-1790.  Must be
>> > >
>> > >                          # root or have passwordless sudo. Required if
>> > > nodeManagerURI set, ignored otherwise.
>> > >
>> > > nativeLibrary: /usr/local/lib/libmesos.so
>> > >
>> > > zkServers: myip:2181
>> > >
>> > > zkTimeout: 20000
>> > >
>> > > restApiPort: 8192
>> > >
>> > > servedConfigPath: dist/config.tgz
>> > >
>> > > servedBinaryPath: dist/binary.tgz
>> > >
>> > > profiles:
>> > >
>> > > zero:  # NMs launched with this profile dynamically obtain cpu/mem
>> > > from Mesos
>> > >
>> > >    cpu: 0
>> > >
>> > >    mem: 0
>> > >
>> > > small:
>> > >
>> > >    cpu: 2
>> > >
>> > >    mem: 2048
>> > >
>> > > medium:
>> > >
>> > >    cpu: 4
>> > >
>> > >    mem: 4096
>> > >
>> > > large:
>> > >
>> > >    cpu: 10
>> > >
>> > >    mem: 12288
>> > >
>> > > nmInstances: # NMs to start with. Requires at least 1 NM with a
>> > > non-zero profile.
>> > >
>> > > medium: 1 # <profile_name : instances>
>> > >
>> > > rebalancer: false
>> > >
>> > > haEnabled: false
>> > >
>> > > nodemanager:
>> > >
>> > > jvmMaxMemoryMB: 1024
>> > >
>> > > cpus: 0.2
>> > >
>> > > cgroups: false
>> > >
>> > > executor:
>> > >
>> > > jvmMaxMemoryMB: 256
>> > >
>> > > path:
>> > > file:///usr/local/libexec/mesos/myriad-executor-runnable-0.1.0.jar
>> > >
>> > > #The following should be used for a remotely distributed URI, hdfs
>> > > assumed but other URI types valid.
>> > >
>> > > #nodeManagerUri: hdfs://namenode:port/dist/hadoop-2.7.0.tar.gz
>> > >
>> > > #configUri:
>> > > https://urldefense.proofpoint.com/v2/url?u=http-3A__127.0.0.
>> 1_api_arif
>> > > acts_config.tgz&d=CwIFaQ&c=31nHN1tvZeuWBT6LwDN4Ngk1qezfsYHyo
>> lgGeY2ZhlU
>> > > &r=D2bc6ANY3sIFSxaGDnPt52V5PqDlabKIPtzNhAIfJCs&m=ibxhOZQSsKt
>> yVi5iruY8I
>> > > mkW7bQ8zOrHcuDTLL7GBwA&s=IpOqhUOtwJsdorbAOeoY7GgHalMJ1s9EUju
>> RUfRsmew&e
>> > > =
>> > >
>> > > #jvmUri:
>> > > https://urldefense.proofpoint.com/v2/url?u=https-3A__downloa
>> ds.mycompa
>> > > ny.com_java_jre-2D7u76-2Dlinux-2Dx64.tar.gz&d=CwIFaQ&c=
>> 31nHN1tvZeuWBT6
>> > > LwDN4Ngk1qezfsYHyolgGeY2ZhlU&r=D2bc6ANY3sIFSxaGDnPt52V5PqDla
>> bKIPtzNhAI
>> > > fJCs&m=ibxhOZQSsKtyVi5iruY8ImkW7bQ8zOrHcuDTLL7GBwA&s=jPB2677
>> RH3k3CLsgl
>> > > 4Zj3tGawuCLVB1a2WXBUOWEelU&e=
>> > >
>> > > yarnEnvironment:
>> > >
>> > > YARN_HOME: /opt/hadoop-2.7.2
>> > >
>> > >
>> > >
>> > >
>> > >
>> > > Thanks!
>> > >
>> > > Matt
>> > >
>> >
>>
>
>

Reply via email to