On Thu, Feb 05, 2015 at 06:26:17PM +0530, Sivaram Kannan wrote:
> Hi,
>
> In our deployments of mesos-slave, we are getting the following error during
> start up. I understand the slave is failing due to large number of fd's being
> opened. I have increased the ulimit of fd's to 4096 from 1024 but still the
> same behavior. What can I do to solve this problem, and what should I do to
> prevent it?
>
> Thanks,
> ./Siva.
>
> Collect failed: Failed to create pipe: Too many open files
> Feb 05 12:33:58 node-d4856455ad5c sh[32162]: To remedy this do as follows:
> Feb 05 12:33:58 node-d4856455ad5c sh[32162]: Step 1: rm -f /var/lib/mesos/slave/meta/slaves/latest
> Feb 05 12:33:58 node-d4856455ad5c sh[32162]: This ensures slave doesn't recover old live executors.
> Feb 05 12:33:58 node-d4856455ad5c sh[32162]: Step 2: Restart the slave.

Have you tried the steps above?

By the way, 4096 is actually not large enough. We hit "Too many open files"
when running a Storm topology on Mesos even with the limit set to 8192, so we
currently give mesos-slave an open-files limit of 65535.

--
Thanks,
Chengwei

> Feb 05 12:33:58 node-d4856455ad5c systemd[1]: mesos-slave.service: main process exited, code=exited, status=1/FAILURE
> Feb 05 12:33:58 node-d4856455ad5c docker[3351]: mesos_slave
> Feb 05 12:33:58 node-d4856455ad5c systemd[1]: Unit mesos-slave.service entered failed state.
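
A limit raised with ulimit applies to that shell and the processes it starts, so it may be worth confirming which limit the running slave process actually has and how many descriptors it holds. The following is only a minimal sketch, assuming a Linux host with /proc mounted; the function names are made up for illustration and are not part of Mesos.

```python
import os
import resource


def nofile_limits():
    """Return the (soft, hard) open-file limits of the current process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def open_fd_count(pid="self"):
    """Count the open file descriptors of a process via /proc (Linux only)."""
    return len(os.listdir(f"/proc/{pid}/fd"))


def proc_nofile_limit(pid="self"):
    """Read the soft and hard 'Max open files' values from /proc/<pid>/limits.

    Values are returned as strings because a limit may read 'unlimited'.
    """
    with open(f"/proc/{pid}/limits") as f:
        for line in f:
            if line.startswith("Max open files"):
                fields = line.split()
                # Layout: Max open files <soft> <hard> files
                return fields[3], fields[4]
    return None


if __name__ == "__main__":
    soft, hard = nofile_limits()
    print(f"this process: soft={soft} hard={hard} open={open_fd_count()}")
    print(f"from /proc: {proc_nofile_limit()}")
```

To inspect the slave rather than the script itself, pass the PID of the mesos-slave process to open_fd_count() and proc_nofile_limit(). Reading another user's /proc entries usually requires root.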