Thanks Paul! Unfortunately '/boot' is not available in my embedded linux,
and I do not have the configuration file for the old kernel since it is
provided as is. However, I have the new kernel configuration since I
compiled it myself. Would it be helpful if I provide you the .config file
when I compile the kernel? It maybe quite painful to look through that file
though. Is there any other way that I can obtain the configuration?

I checked my config for the new kernel, and UNIX-domain sockets and Sys V
IPC are both enabled in the build. Are there any other possibilities I can
check?

Thanks,
Di

--
Di Wu (Allan)
PhD student, VAST Laboratory <http://vast.cs.ucla.edu/>,
Department of Computer Science, UC Los Angeles
Email: al...@cs.ucla.edu

On Tue, Nov 25, 2014 at 10:45 AM, Paul Hargrove <phhargr...@lbl.gov> wrote:

> Allan,
>
> A likely possibility is that some important kernel feature (that Open MPI
> assumes is present) is missing.
> That includes not only "kernel modules" as you mention, but also features
> configure in (or out) of the base kernel.
> For instance, some embedded kernels omit UNIX-domain sockets and SysV IPC
> support.
>
> If you can send me (preferably off-list) the kernel config files for the
> old an new kernels I may be able to spot something.
> If present, you are looking for /boot/config-[VERSION]
>
> -Paul
>
> On Tue, Nov 25, 2014 at 10:25 AM, Allan Wu <al...@cs.ucla.edu> wrote:
>
>> I'm sorry I forgot to change the subject when I reply to the digest
>> issue. Please find my original email below.
>>
>> Regards,
>> Di
>>
>> On Tue, Nov 25, 2014 at 10:19 AM, Allan Wu <al...@cs.ucla.edu> wrote:
>>
>>> Thanks Ralph for the reply. Sorry about the log file, I think I forgot
>>> to put an extension to the file. Please find a new one attached with this
>>> email.
>>>
>>> I'm sorry for not enough debugging information, but 'omp_info' and
>>> '--debug-devel' are the only ways I know for collecting information, are
>>> there any other things I can try to provide more info?
>>>
>>> When I execute 'mpirun --debug-devel -np 1 ./helloworld', all the output
>>> is the logging information in my last email. It got stuck at
>>>  "[fpga1:00718] tmp: /tmp", and nothing from my helloworld program is
>>> printed out to the screen. So I think it is mpirun failing to start my
>>> executable, not failing to terminate.
>>>
>>> I was wondering if this has anything to do with my newer kernel version,
>>> since it works well in the old case.
>>>
>>> Thanks,
>>> --
>>> Di Wu (Allan)
>>> PhD student, VAST Laboratory <http://vast.cs.ucla.edu/>,
>>> Department of Computer Science, UC Los Angeles
>>> Email: al...@cs.ucla.edu
>>>
>>>
>>> Date: Tue, 25 Nov 2014 07:29:51 -0800
>>> From:
>>> Ralph Castain <r...@open-mpi.org>
>>> To: Open MPI Developers <de...@open-mpi.org>
>>> Subject: Re: [OMPI devel] OpenMPI v1.8 and v1.8.3 mpirun hangs at
>>>         execution       on an embedded ARM Linux kernel version 3.15.0
>>> Message-ID: <898cb117-f6a6-4569-89c3-49b75d65b...@open-mpi.org>
>>> Content-Type: text/plain; charset="utf-8"
>>>
>>> I don?t know what you put in that log file, but it was an executable and
>>> I?m not feeling that trusting :-)
>>>
>>> I?m afraid there isn?t enough debug output there to really tell
>>> anything. From what little I can see, I?m guessing that the application ran
>>> fine and you got the usual ?hello? output and the helloworld process exited
>>> safely - is that correct? And so it is solely mpirun that is failing to
>>> cleanly terminate?
>>>
>>>
>>> > On Nov 24, 2014, at 11:24 PM, Allan Wu <al...@cs.ucla.edu> wrote:
>>> >
>>> > Hello everyone,
>>> >
>>> > I have cross-compiled OpenMPI for an embedded ARM Linux. Everything
>>> works fine for my system based on Linux 3.8.0. I have previously submitted
>>> a post related to my compilation, which can be found here:
>>> http://www.open-mpi.org/community/lists/devel/2014/04/14440.php <
>>> http://www.open-mpi.org/community/lists/devel/2014/04/14440.php>. When
>>> I recently upgraded my Linux kernel to 3.15.0, mpirun begins to stuck at
>>> even the helloworld program. The program consists only simple APIs:
>>> MPI_Init, MPI_Comm_size, MPI_Comm_rank, MPI_Finalize. The problem occurs
>>> even at 'mpirun -np 1 ./helloworld', and below are the output with
>>> --debug-devel (before it got stuck):
>>> > [fpga1:00716] sess_dir_finalize: job session dir not empty - leaving
>>> > [fpga1:00716] procdir: /tmp/openmpi-sessions-root@fpga1_0/63813/0/0
>>> > [fpga1:00716] jobdir: /tmp/openmpi-sessions-root@fpga1_0/63813/0
>>> > [fpga1:00716] top: openmpi-sessions-root@fpga1_0
>>> > [fpga1:00716] tmp: /tmp
>>> > [fpga1:00718] procdir: /tmp/openmpi-sessions-root@fpga1_0/63813/1/0
>>> > [fpga1:00718] jobdir: /tmp/openmpi-sessions-root@fpga1_0/63813/1
>>> > [fpga1:00718] top: openmpi-sessions-root@fpga1_0
>>> >
>>> [fpga1:00718] tmp: /tmp
>>> >
>>> > I suspect maybe it is due to incompatible kernel version or some
>>> missing kernel modules. I tried also with the latest version 1.8.3, and had
>>> the same problem. Does anyone have any thoughts? I have attached the output
>>> of 'ompi-info --all' with this email.
>>> >
>>> > Please let me know if I need to provide more information. Thanks in
>>> advance!
>>> >
>>> > Regards,
>>> > --
>>> > Di Wu (Allan)
>>> > PhD student, VAST?Laboratory <http://vast.cs.ucla.edu/>,
>>> > Department of Computer Science, UC Los Angeles
>>> > Email: al...@cs.ucla.edu <mailto:al...@cs.ucla.edu>
>>> > <log.tar.gz>_______________________________________________
>>> > devel mailing list
>>> > de...@open-mpi.org
>>> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>> > Link to this post:
>>> http://www.open-mpi.org/community/lists/devel/2014/11/16330.php
>>>
>>>
>>
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> Link to this post:
>> http://www.open-mpi.org/community/lists/devel/2014/11/16341.php
>>
>
>
>
> --
> Paul H. Hargrove                          phhargr...@lbl.gov
> Computer Languages & Systems Software (CLaSS) Group
> Computer Science Department               Tel: +1-510-495-2352
> Lawrence Berkeley National Laboratory     Fax: +1-510-486-6900
>

Reply via email to