Re: [Ocfs2-users] ocfs2 node reboot method

2008-08-05 Thread Sunil Mushran
I believe 1.2.5-SLES-r2997 is the version of the fs and not the
tools. Meaning, an upgrade is required to the latest kernel
that is shipping 1.2.9.

As far as the failure to mount goes, one reason could be that the
old default timeout (10 secs) is too low. See if increasing it to
the new default of 30 secs helps.
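
A minimal sketch of that change, assuming the timeout meant here is the
O2CB network idle timeout and that the installed release reads it from
/etc/sysconfig/o2cb. Neither is spelled out above, and the variable name
O2CB_IDLE_TIMEOUT_MS is taken from the o2cb init script, not from this
thread. The value is in milliseconds.

```shell
# /etc/sysconfig/o2cb (fragment)
# Assumption: this is the network idle timeout, in milliseconds.
# 10000 ms is the old default mentioned above; 30000 ms is the new one.
O2CB_IDLE_TIMEOUT_MS=30000
```

The value should be identical on every node, and the cluster stack has to
be restarted on each node before it takes effect.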

Tao Ma wrote:
> Masanari Iida wrote:
>> On Tue, Aug 5, 2008 at 5:43 PM, Tao Ma <[EMAIL PROTECTED]> wrote:
>>
>>>> An error message I saw was
>>>> mount.ocfs2: Transport endpoint is not connected while mounting
>>>> /dev/EXTDISK/OCFS2
>>>>
>>> Interesting. Have you updated ocfs2 on some nodes? Normally it happens when
>>> there is a protocol collision among nodes.
>>> Any helpful information in "dmesg"?
>>>
>> The boxes are all SLES10 + ocfs2-tools-1.2.5-SLES-r2997
>> I found no other messages at the time of failure.
>>
> Which version of ocfs2?
>
>> I would like to catch some useful information for troubleshooting.
>> Do you think tcpdump may catch something?
>> Or do I need to use debugfs.ocfs2?
>> If the latter, what are the right options?
>>
> debugfs.ocfs2 -l CONN DLM_DOMAIN TCP allow
> Then mount and check "dmesg".
>
> Regards,
> Tao


___
Ocfs2-users mailing list
Ocfs2-users@oss.oracle.com
http://oss.oracle.com/mailman/listinfo/ocfs2-users


Re: [Ocfs2-users] Enable mlog() messages

2008-08-05 Thread Tao Ma
Masanari Iida wrote:
> Hello again,
>
> I looked into the source and found that the error message
> "Transport endpoint is not connected" could come
> from ENOTCONN in tcp.c.
> ENOTCONN is returned in multiple places, so I would like to know
> which one produces my message. I want to enable mlog().
>
> My understanding is that debugfs.ocfs2 enables mlog().
> http://oss.oracle.com/osswiki/OCFS2/Debugging
>
> I think my problem happens while the system boots up
> and mounts the volume from /etc/fstab.
> In that case, how can I enable debugging?
>
As I said in the other mail, you can run
debugfs.ocfs2 -l CONN DLM_DOMAIN SUPER allow
and then mount the volume.
dmesg should then give us enough information about what happens.
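
For the boot-time case, a rough sketch of that sequence as a script. It
assumes the fstab entry is marked noauto so the volume is not mounted
before tracing is switched on, and that the script runs as root after the
o2cb service has started. The function names, the mount point /mnt/ocfs2
and the log path are placeholders and do not come from this thread. With
DRY_RUN=1 the commands are only printed.

```shell
#!/bin/sh
# Sketch only: turn on OCFS2 tracing, try the mount, keep the kernel log.
# DEV is the device named in this thread; MNT and LOG are placeholder paths.
DEV=${DEV:-/dev/EXTDISK/OCFS2}
MNT=${MNT:-/mnt/ocfs2}
LOG=${LOG:-/var/log/ocfs2-mount-trace.log}
MASKS="CONN DLM_DOMAIN SUPER"

# Run a command, or just print it when DRY_RUN=1.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

trace_mount() {
    # $MASKS is left unquoted on purpose so each mask is its own argument.
    run debugfs.ocfs2 -l $MASKS allow
    run mount -t ocfs2 "$DEV" "$MNT"
    rc=$?
    # Save the kernel messages whether or not the mount worked.
    if [ "${DRY_RUN:-0}" != "1" ]; then
        dmesg > "$LOG"
    fi
    # Switch the trace bits back off so the kernel log does not fill up.
    run debugfs.ocfs2 -l $MASKS off
    return $rc
}
```

Calling trace_mount at the end of such a script performs the mount with
tracing on and leaves the captured dmesg output in $LOG.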

Regards,
Tao



Re: [Ocfs2-users] ocfs2 node reboot method

2008-08-05 Thread Tao Ma
Masanari Iida wrote:
> On Tue, Aug 5, 2008 at 5:43 PM, Tao Ma <[EMAIL PROTECTED]> wrote:
>
>>> An error message I saw was
>>> mount.ocfs2: Transport endpoint is not connected while mounting
>>> /dev/EXTDISK/OCFS2
>>>
>> Interesting. Have you updated ocfs2 on some nodes? Normally it happens when
>> there is a protocol collision among nodes.
>> Any helpful information in "dmesg"?
>>
> The boxes are all SLES10 + ocfs2-tools-1.2.5-SLES-r2997
> I found no other messages at the time of failure.
>
Which version of ocfs2?
> I would like to catch some useful information for troubleshooting.
> Do you think tcpdump may catch something?
> Or do I need to use debugfs.ocfs2?
> If the latter, what are the right options?
>
debugfs.ocfs2 -l CONN DLM_DOMAIN TCP allow
Then mount and check "dmesg".

Regards,
Tao




[Ocfs2-users] Enable mlog() messages

2008-08-05 Thread Masanari Iida
Hello again,

I looked into the source and found that the error message
"Transport endpoint is not connected" could come
from ENOTCONN in tcp.c.
ENOTCONN is returned in multiple places, so I would like to know
which one produces my message. I want to enable mlog().

My understanding is that debugfs.ocfs2 enables mlog().
http://oss.oracle.com/osswiki/OCFS2/Debugging

I think my problem happens while the system boots up
and mounts the volume from /etc/fstab.
In that case, how can I enable debugging?

Regards,

Masanari Iida



Re: [Ocfs2-users] ocfs2 node reboot method

2008-08-05 Thread Masanari Iida
On Tue, Aug 5, 2008 at 5:43 PM, Tao Ma <[EMAIL PROTECTED]> wrote:

>> An error message I saw was
>> mount.ocfs2: Transport endpoint is not connected while mounting
>> /dev/EXTDISK/OCFS2
>
> Interesting. Have you updated ocfs2 on some nodes? Normally it happens when
> there is a protocol collision among nodes.
> Any helpful information in "dmesg"?
>
The boxes are all SLES10 + ocfs2-tools-1.2.5-SLES-r2997.
I found no other messages at the time of failure.

I would like to catch some useful information for troubleshooting.
Do you think tcpdump may catch something?
Or do I need to use debugfs.ocfs2?
If the latter, what are the right options?

 Thank you

 Masanari Iida



Re: [Ocfs2-users] ocfs2 node reboot method

2008-08-05 Thread Tao Ma
Hi,

Masanari Iida wrote:
> Hello list,
>
> There is a 14-node OCFS2 cluster.
> When I reboot all 14 nodes at once, some nodes fail to
> mount the ocfs2 filesystem while rebooting.
> The mount is supposed to be done from /etc/fstab.
> The symptom appears on random nodes.
> I would like to know whether there is a rule that OCFS2 nodes
> need to be rebooted one by one instead of all at once.
No, there is no such rule for rebooting ocfs2 nodes.
>
> An error message I saw was
> mount.ocfs2: Transport endpoint is not connected while mounting
> /dev/EXTDISK/OCFS2
Interesting. Have you updated ocfs2 on some nodes? Normally it happens
when there is a protocol collision among nodes.
Any helpful information in "dmesg"?

Also, please provide the version info of ocfs2.

Regards,
Tao



[Ocfs2-users] ocfs2 node reboot method

2008-08-05 Thread Masanari Iida
Hello list,

There is a 14-node OCFS2 cluster.
When I reboot all 14 nodes at once, some nodes fail to
mount the ocfs2 filesystem while rebooting.
The mount is supposed to be done from /etc/fstab.
The symptom appears on random nodes.
I would like to know whether there is a rule that OCFS2 nodes
need to be rebooted one by one instead of all at once.

An error message I saw was
mount.ocfs2: Transport endpoint is not connected while mounting
/dev/EXTDISK/OCFS2

iptables is not set up on any of the nodes.

Regards,

Masanari Iida
