On Tue, 2017-09-05 at 05:41 -0400, William Allen Simpson wrote:
> On 9/4/17 6:59 AM, Swen Schillig wrote:
> > On Sat, 2017-09-02 at 00:15 -0400, William Allen Simpson wrote:
> > > On 9/1/17 6:09 PM, Frank Filz wrote:
> > > > Lately, we have been plagued by a lot of intermittent test
> > > > failures.
> > > >
> > > > I have seen intermittent failures in pynfs WRT14, WRT15, and
> > > > WRT16.
> > > > These
> > > > have not been resolved by the latest ntirpc pullup.
> > > >
> > >
> > > Details? What's WRT16?
> > >
> > > My pynfs results say:
> > >
> > > WRT13 st_write.testDoubleWrite
> > > :
> > > PASS
> > > WRT14 st_write.testLargeWrite
> > > :
> > > PASS
> > > WRT15 st_write.testSizes
> > > :
> > > PASS
> > > WRT18 st_write.testChangeGranularityWrite
> > > :
> > > FAILURE
> > > consecutive SETATTR(mode)'s don't all change change
> > > attribute
> > > WRT19 st_write.testStolenStateid
> > > :
> > > PASS
> > > **************************************************
> > > Command line asked for 584 of 668 tests
> > > Of those: 10 Skipped, 2 Failed, 2 Warned, 570 Passed
> >
> > My latest (dev.7) say
> >
> > WRT5 st_write.testLargeData :
> > FAILURE
> > RPCAcceptError: RPCError: MSG_ACCEPTED: GARBAGE_ARGS
> >
> > 100% reproducible all others succeed.
> >
>
> Of course, my WRT5 passes.
>
> But this is wonderful. Please tell us how you get this 100%
> reproducible
> result, so that we can reproduce it....
I'm afraid I'm not doing anything special.
Anyway, here are the details of the box running ganesha
- 3.10.0-327.el7.ppc64le #1 SMP Thu Oct 29 17:31:13 EDT 2015 ppc64le
ppc64le ppc64le GNU/Linux
- ganesha-2.6-dev.7
ganesha.conf
EXPORT
{
Export_Id = 12;
Path = /gpfs0/V4;
Pseudo = /V4;
FSAL {
Name = GPFS;
}
CLIENT {
Access_Type = RW;
Squash = No_root_squash;
Clients=9.244.58.107,9.155.107.180,9.155.107.185,9.244.58.169;
}
}
Client 1
Linux fscc-p8-16-o 3.10.0-327.el7.ppc64 #1 SMP Thu Oct 29 17:33:57 EDT 2015
ppc64 ppc64 ppc64 GNU/Linux
executing
./nfs4.0/testserver.py -v --outfile ~/pynfs40.out --maketree 9.155.107.180:/V4
--showomit --secure --rundep WRT5
gives
INIT st_setclientid.testValid : RUNNING
INIT st_setclientid.testValid : PASS
MKFILE st_open.testOpen : RUNNING
MKFILE st_open.testOpen : PASS
WRT5 st_write.testLargeData : RUNNING
Got error: [Errno 104] Connection reset by peer
Traceback (most recent call last):
File "/root/pynfs-HEAD-6618874/nfs4.0/lib/testmod.py", line 222, in run
self.runtest(self, environment)
File "/root/pynfs-HEAD-6618874/nfs4.0/servertests/st_write.py", line 137, in
testLargeData
res = c.write_file(fh, data[pos:], pos, stateid)
File "/root/pynfs-HEAD-6618874/nfs4.0/nfs4lib.py", line 768, in write_file
res = self.compound(ops)
File "/root/pynfs-HEAD-6618874/nfs4.0/nfs4lib.py", line 333, in compound
res = self.call(NFSPROC4_COMPOUND, p.get_buffer())
File "/root/pynfs-HEAD-6618874/nfs4.0/lib/rpc/rpc.py", line 410, in call
xid = self.send(procedure, data, program, version)
File "/root/pynfs-HEAD-6618874/nfs4.0/lib/rpc/rpc.py", line 337, in send
self.reconnect().send_record(header + data)
File "/root/pynfs-HEAD-6618874/nfs4.0/lib/rpc/rpc.py", line 176, in
_send_record
self.sendall(mark + chunk)
File "/usr/lib64/python2.7/socket.py", line 224, in meth
return getattr(self._sock,name)(*args)
error: [Errno 104] Connection reset by peer
WRT5 st_write.testLargeData : FAILURE
error: [Errno 104] Connection reset by peer
**************************************************
INIT st_setclientid.testValid : PASS
MKFILE st_open.testOpen : PASS
WRT5 st_write.testLargeData : FAILURE
error: [Errno 104] Connection reset by peer
**************************************************
Command line asked for 3 of 668 tests
Of those: 0 Skipped, 1 Failed, 0 Warned, 2 Passed
Client 2
4.12.9-200.fc25.x86_64 #1 SMP Fri Aug 25 13:23:30 UTC 2017 x86_64 x86_64 x86_64
GNU/Linux
running
./nfs4.0/testserver.py -v --outfile ~/pynfs40.out --maketree 9.155.107.180:/V4
--showomit --secure --rundep WRT5
gives
INIT st_setclientid.testValid : RUNNING
INIT st_setclientid.testValid : PASS
MKFILE st_open.testOpen : RUNNING
MKFILE st_open.testOpen : PASS
WRT5 st_write.testLargeData : RUNNING
Got error: [Errno 104] Connection reset by peer
Traceback (most recent call last):
File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/lib/testmod.py", line 222,
in run
self.runtest(self, environment)
File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/servertests/st_write.py",
line 137, in testLargeData
res = c.write_file(fh, data[pos:], pos, stateid)
File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/nfs4lib.py", line 768, in
write_file
res = self.compound(ops)
File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/nfs4lib.py", line 333, in
compound
res = self.call(NFSPROC4_COMPOUND, p.get_buffer())
File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/lib/rpc/rpc.py", line 410,
in call
xid = self.send(procedure, data, program, version)
File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/lib/rpc/rpc.py", line 337,
in send
self.reconnect().send_record(header + data)
File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/lib/rpc/rpc.py", line 176,
in _send_record
self.sendall(mark + chunk)
File "/usr/lib64/python2.7/socket.py", line 228, in meth
return getattr(self._sock,name)(*args)
timeout: timed out
WRT5 st_write.testLargeData : FAILURE
timeout: timed out
**************************************************
INIT st_setclientid.testValid : PASS
MKFILE st_open.testOpen : PASS
WRT5 st_write.testLargeData : FAILURE
timeout: timed out
**************************************************
Command line asked for 3 of 668 tests
Of those: 0 Skipped, 1 Failed, 0 Warned, 2 Passed
However, running the exact same test from the server itself succeeds.
./nfs4.0/testserver.py -v --outfile ~/pynfs40.out --maketree 9.155.107.180:/V4
--showomit --secure --rundep WRT5
INIT st_setclientid.testValid : RUNNING
INIT st_setclientid.testValid : PASS
MKFILE st_open.testOpen : RUNNING
MKFILE st_open.testOpen : PASS
WRT5 st_write.testLargeData : RUNNING
WRT5 st_write.testLargeData : PASS
**************************************************
INIT st_setclientid.testValid : PASS
MKFILE st_open.testOpen : PASS
WRT5 st_write.testLargeData : PASS
**************************************************
Command line asked for 3 of 668 tests
Of those: 0 Skipped, 0 Failed, 0 Warned, 3 Passed
>
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Nfs-ganesha-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nfs-ganesha-devel