Re: oops in rbd module (con_work in libceph)

2012-07-12 Thread Yann Dupont
Le 11/07/2012 22:23, Yann Dupont a écrit : Le 10/07/2012 19:46, Gregory Farnum a écrit : Each time, at the exact date, a bad CRC (they are the only ones for this day, so it seems related) Yes; a bad CRC should cause the socket to close — that's intended behavior (although you might want to

Re: oops in rbd module (con_work in libceph)

2012-07-12 Thread Yann Dupont
Le 12/07/2012 09:15, Yann Dupont a écrit : Le 11/07/2012 22:23, Yann Dupont a écrit : Le 10/07/2012 19:46, Gregory Farnum a écrit : Each time, at the exact date, a bad CRC (they are the only ones for this day, so it seems related) Yes; a bad CRC should cause the socket to close — that's

Re: oops in rbd module (con_work in libceph)

2012-07-12 Thread Yann Dupont
Le 12/07/2012 22:20, Yann Dupont a écrit : Tomorrow I'll try to see if I made errors on my git bisect. I'll also try to format the rbd with xfs to see if it's really ext4 related. Well. I can confirm it's a race between ext4 rbd (or maybe just a plain ext4 bug, but could it explain the

Re: oops in rbd module (con_work in libceph)

2012-07-11 Thread Yann Dupont
Le 10/07/2012 19:46, Gregory Farnum a écrit : Each time, at the exact date, a bad CRC (they are the only ones for this day, so it seems related) Yes; a bad CRC should cause the socket to close — that's intended behavior (although you might want to look into why that's happening, ah ! very

Re: oops in rbd module (con_work in libceph)

2012-07-10 Thread Gregory Farnum
On Mon, Jul 9, 2012 at 10:04 AM, Yann Dupont yann.dup...@univ-nantes.fr wrote: Le 09/07/2012 18:54, Yann Dupont a écrit : Ok. I've compiled the kernel this afternoon, and tested it without much success : Jul 9 18:17:23 label5.u14.univ-nantes.prive kernel: [ 284.116236] libceph: osd0

Re: oops in rbd module (con_work in libceph)

2012-07-09 Thread Yann Dupont
Le 07/07/2012 02:16, Alex Elder a écrit : [...] There are a number of bugs that have been fixed since Linux 3.4, and the fixes have not made it into the 3.4.y stable releases. I just sent an announcement about the Ceph stable branch that's available in the Ceph git repository. If possible I

Re: oops in rbd module (con_work in libceph)

2012-07-09 Thread Yann Dupont
Le 09/07/2012 18:54, Yann Dupont a écrit : Ok. I've compiled the kernel this afternoon, and tested it without much success : Jul 9 18:17:23 label5.u14.univ-nantes.prive kernel: [ 284.116236] libceph: osd0 172.20.14.130:6801 socket closed Jul 9 18:17:43 label5.u14.univ-nantes.prive kernel:

oops in rbd module (con_work in libceph)

2012-07-06 Thread Yann Dupont
Hello. Bug happens in rbd client, at least in Kernel 3.4.4 . I have a completely reproductible bug. here is the oops : Jul 6 10:16:52 label5.u14.univ-nantes.prive kernel: [ 329.456285] EXT4-fs (rbd1): mounted filesystem with ordered data mode. Opts: (null) Jul 6 10:18:38

Re: oops in rbd module (con_work in libceph)

2012-07-06 Thread Yann Dupont
Le 06/07/2012 10:31, Yann Dupont a écrit : Hello. Bug happens in rbd client, at least in Kernel 3.4.4 . I have a completely reproductible bug. just a note : 3.2.22 doesn't seems to exhibit the problem. I repeated the process 2 times without problems on this kernel. I'll launch realistic

Re: oops in rbd module (con_work in libceph)

2012-07-06 Thread Alex Elder
On 07/06/2012 10:35 AM, Yann Dupont wrote: Le 06/07/2012 10:31, Yann Dupont a écrit : Hello. Bug happens in rbd client, at least in Kernel 3.4.4 . I have a completely reproductible bug. just a note : 3.2.22 doesn't seems to exhibit the problem. I repeated the process 2 times without