Re: vnd and softraid panic

2013-01-30 Thread Joel Sing
On Wed, 30 Jan 2013, Eivind Evensen wrote:
 On Wed, Jan 23, 2013 at 02:33:16AM +1100, Joel Sing wrote:
  On Thu, 3 Jan 2013, Eivind Evensen wrote:
   On Mon, Dec 31, 2012 at 07:21:08PM +1100, Joel Sing wrote:
On Mon, 31 Dec 2012, Eivind Evensen wrote:
 Hello.

 Trying to play around a bit with softraid using vnd reliably
 results in a panic when assembling the raid volume. I think the
 first time I tried this was around 4.9 so it's not something new.
  
   ...
 
  FWIW this should now be rectified in -current.

 I tried a snapshot downloaded on 21.th
 (OpenBSD 5.2-current (GENERIC) #17: Fri Jan 18 19:42:57 MST 2013)
 which produced the same results. I waited a few days in case the snapshot
 was too old and built from sources from yesterday, still giving
 the same results. I don't need such a setup so it's not a big deal, but
 here's output and dmesg in case it may be helpful:

Thanks for the report. For some reason I thought you had tried using softraid 
crypto on a vnd and overlooked the fact that you were trying to use RAID 5 
(and now RAID 1). To clarify, RAID 0, CONCAT and CRYPTO should all now work 
on top of a vnd - RAID 1, RAID 4 and RAID 5 will still break in the manner 
given below (panic related to active workunits). These disciplines require 
the same changes that I've made to the other three, however they're a little 
more complex and time consuming. I'll try to get to them fixed soon.

 root:skoeske dd if=/dev/zero of=disk1 bs=1m count=100
 100+0 records in
 100+0 records out
 104857600 bytes transferred in 2.875 secs (36462177 bytes/sec)
 root:skoeske r disk1=disk2
 dd if=/dev/zero of=disk2 bs=1m count=100
 100+0 records in
 100+0 records out
 104857600 bytes transferred in 3.219 secs (32567739 bytes/sec)
 root:skoeske r disk2=disk3
 dd if=/dev/zero of=disk3 bs=1m count=100
 100+0 records in
 100+0 records out
 104857600 bytes transferred in 3.569 secs (29378164 bytes/sec)
 root:skoeske vnconfig vnd0 disk1
 root:skoeske vnconfig vnd1 disk2
 root:skoeske vnconfig vnd2 disk3
 root:skoeske fdisk -iy vnd0
 Warning CHS values out of bounds only saving LBA values
 Writing MBR at offset 0.
 root:skoeske fdisk -iy vnd1
 Warning CHS values out of bounds only saving LBA values
 Writing MBR at offset 0.
 root:skoeske fdisk -iy vnd2
 Warning CHS values out of bounds only saving LBA values
 Writing MBR at offset 0.
 root:skoeske printf a\n\n\n\nRAID\nw\nq\n\n | disklabel -E vnd0
 Label editor (enter '?' for help at any prompt)

  partition: [a] offset: [128] size: [204672] FS type: [4.2BSD]   No
  label changes.

 root:skoeske printf a\n\n\n\nRAID\nw\nq\n\n | disklabel -E vnd1
 Label editor (enter '?' for help at any prompt)

  partition: [a] offset: [128] size: [204672] FS type: [4.2BSD]   No
  label changes.

 root:skoeske printf a\n\n\n\nRAID\nw\nq\n\n | disklabel -E vnd2
 Label editor (enter '?' for help at any prompt)

  partition: [a] offset: [128] size: [204672] FS type: [4.2BSD]   No
  label changes.

 root:skoeske bioctl -c 1 -l /dev/vnd0a,/dev/vnd1a,/dev/vnd2a softraid0
 sd0 at scsibus1 targ 1 lun 0: OPENBSD, SR RAID 1, 005 SCSI2 0/direct
 fixed sd0: 99MB, 512 bytes/sector, 204144 sectors
 softraid0: SR RAID 1 volume attached as sd0
 panic: softraid0: sr_wu_init got active wu
 Stopped at  Debugger+0x4:   popl%ebp
 RUN AT LEAST 'trace' AND 'ps' AND INCLUDE OUTPUT WHEN REPORTING THIS PANIC!
 DO NOT EVEN BOTHER REPORTING THIS WITHOUT INCLUDING THAT INFORMATION!
 ddb trace
 Debugger(d08fcddc,f2087de8,d08d0da4,f2087de8,d0a7e540) at Debugger+0x4
 panic(d08d0da4,d0f8c014,f2087dfc,d036ccf3,d0fedab8) at panic+0x5d
 sr_wu_init(d1055000,d0fedab8,f2087e3c,d1055a30,d1057f00) at sr_wu_init+0x73
 sr_wu_put(d1055000,d0fedab8,f2087e3c,f2087e3c,d02030dd) at sr_wu_put+0x2f
 scsi_io_put(d1055a30,d0fedab8,8000,d0fedab8,d0fedab8) at scsi_io_put+0x19
 scsi_xs_put(f2027000,f2027000,f2087e8c,d041de18,d1055000) at
 scsi_xs_put+0x37 sr_raid1_intr(d1065000,f1fabdc4,f17dc000,200,52000) at
 sr_raid1_intr+0x107 vndstrategy(d1065000,0,0,50,d1065000) at
 vndstrategy+0x70
 spec_strategy(f2087f48,0,f2087f6c,d03f2c28,d1053d90) at spec_strategy+0x3d
 VOP_STRATEGY(d1065000,0,0,0,d0fedaf8) at VOP_STRATEGY+0x2c
 sr_startwu_callback(d1055000,d0fedab8,d02008bf,d1053d80,d03f2c50) at
 sr_startwu _callback+0x39
 workq_thread(d1053d80) at workq_thread+0x36
 Bad frame pointer: 0xd0bc8ed8
 ddb ps
PID   PPID   PGRPUID  S   FLAGS  WAIT  COMMAND
 *23369  0  0  0  70x100200srdis
  19292   1571   6315   1000  30x80  ttyin less
   1571   6315   6315   1000  30x88  pause sh
   6315  30722   6315   1000  30x80  wait  man
  30722  11048  30722   1000  30x88  pause ksh
  11048   2130   2130   1000  30x80  selectsshd
   2130   1703   2130  0  30x80  poll  sshd
   2513  1   2513  0  20x80ksh
  20983  1  20983  0  30x80  ttyin 

Re: vnd and softraid panic

2013-01-30 Thread Eivind Evensen
On Thu, Jan 31, 2013 at 12:20:14AM +1100, Joel Sing wrote:
 On Wed, 30 Jan 2013, Eivind Evensen wrote:
  On Wed, Jan 23, 2013 at 02:33:16AM +1100, Joel Sing wrote:
   On Thu, 3 Jan 2013, Eivind Evensen wrote:
On Mon, Dec 31, 2012 at 07:21:08PM +1100, Joel Sing wrote:
 On Mon, 31 Dec 2012, Eivind Evensen wrote:
  Hello.
 
  Trying to play around a bit with softraid using vnd reliably
  results in a panic when assembling the raid volume. I think the
  first time I tried this was around 4.9 so it's not something new.
   
...
  
   FWIW this should now be rectified in -current.
 
  I tried a snapshot downloaded on 21.th
  (OpenBSD 5.2-current (GENERIC) #17: Fri Jan 18 19:42:57 MST 2013)
  which produced the same results. I waited a few days in case the snapshot
  was too old and built from sources from yesterday, still giving
  the same results. I don't need such a setup so it's not a big deal, but
  here's output and dmesg in case it may be helpful:
 
 Thanks for the report. For some reason I thought you had tried using softraid 
 crypto on a vnd and overlooked the fact that you were trying to use RAID 5 
 (and now RAID 1). To clarify, RAID 0, CONCAT and CRYPTO should all now work 
 on top of a vnd - RAID 1, RAID 4 and RAID 5 will still break in the manner 
 given below (panic related to active workunits). These disciplines require 
 the same changes that I've made to the other three, however they're a little 
 more complex and time consuming. I'll try to get to them fixed soon.

I see, thanks for the explanation.

Eivind.



Re: vnd and softraid panic

2013-01-29 Thread Eivind Evensen
On Wed, Jan 23, 2013 at 02:33:16AM +1100, Joel Sing wrote:
 On Thu, 3 Jan 2013, Eivind Evensen wrote:
  On Mon, Dec 31, 2012 at 07:21:08PM +1100, Joel Sing wrote:
   On Mon, 31 Dec 2012, Eivind Evensen wrote:
Hello.
   
Trying to play around a bit with softraid using vnd reliably results
in a panic when assembling the raid volume. I think the first time I
tried this was around 4.9 so it's not something new.
 
  ...
 
 
 FWIW this should now be rectified in -current.

I tried a snapshot downloaded on 21.th
(OpenBSD 5.2-current (GENERIC) #17: Fri Jan 18 19:42:57 MST 2013)
which produced the same results. I waited a few days in case the snapshot
was too old and built from sources from yesterday, still giving
the same results. I don't need such a setup so it's not a big deal, but
here's output and dmesg in case it may be helpful:


root:skoeske dd if=/dev/zero of=disk1 bs=1m count=100
100+0 records in
100+0 records out
104857600 bytes transferred in 2.875 secs (36462177 bytes/sec)
root:skoeske r disk1=disk2
dd if=/dev/zero of=disk2 bs=1m count=100
100+0 records in
100+0 records out
104857600 bytes transferred in 3.219 secs (32567739 bytes/sec)
root:skoeske r disk2=disk3
dd if=/dev/zero of=disk3 bs=1m count=100
100+0 records in
100+0 records out
104857600 bytes transferred in 3.569 secs (29378164 bytes/sec)
root:skoeske vnconfig vnd0 disk1
root:skoeske vnconfig vnd1 disk2
root:skoeske vnconfig vnd2 disk3
root:skoeske fdisk -iy vnd0
Warning CHS values out of bounds only saving LBA values
Writing MBR at offset 0.
root:skoeske fdisk -iy vnd1 
Warning CHS values out of bounds only saving LBA values
Writing MBR at offset 0.
root:skoeske fdisk -iy vnd2 
Warning CHS values out of bounds only saving LBA values
Writing MBR at offset 0.
root:skoeske printf a\n\n\n\nRAID\nw\nq\n\n | disklabel -E vnd0
Label editor (enter '?' for help at any prompt)
 partition: [a] offset: [128] size: [204672] FS type: [4.2BSD]   No label 
 changes.
root:skoeske printf a\n\n\n\nRAID\nw\nq\n\n | disklabel -E vnd1 
Label editor (enter '?' for help at any prompt)
 partition: [a] offset: [128] size: [204672] FS type: [4.2BSD]   No label 
 changes.
root:skoeske printf a\n\n\n\nRAID\nw\nq\n\n | disklabel -E vnd2 
Label editor (enter '?' for help at any prompt)
 partition: [a] offset: [128] size: [204672] FS type: [4.2BSD]   No label 
 changes.
root:skoeske bioctl -c 1 -l /dev/vnd0a,/dev/vnd1a,/dev/vnd2a softraid0
sd0 at scsibus1 targ 1 lun 0: OPENBSD, SR RAID 1, 005 SCSI2 0/direct fixed
sd0: 99MB, 512 bytes/sector, 204144 sectors
softraid0: SR RAID 1 volume attached as sd0
panic: softraid0: sr_wu_init got active wu
Stopped at  Debugger+0x4:   popl%ebp
RUN AT LEAST 'trace' AND 'ps' AND INCLUDE OUTPUT WHEN REPORTING THIS PANIC!
DO NOT EVEN BOTHER REPORTING THIS WITHOUT INCLUDING THAT INFORMATION!
ddb trace
Debugger(d08fcddc,f2087de8,d08d0da4,f2087de8,d0a7e540) at Debugger+0x4
panic(d08d0da4,d0f8c014,f2087dfc,d036ccf3,d0fedab8) at panic+0x5d
sr_wu_init(d1055000,d0fedab8,f2087e3c,d1055a30,d1057f00) at sr_wu_init+0x73
sr_wu_put(d1055000,d0fedab8,f2087e3c,f2087e3c,d02030dd) at sr_wu_put+0x2f
scsi_io_put(d1055a30,d0fedab8,8000,d0fedab8,d0fedab8) at scsi_io_put+0x19
scsi_xs_put(f2027000,f2027000,f2087e8c,d041de18,d1055000) at scsi_xs_put+0x37
sr_raid1_intr(d1065000,f1fabdc4,f17dc000,200,52000) at sr_raid1_intr+0x107
vndstrategy(d1065000,0,0,50,d1065000) at vndstrategy+0x70
spec_strategy(f2087f48,0,f2087f6c,d03f2c28,d1053d90) at spec_strategy+0x3d
VOP_STRATEGY(d1065000,0,0,0,d0fedaf8) at VOP_STRATEGY+0x2c
sr_startwu_callback(d1055000,d0fedab8,d02008bf,d1053d80,d03f2c50) at sr_startwu
_callback+0x39
workq_thread(d1053d80) at workq_thread+0x36
Bad frame pointer: 0xd0bc8ed8
ddb ps
   PID   PPID   PGRPUID  S   FLAGS  WAIT  COMMAND
*23369  0  0  0  70x100200srdis
 19292   1571   6315   1000  30x80  ttyin less
  1571   6315   6315   1000  30x88  pause sh
  6315  30722   6315   1000  30x80  wait  man
 30722  11048  30722   1000  30x88  pause ksh
 11048   2130   2130   1000  30x80  selectsshd
  2130   1703   2130  0  30x80  poll  sshd
  2513  1   2513  0  20x80ksh
 20983  1  20983  0  30x80  ttyin getty
 22576  1  22576  0  30x80  ttyin getty
 19748  1  19748  0  30x80  ttyin getty
 21880  1  21880  0  30x80  ttyin getty
  5243  1   5243  0  30x80  ttyin getty
 32690  1  32690  0  30x80  selectcron
 24485  1  24485 99  30x80  poll  sndiod
99  1 99  0  30x80  selectinetd
 22458  15830  15830 95  30x80  kqreadsmtpd
 17649  15830  15830 95  30x80  kqreadsmtpd
 17949  15830  15830 95  30x80  kqreadsmtpd
  3934  

Re: vnd and softraid panic

2013-01-22 Thread Joel Sing
On Thu, 3 Jan 2013, Eivind Evensen wrote:
 On Mon, Dec 31, 2012 at 07:21:08PM +1100, Joel Sing wrote:
  On Mon, 31 Dec 2012, Eivind Evensen wrote:
   Hello.
  
   Trying to play around a bit with softraid using vnd reliably results
   in a panic when assembling the raid volume. I think the first time I
   tried this was around 4.9 so it's not something new.

 ...

   ddb trace
   Debugger(d08fa43c,f2017e08,d08ce53b,f2017e08,0) at Debugger+0x4
   panic(d08ce53b,d0f8a014,f2017e3c,d105fa30,d105ce00) at panic+0x5d
   sr_wu_put(d105f000,d0ff12b8,f2017e3c,f2017e3c,d02030dd) at
   sr_wu_put+0x104 scsi_io_put(d105fa30,d0ff12b8,8000,d1068000,d1068000)
   at scsi_io_put+0x19
   scsi_xs_put(f1f4d000,d1068000,f2017e8c,d0418d98,f1f4d000) at
   scsi_xs_put+0x37 sr_raidp_intr(d1068000,f1e8601c,f11ec000,200,52000) at
   sr_raidp_intr+0x15b vndstrategy(d1068000,0,0,50,d1068000) at
   vndstrategy+0x70
   spec_strategy(f2017f48,0,f2017f6c,d03ee028,d1053f50) at
   spec_strategy+0x3d VOP_STRATEGY(d1068000,0,0,0,d0ff12f8) at
   VOP_STRATEGY+0x2c
   sr_startwu_callback(d105f000,d0ff12b8,d02008bf,d1053f40,d03ee050) at
   sr_startwu _callback+0x39
   workq_thread(d1053f40) at workq_thread+0x36
   Bad frame pointer: 0xd0bc6ed8
 
  Thanks - it is a known issue, which I hope to be able to finish
  addressing during the next hackathon.

 Nice to know. If it helps, I can test patches.

FWIW this should now be rectified in -current.
-- 

Reason is not automatic. Those who deny it cannot be conquered by it.
 Do not count on them. Leave them alone. -- Ayn Rand



Re: vnd and softraid panic

2013-01-02 Thread Eivind Evensen
On Mon, Dec 31, 2012 at 07:21:08PM +1100, Joel Sing wrote:
 On Mon, 31 Dec 2012, Eivind Evensen wrote:
  Hello.
 
  Trying to play around a bit with softraid using vnd reliably results
  in a panic when assembling the raid volume. I think the first time I
  tried this was around 4.9 so it's not something new.
...
  ddb trace
  Debugger(d08fa43c,f2017e08,d08ce53b,f2017e08,0) at Debugger+0x4
  panic(d08ce53b,d0f8a014,f2017e3c,d105fa30,d105ce00) at panic+0x5d
  sr_wu_put(d105f000,d0ff12b8,f2017e3c,f2017e3c,d02030dd) at sr_wu_put+0x104
  scsi_io_put(d105fa30,d0ff12b8,8000,d1068000,d1068000) at scsi_io_put+0x19
  scsi_xs_put(f1f4d000,d1068000,f2017e8c,d0418d98,f1f4d000) at
  scsi_xs_put+0x37 sr_raidp_intr(d1068000,f1e8601c,f11ec000,200,52000) at
  sr_raidp_intr+0x15b vndstrategy(d1068000,0,0,50,d1068000) at
  vndstrategy+0x70
  spec_strategy(f2017f48,0,f2017f6c,d03ee028,d1053f50) at spec_strategy+0x3d
  VOP_STRATEGY(d1068000,0,0,0,d0ff12f8) at VOP_STRATEGY+0x2c
  sr_startwu_callback(d105f000,d0ff12b8,d02008bf,d1053f40,d03ee050) at
  sr_startwu _callback+0x39
  workq_thread(d1053f40) at workq_thread+0x36
  Bad frame pointer: 0xd0bc6ed8
 
 Thanks - it is a known issue, which I hope to be able to finish addressing 
 during the next hackathon.

Nice to know. If it helps, I can test patches.

Regards,
Eivind



Re: vnd and softraid panic

2012-12-31 Thread Joel Sing
On Mon, 31 Dec 2012, Eivind Evensen wrote:
 Hello.

 Trying to play around a bit with softraid using vnd reliably results
 in a panic when assembling the raid volume. I think the first time I
 tried this was around 4.9 so it's not something new.

 While the combination of vnd and softraid may not be useful for any
 real purpose, I noticed this while hoping to be able to fail a disk in
 a somewhat simpler to retry manner than the nailgun approach I read
 about here recently...

 I don't know if it's important, but also after the trace and ps listed
 below, boot reboot won't reboot the first time, but I can enter ddb
 again by sending a serial break and then get the machine to reboot by
 reexecuting the boot reboot command.

 Regards,
 Eivind

 root:skoeske dd if=/dev/zero of=disk1 bs=1m count=100
 100+0 records in
 100+0 records out
 104857600 bytes transferred in 3.852 secs (27215650 bytes/sec)
 root:skoeske r disk1=disk2
 dd if=/dev/zero of=disk2 bs=1m count=100
 100+0 records in
 100+0 records out
 104857600 bytes transferred in 2.569 secs (40802861 bytes/sec)
 root:skoeske r disk2=disk3
 dd if=/dev/zero of=disk3 bs=1m count=100
 100+0 records in
 100+0 records out
 104857600 bytes transferred in 3.750 secs (27957919 bytes/sec)
 root:skoeske vnconfig vnd0 disk1
 root:skoeske vnconfig vnd1 disk2
 root:skoeske vnconfig vnd2 disk3
 root:skoeske fdisk -iy vnd0
 Warning CHS values out of bounds only saving LBA values
 Writing MBR at offset 0.
 root:skoeske fdisk -iy vnd1
 Warning CHS values out of bounds only saving LBA values
 Writing MBR at offset 0.
 root:skoeske fdisk -iy vnd2
 Warning CHS values out of bounds only saving LBA values
 Writing MBR at offset 0.
 root:skoeske printf a\n\n\n\nRAID\nw\nq\n\n | disklabel -E vnd0
 Label editor (enter '?' for help at any prompt)

  partition: [a] offset: [128] size: [204672] FS type: [4.2BSD]   No
  label changes.

 root:skoeske printf a\n\n\n\nRAID\nw\nq\n\n | disklabel -E vnd1
 Label editor (enter '?' for help at any prompt)

  partition: [a] offset: [128] size: [204672] FS type: [4.2BSD]   No
  label changes.

 root:skoeske printf a\n\n\n\nRAID\nw\nq\n\n | disklabel -E vnd2
 Label editor (enter '?' for help at any prompt)

  partition: [a] offset: [128] size: [204672] FS type: [4.2BSD]   No
  label changes.

 root:skoeske bioctl -c 5 -l /dev/vnd0a,/dev/vnd1a,/dev/vnd2a softraid0
 sd0 at scsibus1 targ 1 lun 0: OPENBSD, SR RAID 5, 005 SCSI2 0/direct
 fixed sd0: 199MB, 512 bytes/sector, 408064 sectors
 softraid0: SR RAID 5 volume attached as sd0
 panic: softraid0: sr_wu_put got active wu
 Stopped at  Debugger+0x4:   popl%ebp
 RUN AT LEAST 'trace' AND 'ps' AND INCLUDE OUTPUT WHEN REPORTING THIS PANIC!
 DO NOT EVEN BOTHER REPORTING THIS WITHOUT INCLUDING THAT INFORMATION!
 ddb trace
 Debugger(d08fa43c,f2017e08,d08ce53b,f2017e08,0) at Debugger+0x4
 panic(d08ce53b,d0f8a014,f2017e3c,d105fa30,d105ce00) at panic+0x5d
 sr_wu_put(d105f000,d0ff12b8,f2017e3c,f2017e3c,d02030dd) at sr_wu_put+0x104
 scsi_io_put(d105fa30,d0ff12b8,8000,d1068000,d1068000) at scsi_io_put+0x19
 scsi_xs_put(f1f4d000,d1068000,f2017e8c,d0418d98,f1f4d000) at
 scsi_xs_put+0x37 sr_raidp_intr(d1068000,f1e8601c,f11ec000,200,52000) at
 sr_raidp_intr+0x15b vndstrategy(d1068000,0,0,50,d1068000) at
 vndstrategy+0x70
 spec_strategy(f2017f48,0,f2017f6c,d03ee028,d1053f50) at spec_strategy+0x3d
 VOP_STRATEGY(d1068000,0,0,0,d0ff12f8) at VOP_STRATEGY+0x2c
 sr_startwu_callback(d105f000,d0ff12b8,d02008bf,d1053f40,d03ee050) at
 sr_startwu _callback+0x39
 workq_thread(d1053f40) at workq_thread+0x36
 Bad frame pointer: 0xd0bc6ed8

Thanks - it is a known issue, which I hope to be able to finish addressing 
during the next hackathon.

 ddb ps
PID   PPID   PGRPUID  S   FLAGS  WAIT  COMMAND
 * 4458  0  0  0  70x100200srdis
   9614   9263  31326   1000  30x80  ttyin less
   9263  31326  31326   1000  30x88  pause sh
  31326   8662  31326   1000  30x80  wait  man
   8662  24219   8662   1000  30x88  pause ksh
  24219   7388   7388   1000  30x80  selectsshd
   7388   1358   7388  0  30x80  poll  sshd
   8332  1   8332  0  20x80ksh
  21947  1  21947  0  30x80  ttyin getty
  28840  1  28840  0  30x80  ttyin getty
  32669  1  32669  0  30x80  ttyin getty
  25686  1  25686  0  30x80  ttyin getty
  19769  1  19769  0  30x80  ttyin getty
  10627  1  10627  0  30x80  selectcron
  14888  1  14888 99  30x80  poll  sndiod
   5483  1   5483  0  30x80  selectinetd
  27714  10979  10979 95  30x80  kqreadsmtpd
   9877  10979  10979 95  30x80  kqreadsmtpd
  31352  10979  10979 95  30x80  kqreadsmtpd
  17499