Re: New panic in wdc_ata_bio_intr

Jaromír Doleček Tue, 17 Oct 2017 19:50:13 -0700

Not at the moment - on the end, I committed the version with flag.

Thanks for report and testing!


Jaromir

2017-10-16 21:12 GMT+02:00 Chavdar Ivanov <ci4...@gmail.com>:

> Well, that was a good one. Running just fine now:
>
> ~ uname -a
> NetBSD nt61p.lorien.lan 8.99.4 NetBSD 8.99.4 (GENERIC) #1: Mon Oct 16
> 20:01:05 BST 2017  
> sysbu...@nt61p.lorien.lan:/home/sysbuild/src/sys/arch/amd64/compile/GENERIC
> amd64
> ~ dmesg | grep wd0
> wd0 at atabus0 drive 0
> wd0: <Hitachi HTS725032A9A364>
> wd0: drive supports 16-sector PIO transfers, LBA48 addressing
> wd0: 298 GB, 620181 cyl, 16 head, 63 sec, 512 bytes/sect x 625142448
> sectors
> wd0: 32-bit data port
> wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100),
> NCQ (32 tags)
> wd0(piixide0:0:0): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using
> DMA)
> boot device: wd0
> root on wd0a dumps on wd0b
> ~ atactl wd0 identify
> Model: Hitachi HTS725032A9A364, Rev: PC3OCH0A, Serial #:
> 110320PCKC04BPJ53MLK
> World Wide Name: 5000CCA645DE827F
> Device type: ATA, fixed
> Capacity 320 Gbytes, 625142448 sectors, 512 bytes/sector
> Cylinders: 16383, heads: 16, sec/track: 63
> Command queue depth: 32
> Device capabilities:
>         DMA
>         LBA
>         IORDY operation
>         IORDY disabling
> Device supports following standards:
> ATA-2 ATA-3 ATA-4
> <https://maps.google.com/?q=3+ATA-4&entry=gmail&source=g> ATA-5 ATA-6
> ATA-7 ATA-8
> Command set support:
>         NOP command (enabled)
>         READ BUFFER command (enabled)
>         WRITE BUFFER command (enabled)
>         Look-ahead (enabled)
>         Write cache (enabled)
>         Power Management feature set (enabled)
>         Security Mode feature set (disabled)
>         SMART feature set (enabled)
>         FLUSH CACHE EXT command (enabled)
>         FLUSH CACHE command (enabled)
>         Device Configuration Overlay feature set (enabled)
>         48-bit Address feature set (enabled)
>         Advanced Power Management feature set (enabled)
>         DOWNLOAD MICROCODE command (enabled)
>         World Wide Name
>         General Purpose Logging feature set
>         SMART self-test
>         SMART error logging
> Serial ATA capabilities:
>         1.5Gb/s signaling
>         3.0Gb/s signaling
>         Native Command Queuing
>         PHY Event Counters
> Serial ATA features:
>         DMA Setup Auto Activate (disabled)
>         Device-Initiated Interface Power Managment (disabled)
>         Software Settings Preservation (enabled)
>
> Anything else to test?
>
> Chavdar Ivanov
>
> On Mon, 16 Oct 2017 at 19:07 Jaromír Doleček <jaromir.dole...@gmail.com>
> wrote:
>
>> Okay, can you try following patch? It puts puts back a flag for IRQ
>> handling. If it works, I might have an idea what's happening. I think there
>> is some rogue interrupt disturbing the state.
>>
>> If it doesn't work, can you please try to compile kernel with ATADEBUG,
>> and set atadebug_mask (possibly via ddb during boot) to 0x40?
>>
>> Jaromir
>>
>> 2017-10-15 23:10 GMT+02:00 Chavdar Ivanov <ci4...@gmail.com>:
>>
>>> Sorry, it still crashes the same way. I made sure all was updated before
>>> trying, I do have
>>>
>>> ident /netbsd  | grep wdc
>>>      $NetBSD: atapi_wdc.c,v 1.128 2017/10/10 21:37:49 jdolecek Exp $
>>>      $NetBSD: ata_wdc.c,v 1.108 2017/10/15 11:27:14 jdolecek Exp $
>>>      $NetBSD: wdc_isa.c,v 1.60 2017/10/07 16:05:32 jdolecek Exp $
>>>      $NetBSD: wdc_pcmcia.c,v 1.125 2017/10/07 16:05:33 jdolecek Exp $
>>>      $NetBSD: wdc.c,v 1.285 2017/10/15 18:02:33 jdolecek Exp $
>>>
>>> and the panic is exactly the same.
>>>
>>> I am sure I will sort out my problem on this particular machine if I
>>> swap the internal SSD and the one in the DVD bay, placing the NetBSD root
>>> in the proper place, but nevertheless the panic may indicate some other
>>> unfinished work, so I shall keep it as it is for testing.
>>>
>>> Chavdar Ivanov
>>>
>>> On Sun, 15 Oct 2017 at 19:03 Jaromír Doleček <jaromir.dole...@gmail.com>
>>> wrote:
>>>
>>>> Hi,
>>>>
>>>> should be fixed in rev. 1.285 of dev/ic/wdc.c, can you please check?
>>>>
>>>> Jaromir
>>>>
>>>> 2017-10-14 17:48 GMT+02:00 Chavdar Ivanov <ci4...@gmail.com>:
>>>>
>>>>> It still panics the same way, no difference.
>>>>>
>>>>> On my other laptop, an HP EliteBook, I haven't the problem at all,
>>>>> only on the two T61p's (one of them stopped working a week ago, though).
>>>>>
>>>>> Chavdar Ivanov
>>>>>
>>>>>
>>>>> On Sat, 14 Oct 2017 at 15:45 Jaromír Doleček <
>>>>> jaromir.dole...@gmail.com> wrote:
>>>>>
>>>>>> Sorry, this fixed patch
>>>>>>
>>>>>> 2017-10-14 16:23 GMT+02:00 Jaromír Doleček <jaromir.dole...@gmail.com
>>>>>> >:
>>>>>>
>>>>>>> Can you try attached patch?
>>>>>>>
>>>>>>> Jaromir
>>>>>>>
>>>>>>> 2017-10-11 1:04 GMT+02:00 Chavdar Ivanov <ci4...@gmail.com>:
>>>>>>>
>>>>>>>> The timeouts when running under VirtualBox disappeared, but of
>>>>>>>> course the panic on my T61p remains.
>>>>>>>>
>>>>>>>> Chavdar Ivanov
>>>>>>>>
>>>>>>>> On Tue, 10 Oct 2017 at 22:40 Jaromír Doleček <
>>>>>>>> jaromir.dole...@gmail.com> wrote:
>>>>>>>>
>>>>>>>>> Hey,
>>>>>>>>>
>>>>>>>>> can you try with dev/scsipi/atapi_wdc.c 1.128? That should resolve
>>>>>>>>> the timeouts for atapi, at least it did for me.
>>>>>>>>>
>>>>>>>>> Jaromir
>>>>>>>>>
>>>>>>>>> 2017-10-10 8:08 GMT+02:00 Rares Aioanei <bsdlis...@gmail.com>:
>>>>>>>>>
>>>>>>>>>> I get that also on VBox, except it doesn't try to add cd0a as a
>>>>>>>>>> swap
>>>>>>>>>> device, nor does it show an endless stream of "lost interrupt"
>>>>>>>>>> messages; eventually I get a login prompt. This is with
>>>>>>>>>> yesterday's
>>>>>>>>>> latest -CURRENT.
>>>>>>>>>>
>>>>>>>>>> On Sun, Oct 8, 2017 at 5:17 PM, Chavdar Ivanov <ci4...@gmail.com>
>>>>>>>>>> wrote:
>>>>>>>>>> > I tried the same kernel on a VirtualBox guest - it doesn't
>>>>>>>>>> crash, but one
>>>>>>>>>> > gets endless
>>>>>>>>>> >
>>>>>>>>>> > piixide0:1:0: lost interrupt
>>>>>>>>>> >         type: atapi tc_bcount: 0 tc_skip: 0
>>>>>>>>>> >
>>>>>>>>>> > stream of messages. Also /etc/rc.d/swap2 start hangs while
>>>>>>>>>> trying to add
>>>>>>>>>> > /dev/cd0a as a dump device... as shown by ktruss.
>>>>>>>>>> >
>>>>>>>>>> > Weird.
>>>>>>>>>> >
>>>>>>>>>> > Chavdar
>>>>>>>>>> >
>>>>>>>>>> > On Sun, 8 Oct 2017 at 11:55 Chavdar Ivanov <ci4...@gmail.com>
>>>>>>>>>> wrote:
>>>>>>>>>> >>
>>>>>>>>>> >> System updated about two hours ago. I am getting:
>>>>>>>>>> >>
>>>>>>>>>> >> ....
>>>>>>>>>> >> wd0 at atabus0 drive 0
>>>>>>>>>> >> wd0: <Hitachi HTS725032A9A364>
>>>>>>>>>> >> wd0: drive supports 16-sector PIO transfers, LBA48 addressing
>>>>>>>>>> >> wd0: 298 GB, 620181 cyl, 16 head, 63 sec, 512 bytes/sect x
>>>>>>>>>> 625142448
>>>>>>>>>> >> sectors
>>>>>>>>>> >> piixide0:0:0: bad state 0 in wdc_ata_bio_intr
>>>>>>>>>> >> panic: wdc_ata_bio_intr: bad state
>>>>>>>>>> >> fatal breakpoint trap in supervisor mode
>>>>>>>>>> >> trap type 1 code 0 rip 0xffffffff8021c0c5 cs 0x8 rflags 0x246
>>>>>>>>>> cr2 0 ilevel
>>>>>>>>>> >> 0x8 rsp 0xffffe40040003c38
>>>>>>>>>> >> curlwp 0xffffe4013bb27840 pid 0.2 lowest kstack
>>>>>>>>>> 0xffffe400400002c0
>>>>>>>>>> >> Stopped at pid 0.2 (system) at netbsd:breakpoint+0x5: leave
>>>>>>>>>> >> db{0}> bt
>>>>>>>>>> >> breakpoint() at netbsd:breakpoint+0x5
>>>>>>>>>> >> vpanic() at netbsd:vpanic+0x140
>>>>>>>>>> >> snprintf() at netbsd:snprintf
>>>>>>>>>> >> wdc_ata_bio_poll() at netbsd:wdc_ata_bio_poll
>>>>>>>>>> >> intr_biglock_wrapper() at netbsd:intr_biglock_wrapper+0x1d
>>>>>>>>>> >> Xintr_ioapic_edge10() at netbsd:Xintr_ioapic_edge10+0xee
>>>>>>>>>> >> --- interrupt ---
>>>>>>>>>> >> x86_mwait() at netbsd:x86_mwait+0xd
>>>>>>>>>> >> acpicpu_cstate_idel_enter() at netbsd:acpicpu_cstate_idle_
>>>>>>>>>> enter+0xdb
>>>>>>>>>> >> acpicpu_cstate_idle() at netbsd:acpicpu_cstate_idle+0xb6
>>>>>>>>>> >> idle_loop() at netbsd:idle_loop+0x18c
>>>>>>>>>> >> db{0}>
>>>>>>>>>> >> ....
>>>>>>>>>> >>
>>>>>>>>>> >> (that is on my usual ThinkPad T61p).
>>>>>>>>>> >>
>>>>>>>>>> >> Couldn't get a crash dump.
>>>>>>>>>> >>
>>>>>>>>>> >> Chavdar Ivanov
>>>>>>>>>> >>
>>>>>>>>>> >
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>
>>>>>>
>>>>
>>

Re: New panic in wdc_ata_bio_intr

Reply via email to