System freeze: Adaptec (aac) timeouts (releng 8)

2011-09-14 Thread Dennis Koegel
Cheers,

we have a reproducible system freeze due to Adaptec driver (aac) timeouts:

Sep  3 05:26:44 foo kernel: aac0: COMMAND 0xff80005ae4c0 (TYPE 502) TIMEOUT AFTER 129 SECONDS
Sep  3 05:26:44 foo kernel: aac0: COMMAND 0xff80005ac0e0 (TYPE 502) TIMEOUT AFTER 129 SECONDS
Sep  3 05:26:44 foo kernel: aac0: COMMAND 0xff80005b0fa0 (TYPE 502) TIMEOUT AFTER 129 SECONDS


Once this happens, the userland seems to be alive, but the controller is
completely dead. As soon as the disk subsystem is involved, any process
hangs forever (e.g. SSH crypto-exchange still happens, but a shell won't
even start anymore).

We observe the same issue on two systems of (mostly) identical spec, so
it's unlikely to be a fault in one particular piece of hardware.

Apparently this only happens under heavy disk i/o and high cpu load.
Notably high write throughput plus a 'zpool scrub' on a large
GELI-backed zpool usually triggers the problem after a few hours.
Without high activity, they run smoothly for weeks.

Both systems are amd64 with an Adaptec 5805 controller and 16 disks (of
which two form a RAID-1 system volume (UFS), and the remaining 14 serve
as JBOD for a large zpool -- a total of 15 "aacd" devices).

Both were running 8.2R originally. I've taken them to 8-STABLE now and
also applied svn r222951 (where the MFC was forgotten, it seems), but
the problem remains.

Any help is greatly appreciated.

Thanks,
- D.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"
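
For anyone triaging a similar freeze, the timeout messages quoted in the report above can be tallied with a small script. The following is a minimal Python sketch and not part of the original post: the regular expression is modelled only on the three log lines shown, it assumes each message sits on a single line in the real log file, and the names `AAC_TIMEOUT`, `parse_aac_timeouts` and `summarize` are hypothetical.

```python
import re
from collections import Counter

# Pattern modelled on the three syslog lines quoted in the report; the field
# layout is an assumption drawn from those samples, not from the aac(4) source.
AAC_TIMEOUT = re.compile(
    r"(?P<ctrl>aac\d+): COMMAND (?P<addr>0x[0-9a-fA-F]+) "
    r"\(TYPE (?P<type>\d+)\) TIMEOUT AFTER (?P<secs>\d+) SECONDS"
)


def parse_aac_timeouts(lines):
    """Return one dict per log line that matches the timeout pattern."""
    events = []
    for line in lines:
        match = AAC_TIMEOUT.search(line)
        if match:
            events.append({
                "controller": match.group("ctrl"),
                "command": match.group("addr"),
                "type": int(match.group("type")),
                "seconds": int(match.group("secs")),
            })
    return events


def summarize(events):
    """Count timed-out commands per controller."""
    return Counter(event["controller"] for event in events)


# The three messages from the report, each on one line.
SAMPLE = [
    "Sep  3 05:26:44 foo kernel: aac0: COMMAND 0xff80005ae4c0 (TYPE 502) TIMEOUT AFTER 129 SECONDS",
    "Sep  3 05:26:44 foo kernel: aac0: COMMAND 0xff80005ac0e0 (TYPE 502) TIMEOUT AFTER 129 SECONDS",
    "Sep  3 05:26:44 foo kernel: aac0: COMMAND 0xff80005b0fa0 (TYPE 502) TIMEOUT AFTER 129 SECONDS",
]

if __name__ == "__main__":
    for controller, count in summarize(parse_aac_timeouts(SAMPLE)).items():
        print(f"{controller}: {count} timed-out commands")
```

Feeding it the lines of a real log file instead of `SAMPLE` would show whether the timeouts cluster on one controller and how long the commands had been outstanding.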


FreeBSD Status Report April - June, 2011

2011-09-14 Thread Daniel Gerzo
FreeBSD Quarterly Status Report - April-June, 2011

Introduction

   This report covers FreeBSD-related projects between April and June
   2011. It is the second of the four reports planned for 2011. Since this
   quarter, work has focused on the next major version of FreeBSD, 9.0,
   which is to be released in September.

   Thanks to all the reporters for the excellent work! This report
   contains 36 entries and we hope you enjoy reading it.

   Please note that the deadline for submissions covering the period
   between July and September 2011 is October 15th, 2011.
 __

Projects

 * Clang replacing GCC in the base system
 * Fix clang warnings
 * libarchive, bsdtar, bsdcpio
 * ZFS pool version 28

FreeBSD Team Reports

 * ArabBSD
 * The FreeBSD Foundation

Network Infrastructure

 * DIstributed Firewall and Flow-shaper Using Statistical Evidence
   (DIFFUSE)
 * FreeBSD IPv6-only Support
 * IPv6 RA Handling Improvements
 * netmap
 * New ipfw features
 * TCP User Timeout Option (UTO)

Kernel

 * Intel GPU Driver
 * OpenAFS port
 * Overhaul of the mii(4)-subsystem
 * Status Report for NFS

Documentation

 * FreeBSD June 6th, 2011 Doc Sprint
 * The FreeBSD Dutch Documentation Project
 * The FreeBSD Japanese Documentation Project

Architectures

 * FreeBSD on the Sony Playstation 3
 * FreeBSD/arm on Marvell Armada XP
 * FreeBSD/powerpc on AppliedMicro APM86290
 * FreeBSD/powerpc64 on IBM pSeries machines
 * FreeBSD/sparc64

Ports

 * Chromium
 * FreeBSD Haskell Ports
 * KDE-FreeBSD
 * libvirt networking port
 * Portbuilder
 * Ports Collection

Miscellaneous

 * bsd_day(2011)

Google Summer of Code

 * Capsicum adaptation and core libraries
 * Disk device error counters
 * Google Summer of Code
 * nvi-iconv
 * Replacing the Regular Expression Code
 __

ArabBSD

   URL: https://sites.google.com/site/arabbsd/

   Contact: Mohammed Farrag 

   FreeBSD Awareness, Handbook Translation and FreeBSD Kernel Development
   Summer Course.

Open tasks:

1. FreeBSD Kernel Development Summer Course.
 __

bsd_day(2011)

   URL: http://bsdday.eu/2011

   Contact: Martin Matuska 
   Contact: Gábor Páli 

   The purpose of this one-day event is to gather Central European
   developers of today's open-source BSD systems to popularize their work
   and their organizations, and to meet each other in real life. We
   would also like to motivate potential future developers and users,
   especially undergraduate university students, to work with BSD systems.

   This year's BSD-Day will be held in Bratislava, Slovakia at Slovak
   University of Technology, Faculty of Electrical Engineering and
   Information Technology on November 5, 2011.

   Everybody is welcome!

Open tasks:

1. Apply. We are looking for you!
 __

Capsicum adaptation and core libraries

   URL: http://www.cl.cam.ac.uk/research/security/capsicum
   URL: http://wiki.FreeBSD.org/SOC2011IlyaBakulin

   Contact: Ilya Bakulin 

   Some applications from the base system have received sandboxing
   support. The current task is to adapt the lightweight resolver daemon
   for use in sandboxes, which fixes problems with applications that need
   to convert IP addresses into domain names while sandboxed.

Open tasks:

1. Add sandboxing to even more applications in the base system.
2. Help Jonathan Anderson and Robert Watson to merge FreeBSD-Capsicum
   into FreeBSD-HEAD.
 __

Chromium

   URL: http://www.chromium.org/Home
   URL: http://trillian.chruetertee.org/chromium

   Contact: Chromium on FreeBSD Team 

   During the last quarter we have been keeping the Chromium browser up to
   date, with new major releases being imported into the Ports Collection
   the same day as the upstream release. As time passes, more patches
   are incorporated or otherwise become obsolete by virtue of upstream
   code cleanups. Version 13 is already available from the Chruëtertee
   repository, with 70 fewer patches than version 12.
 __

Clang replacing GCC in the base system

   URL: http://wiki.FreeBSD.org/BuildingFreeBSDWithClang
   URL: http://wiki.FreeBSD.org/PortsAndClang

   Contact: Dimitry Andric 
   Contact: Roman Divacky 
   Contact: Brooks Davis 
   Contact: Pawel Worach 

   We imported a newer snapshot of clang/llvm. It features quite a lot of
   goodies. Most notably, there's a new register allocator that brings much
   better runtime performance. If you did a performance evaluation of
   clang/llvm in t

make installworld fails (touch not found)

2011-09-14 Thread David Demelier

Hi,

Today I pulled the latest changes for the RELENG_8 branch and got an
error when running the make installworld target:


===> gnu/usr.bin/texinfo/doc (install)
install-info --quiet  --defsection=Miscellaneous  --defentry=  info.info /usr/share/info/dir
install-info --quiet  --defsection=Miscellaneous  --defentry= info-stnd.info /usr/share/info/dir
install-info --quiet  --defsection=Miscellaneous  --defentry= texinfo.info /usr/share/info/dir
install -o root -g wheel -m 444  info.info.gz info-stnd.info.gz texinfo.info.gz /usr/share/info

===> include (install)
creating osreldate.h from newvers.sh
touch: not found
*** Error code 127

Stop in /usr/src/include.
*** Error code 1

Anyone having the same issue?

Cheers,

--
David Demelier


Re: make installworld fails (touch not found)

2011-09-14 Thread Kevin Oberman
On Wed, Sep 14, 2011 at 9:09 AM, David Demelier wrote:
> Hi,
>
> Today I pulled up the last changes for RELENG_8 branch and I got an error
> when doing the make installworld target :
>
> ===> gnu/usr.bin/texinfo/doc (install)
> install-info --quiet  --defsection=Miscellaneous  --defentry=  info.info
> /usr/share/info/dir
> install-info --quiet  --defsection=Miscellaneous  --defentry= info-stnd.info
> /usr/share/info/dir
> install-info --quiet  --defsection=Miscellaneous  --defentry= texinfo.info
> /usr/share/info/dir
> install -o root -g wheel -m 444  info.info.gz info-stnd.info.gz
> texinfo.info.gz /usr/share/info
> ===> include (install)
> creating osreldate.h from newvers.sh
> touch: not found
> *** Error code 127
>
> Stop in /usr/src/include.
> *** Error code 1
>
> Anyone having the same issue?

Check the setting of the date/time on the system.
-- 
R. Kevin Oberman, Network Engineer - Retired
E-mail: kob6...@gmail.com
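
The date/time hint above can be checked mechanically. One plausible reading (an assumption, not stated in the thread) is that a wrong clock leaves files stamped in the future, so make considers targets out of date and tries to rebuild them during installworld. The minimal Python sketch below lists files under a directory whose modification time is ahead of the current clock; the helper name `find_future_files` and the 60-second tolerance are hypothetical choices made for illustration.

```python
import os
import sys
import time


def find_future_files(root, now=None, slack=60.0):
    """Yield (path, seconds_ahead) for files under root dated after `now`.

    `slack` is an arbitrary tolerance in seconds so that small clock
    jitter is not reported.
    """
    now = time.time() if now is None else now
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                mtime = os.lstat(path).st_mtime
            except OSError:
                # The file vanished or is unreadable; skip it.
                continue
            if mtime > now + slack:
                yield path, mtime - now


if __name__ == "__main__":
    # Directories to scan are taken from the command line.
    for root in sys.argv[1:]:
        for path, ahead in find_future_files(root):
            print(f"{path}: {ahead:.0f} s in the future")
```

Pointing it at the source and object trees (for example /usr/src and /usr/obj) before running installworld would show whether any timestamps are ahead of the system clock. It says nothing about other causes of the same error.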


Re: System freeze: Adaptec (aac) timeouts (releng 8)

2011-09-14 Thread Elliot Finley
I was having the exact same problem using an Adaptec 52445.  After
downloading and using the latest driver from the adaptec website, the
problems stopped.  I haven't had a single freeze since using the new
code.  The newest driver from the website has source code with it, so
it shouldn't be that big of a deal to incorporate it into the base
system.  I emailed the authors of the aac driver (Mike Smith and
Scott Long), but they have both retired.  So I'm not really sure how
to get this code into the base.  If anyone knows, please take up the
charge.

On Wed, Sep 14, 2011 at 2:08 AM, Dennis Koegel  wrote:
> Cheers,
>
> we have a reproducible system freeze due to Adaptec driver (aac) timeouts:
>
> Sep  3 05:26:44 foo kernel: aac0: COMMAND 0xff80005ae4c0 (TYPE 502) TIMEOUT AFTER 129 SECONDS
> Sep  3 05:26:44 foo kernel: aac0: COMMAND 0xff80005ac0e0 (TYPE 502) TIMEOUT AFTER 129 SECONDS
> Sep  3 05:26:44 foo kernel: aac0: COMMAND 0xff80005b0fa0 (TYPE 502) TIMEOUT AFTER 129 SECONDS
> 
>
> Once this happens, the userland seems to be alive, but the controller is
> completely dead. As soon as the disk subsystem is involved, any process
> hangs forever (e.g. SSH crypto-exchange still happens, but a shell won't
> even start anymore).
>
> We observe the same issue on two systems of (mostly) identical spec, so
> it's not a hardware issue.
>
> Apparently this only happens under heavy disk i/o and high cpu load.
> Notably high write throughput plus a 'zpool scrub' on a large
> GELI-backed zpool usually triggers the problem after a few hours.
> Without high activity, they run smooth for weeks.
>
> Both systems are amd64 with an Adaptec 5805 controller and 16 disks (of
> which two form a RAID-1 system volume (UFS), and the remaining 14 serve
> as JBOD for a large zpool -- a total of 15 "aacd" devices).
>
> Both were running 8.2R originally. I've taken them to 8-STABLE now and
> also applied svn r222951 (where the MFC was forgotten, it seems), but
> the problem remains.
>
> Any help is greatly appreciated.
>
> Thanks,
> - D.


Re: make installworld fails (touch not found)

2011-09-14 Thread Boris Kochergin

On 09/14/11 12:37, Kevin Oberman wrote:
> On Wed, Sep 14, 2011 at 9:09 AM, David Demelier wrote:
> > Hi,
> >
> > Today I pulled up the last changes for RELENG_8 branch and I got an error
> > when doing the make installworld target :
> >
> > ===> gnu/usr.bin/texinfo/doc (install)
> > install-info --quiet  --defsection=Miscellaneous  --defentry=  info.info
> > /usr/share/info/dir
> > install-info --quiet  --defsection=Miscellaneous  --defentry= info-stnd.info
> > /usr/share/info/dir
> > install-info --quiet  --defsection=Miscellaneous  --defentry= texinfo.info
> > /usr/share/info/dir
> > install -o root -g wheel -m 444  info.info.gz info-stnd.info.gz
> > texinfo.info.gz /usr/share/info
> > ===> include (install)
> > creating osreldate.h from newvers.sh
> > touch: not found
> > *** Error code 127
> >
> > Stop in /usr/src/include.
> > *** Error code 1
> >
> > Anyone having the same issue?
>
> Check the setting of the date/time on the system.


I've also seen this happen when my /usr/src was out of sync with /usr/obj.

-Boris


R: make installworld fails (touch not found)

2011-09-14 Thread Barbara

>
>Hi,
>
>Today I pulled up the last changes for RELENG_8 branch and I got an 
>error when doing the make installworld target :
>
>===> gnu/usr.bin/texinfo/doc (install)
>install-info --quiet  --defsection=Miscellaneous  --defentry=  info.info 
>/usr/share/info/dir
>install-info --quiet  --defsection=Miscellaneous  --defentry= 
>info-stnd.info /usr/share/info/dir
>install-info --quiet  --defsection=Miscellaneous  --defentry= 
>texinfo.info /usr/share/info/dir
>install -o root -g wheel -m 444  info.info.gz info-stnd.info.gz 
>texinfo.info.gz /usr/share/info
>===> include (install)
>creating osreldate.h from newvers.sh
>touch: not found
>*** Error code 127
>
>Stop in /usr/src/include.
>*** Error code 1
>
>Anyone having the same issue?
>
>Cheers,

Forgot to "adjkerntz -i"?

Barbara



Re: make installworld fails (touch not found)

2011-09-14 Thread David Demelier

On 14/09/2011 19:10, Boris Kochergin wrote:
> On 09/14/11 12:37, Kevin Oberman wrote:
> > On Wed, Sep 14, 2011 at 9:09 AM, David Demelier wrote:
> > > Hi,
> > >
> > > Today I pulled up the last changes for RELENG_8 branch and I got an
> > > error when doing the make installworld target :
> > >
> > > ===> gnu/usr.bin/texinfo/doc (install)
> > > install-info --quiet --defsection=Miscellaneous --defentry= info.info
> > > /usr/share/info/dir
> > > install-info --quiet --defsection=Miscellaneous --defentry= info-stnd.info
> > > /usr/share/info/dir
> > > install-info --quiet --defsection=Miscellaneous --defentry= texinfo.info
> > > /usr/share/info/dir
> > > install -o root -g wheel -m 444 info.info.gz info-stnd.info.gz
> > > texinfo.info.gz /usr/share/info
> > > ===> include (install)
> > > creating osreldate.h from newvers.sh
> > > touch: not found
> > > *** Error code 127
> > >
> > > Stop in /usr/src/include.
> > > *** Error code 1
> > >
> > > Anyone having the same issue?
> >
> > Check the setting of the date/time on the system.
>
> I've also seen this happen when my /usr/src was out of sync with /usr/obj.
>
> -Boris


Thanks, that was the problem!

--
David Demelier


Re: csup crash

2011-09-14 Thread Sergey Matveychuk

13.09.2011 03:29, Dmitry Morozovsky wrote:
> On Mon, 12 Sep 2011, John Baldwin wrote:
> > On Monday, September 12, 2011 2:01:16 am Dmitry Morozovsky wrote:
> > > Dear colleagues,
> > >
> > > I found csup is crashing on my building machine on some broken repo element,
> > > but cannot quickly work out what the source of the problem is
> >
> > What OS version are you running?  I wonder if you have this fix:
>
> it is on rather fresh stable/8:
>
> FreeBSD beaver.rinet.ru 8.2-STABLE FreeBSD 8.2-STABLE #5 r224908M: Tue Aug 16 15:43:23 MSD 2011 ma...@beaver.rinet.ru:/usr/obj/usr/src/sys/BEAVER  amd64
>
> so I suppose I'd have the fix you'd mentioned.
>
> I still have the broken repo as a ZFS snapshot, so I'm ready to provide more
> info or test any fix you provide.

Have you found a broken file? What is its content?


Re: System freeze: Adaptec (aac) timeouts (releng 8)

2011-09-14 Thread Adrian Chadd
On 15 September 2011 00:38, Elliot Finley  wrote:
> I was having the exact same problem using an Adaptec 52445.  After
> downloading and using the latest driver from the adaptec website, the
> problems stopped.  I haven't had a single freeze since using the new
> code.  The newest driver from the website has source code with it, so
> it shouldn't be that big of a deal to incorporate it into the base
> system.    I emailed the authors of the aac driver (Mike Smith and
> Scott Long), but they have both retired.  So I'm not really sure how
> to get this code into the base.  If anyone knows, please take up the
> charge.

File a PR and hound people on freebsd-current until it gets done?


Adrian


Re: System freeze: Adaptec (aac) timeouts (releng 8)

2011-09-14 Thread Jeremy Chadwick
On Thu, Sep 15, 2011 at 09:36:43AM +0800, Adrian Chadd wrote:
> On 15 September 2011 00:38, Elliot Finley  wrote:
> > I was having the exact same problem using an Adaptec 52445.  After
> > downloading and using the latest driver from the adaptec website, the
> > problems stopped.  I haven't had a single freeze since using the new
> > code.  The newest driver from the website has source code with it, so
> > it shouldn't be that big of a deal to incorporate it into the base
> > system.  I emailed the authors of the aac driver (Mike Smith and
> > Scott Long), but they have both retired.  So I'm not really sure how
> > to get this code into the base.  If anyone knows, please take up the
> > charge.
> 
> File a PR and hound people on freebsd-current until it gets done?

...which will either be ignored given that (TMK) nobody is maintaining
the Adaptec drivers, or will be addressed in HEAD which won't help the
OP who runs RELENG_8 until an MFC happens -- and if it happens (I forget
how MFC approvals work).  :-)

As for the lack of aac(4) maintainer, I'm not sure how this should be
addressed in the aac(4) man page.  AUTHORS tends to indicate the names
of the people who created or were involved in creating/maintaining said
driver, which is sometimes (but on FreeBSD hardly always) the
individual(s) who currently support it.  In the case that there is a
different maintainer, how does this get denoted in the man page?

I'm actually very surprised to hear there's an official FreeBSD driver
on Adaptec's site that's actually intended for FreeBSD 8.x.  Last I knew
they had basically blown off FreeBSD support.  I wonder who at Adaptec
is responsible for the FreeBSD driver?  It would be good to know to
involve them in all communiqués.

-- 
| Jeremy Chadwick                                jdc at parodius.com |
| Parodius Networking                       http://www.parodius.com/ |
| UNIX Systems Administrator                   Mountain View, CA, US |
| Making life hard for others since 1977.               PGP 4BD6C0CB |



Re: System freeze: Adaptec (aac) timeouts (releng 8)

2011-09-14 Thread Dennis Koegel
On Wed, Sep 14, 2011 at 07:26:20PM -0700, Jeremy Chadwick wrote:
> I'm actually very surprised to hear there's an official FreeBSD driver
> on Adaptec's site that's actually intended for FreeBSD 8.x.

As far as I can tell from the source, it's the very same driver (same
source code and copyright notices), only that Adaptec has taken over
development; fbsd has 2.1, Adaptec has continued development to a
version 2.4.

I'm not aware how licensing issues play in here, but apart from that, it
should be easy to patch this into base. (I was already half-way there
yesterday and I think I could work up a patch against HEAD and 8.x).

- D.