date:20100618

Re: qbittorrent 2.2.9 8.0-STABLE Amd64

2010-06-18 Thread Doug Barton


On 06/18/10 10:26, Andriy Gapon wrote:

on 18/06/2010 18:51 Жиндарев Алексей said the following:

Jun 18 19:33:54  last message repeated 371 times
Jun 18 19:41:31  last message repeated 1359 times
Jun 18 19:43:29  kernel: WARNING pid 31369 (qbittorrent): ioctl sign-extension 
ioctl 8004667e
Jun 18 19:44:00  last message repeated 545 times
Jun 18 19:45:45  last message repeated 1751 times
Jun 18 19:45:46  kernel: WARNING pid 31369 (qbittorrent): ioctl sign-extension 
ioctl 8004667e
Jun 18 19:46:17  last message repeated 481 times

Manifested after the new port, possibly after updating QT


This is FIONBIO ioctl.  Look through the code where this is passed via a 
variable
of incorrect type.  Correct type for ioctl request should be unsigned long.


I can't find any references to FIONBIO at all, or even the word ioctl. 
The software in question is a bittorrent client, I can't see any reason 
it would even use ioctl's directly. I also checked the 
libtorrent-rasterbar sources (which qbittorrent uses) and there is no 
FIONBIO there either.


Thanks for your help Andriy, but I'm more confused than ever. :)


Doug

--

... and that's just a little bit of history repeating.
-- Propellerheads

Improve the effectiveness of your Internet presence with
a domain name makeover!http://SupersetSolutions.com/

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: kernel panic on 8.1-rc1 amd64

2010-06-18 Thread Nenhum_de_Nos

On Fri, 18 Jun 2010 15:10:34 -0500 (CDT)
"Sean C. Farley"  wrote:

> On Fri, 18 Jun 2010, Nenhum_de_Nos wrote:
> 
> > On Thu, June 17, 2010 23:49, Alexandre \"Sunny\" Kovalenko wrote:
> >> Not much -- this is just panic I had in the same timeframe and solved 
> >> it by disabling and, consequently rebuilding, Virtual Box kernel 
> >> modules. I assume you either not loading or did disable the remaining 
> >> two: vboxnetflt.ko and vboxnetadp.ko.
> >>
> >> I guess any third-party modules (webcam, qemu, etc.) would be good 
> >> candidates for being disabled -- it seams like there were changes in 
> >> the kernel which required rebuild of them.
> >>
> >> Outside of that -- sorry, out of ideas.
> >
> > thanks for the tip, for the record it was a module (or two), as I 
> > tried disabling both nvidia and vboxdrv.
> >
> > I'll try to update both now,
> 
> For the record, it has been reported on emulation@ that the nvidia and 
> vbox drivers do not get along with each other.  I occasionally get 
> lockups while there is network traffic between my host and 
> (console-only) guest both running FreeBSD 8 amd64.
> 
> Unfortunately, I do not know which module is at fault, and I have been 
> too busy to try harder to find the culprit.

I read about that also, but they were ok to that moment. I'll rebuild the 
nvidia driver and try again.

thanks,

matheus

-- 
We will call you cygnus,
The God of balance you shall be

A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?

http://en.wikipedia.org/wiki/Posting_style
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: 8.1 speed issues

2010-06-18 Thread Doug Barton


On 06/18/10 14:48, William D. Colburn (Schlake) wrote:

So I've just upgraded from whatever was stable in 2004 to 8.1 (it's a
private file server in my house, I pay no attention to it until it
crashes), and uh, the speed difference is very noticeable.  In short,
it's like I bought a brand new machine that eleven times as fast when
all I did was upgrade the OS.  How on earth did you guys make such a
dramatic improvement?


Only 11? We must be slipping ...


Doug (Thanks for the kind words, they really are appreciated)

--

... and that's just a little bit of history repeating.
-- Propellerheads

Improve the effectiveness of your Internet presence with
a domain name makeover!http://SupersetSolutions.com/

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: FreeBSD 8.1-RC1 Available...

2010-06-18 Thread Doug Barton


On 06/18/10 15:27, Chip Camden wrote:

Thanks for the clarification.  I want to stay stable (though my wife
would contest that I've ever been).


There's only so much an operating system, ANY operating system can do ...


Doug (you're welcome) :)

--

... and that's just a little bit of history repeating.
-- Propellerheads

Improve the effectiveness of your Internet presence with
a domain name makeover!http://SupersetSolutions.com/

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: 8.1 speed issues

2010-06-18 Thread Robert Watson



On Fri, 18 Jun 2010, William D. Colburn (Schlake) wrote:

So I've just upgraded from whatever was stable in 2004 to 8.1 (it's a 
private file server in my house, I pay no attention to it until it crashes), 
and uh, the speed difference is very noticeable.  In short, it's like I 
bought a brand new machine that eleven times as fast when all I did was 
upgrade the OS.  How on earth did you guys make such a dramatic improvement?


Well, I guess the good news is that we did a lot in six years.  The bad news 
is that figuring out which changes helped your specific configuration and 
workload would be quite tricky.  Certainly, our work on SMP optimization for 
scheduling, network stack, threading, storage, etc, has been a big deal.  Thre 
are also countless protocol improvements in TCP/IP, significant changes in 
file system code, ... that could well be involved.


Needless to say, it's satisfying to hear that it worked out -- I'd like to 
think we have even further good stuff coming in 9!


Robert
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: 8.1 speed issues

2010-06-18 Thread Brandon Gooch

On Fri, Jun 18, 2010 at 4:48 PM, William D. Colburn (Schlake)
 wrote:
> So I've just upgraded from whatever was stable in 2004 to 8.1 (it's a
> private file server in my house, I pay no attention to it until it
> crashes), and uh, the speed difference is very noticeable.  In short,
> it's like I bought a brand new machine that eleven times as fast when
> all I did was upgrade the OS.  How on earth did you guys make such a
> dramatic improvement?
>

BSD = Blazing Speed, Dude!

Oh, and it's Free.

Seriously though, if you're curious, the source code commit logs are
always enlightening ;)

-Brandon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: FreeBSD 8.1-RC1 Available...

2010-06-18 Thread Chip Camden

On Jun 18 2010 12:47, Doug Barton wrote:
> On 06/18/10 10:01, Chip Camden wrote:
> >On Jun 17 2010 23:06, Ken Smith wrote:
> >>
> >[snip]
> >>
> >>If you are updating an already running machine the CVS branch
> >>tag is RELENG_8_1, or if you prefer SVN use "releng/8.1".
> >>
> >[snap]
> >
> >Forgive me if the answer to this question should be obvious.  I'm on
> >stable/8, so if I understand correctly all I should need to do is change
> >my supfile CVS tag to RELENG_8_1, csup, and rebuild?
> 
> If you're already on -stable you don't have to do anything, unless you 
> want to stay at what will eventually become 8.1-RELEASE.
> 
> 
> hth,
> 
> Doug
> 
> -- 
> 
>   ... and that's just a little bit of history repeating.
>   -- Propellerheads
> 
>   Improve the effectiveness of your Internet presence with
>   a domain name makeover!http://SupersetSolutions.com/
> 
> ___
> freebsd-stable@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-stable
> To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Thanks for the clarification.  I want to stay stable (though my wife
would contest that I've ever been).

-- 
Sterling (Chip) Camden
http://camdensoftware.com | http://chipstips.com | http://chipsquips.com
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

8.1 speed issues

2010-06-18 Thread William D. Colburn (Schlake)

So I've just upgraded from whatever was stable in 2004 to 8.1 (it's a
private file server in my house, I pay no attention to it until it
crashes), and uh, the speed difference is very noticeable.  In short,
it's like I bought a brand new machine that eleven times as fast when
all I did was upgrade the OS.  How on earth did you guys make such a
dramatic improvement?

-- 
-- Schlake
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

make installworld breaks at share/syscons/scrnmaps

2010-06-18 Thread Damian Weber



8-STABLE system csup today

FreeBSD xxx.local 8.1-PRERELEASE FreeBSD 8.1-PRERELEASE #0: Fri Jun 18 18:28:03 
CEST 2010 r...@xxx.local:/usr/obj/usr/src/sys/XXX  amd64

make installworld 
breaks at

===> share/syscons/scrnmaps (install)
./armscii8-2haik8.mk armscii8-2haik8.tmp
uuencode armscii8-2haik8.tmp armscii8-2haik8 > armscii8-2haik8.scm
uuencode: Command not found.
*** Error code 127

Stop in /usr/src/share/syscons/scrnmaps.
*** Error code 1
...

I fixed it on that machine by this patch,  


--- share/syscons/scrnmaps/Makefile.orig2010-06-18 22:48:35.0 
+0200
+++ share/syscons/scrnmaps/Makefile 2010-06-18 22:48:47.0 +0200
@@ -16,7 +16,7 @@
 
 ${SCRMAPS}: ${.TARGET:R}.mk
./${.TARGET:R}.mk ${.TARGET:R}.tmp
-   uuencode ${.TARGET:R}.tmp ${.TARGET:R} > ${.TARGET}
+   /usr/bin/uuencode ${.TARGET:R}.tmp ${.TARGET:R} > ${.TARGET}
rm -f ${.TARGET:R}.tmp
 
 ${SCRMAPS_MK}: ${.TARGET:R} mkscrfil.c
 

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 

...but where the /usr/bin-PATH disappeared?

-- Damian


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: kernel panic on 8.1-rc1 amd64

2010-06-18 Thread Sean C. Farley


On Fri, 18 Jun 2010, Nenhum_de_Nos wrote:


On Thu, June 17, 2010 23:49, Alexandre \"Sunny\" Kovalenko wrote:
Not much -- this is just panic I had in the same timeframe and solved 
it by disabling and, consequently rebuilding, Virtual Box kernel 
modules. I assume you either not loading or did disable the remaining 
two: vboxnetflt.ko and vboxnetadp.ko.


I guess any third-party modules (webcam, qemu, etc.) would be good 
candidates for being disabled -- it seams like there were changes in 
the kernel which required rebuild of them.


Outside of that -- sorry, out of ideas.


thanks for the tip, for the record it was a module (or two), as I 
tried disabling both nvidia and vboxdrv.


I'll try to update both now,


For the record, it has been reported on emulation@ that the nvidia and 
vbox drivers do not get along with each other.  I occasionally get 
lockups while there is network traffic between my host and 
(console-only) guest both running FreeBSD 8 amd64.


Unfortunately, I do not know which module is at fault, and I have been 
too busy to try harder to find the culprit.


Sean
--
s...@freebsd.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Fwd: Sysinstall crashes in 8.1-BETA1

2010-06-18 Thread Nicholas Mills

Verified to exist in 8.1-RC1. Submitted as PR 147980.

-- Forwarded message --
From: Nicholas Mills 
Date: Sat, Jun 5, 2010 at 9:01 PM
Subject: Sysinstall crashes in 8.1-BETA1
To: ra...@freebsd.org

Hello,

I found and patched a bug in sysinstall(8). My system is running GENERIC
8.1-BETA1 on i386. Initial install went well, but after the system booted I
wanted to add some other distributions. However, every time I attempted to
connect to an ftp server sysinstall would crash and dump core. I traced the
problem down to some uninitialized variables in tcpip.c:tcpDeviceSelect.
Attached is my patch.

Thanks,

Nick Mills
nlmi...@g.clemson.edu

tcpip.patch
Description: Binary data
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: FreeBSD 8.1-RC1 Available...

2010-06-18 Thread Doug Barton


On 06/18/10 10:01, Chip Camden wrote:

On Jun 17 2010 23:06, Ken Smith wrote:



[snip]


If you are updating an already running machine the CVS branch
tag is RELENG_8_1, or if you prefer SVN use "releng/8.1".


[snap]

Forgive me if the answer to this question should be obvious.  I'm on
stable/8, so if I understand correctly all I should need to do is change
my supfile CVS tag to RELENG_8_1, csup, and rebuild?


If you're already on -stable you don't have to do anything, unless you 
want to stay at what will eventually become 8.1-RELEASE.



hth,

Doug

--

... and that's just a little bit of history repeating.
-- Propellerheads

Improve the effectiveness of your Internet presence with
a domain name makeover!http://SupersetSolutions.com/

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: 7.2-RELEASE-p4, IO errors & RAID1 failure

2010-06-18 Thread Matthew Lear

On Fri, 2010-06-18 at 10:42 -0700, Jeremy Chadwick wrote:
> On Fri, Jun 18, 2010 at 04:47:11PM +0100, Matthew Lear wrote:
> > Hello Jeremy,
> > Thanks very much for the feedback.
> > 
> > [snip]
> > > Could you please provide the full output from "smartctl -a /dev/ad0"
> > > here?  Your drive may be completely fine and you may not have to swap it
> > > at all; hard to say.
> > 
> > Sure. See below:
> > {snip}
> 
> Your SMART statistics look completely OK.  There's nothing there that
> indicates there were any write failures or otherwise.  I'll explain near
> the end of the Email how to test a range of LBAs "just in case".

Good. That's what I thought too :-)

> I'll take a moment to point out that the error previously seen was a
> timeout during a write transaction (WRITE_DMA48).  Recap:
> 
> > > > ad0: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=395032335
> > > > ad0: FAILURE - WRITE_DMA48 status=51 
> > > > error=10 LBA=395032335
> > > > ar0: WARNING - mirror protection lost. RAID1 array in DEGRADED mode
> 
> The status codes shown (status=51 and error=10) are hexadecimal.  I'm
> pointing this out because they aren't preceded by '0x' or '$' and it
> clarifies my next point:
> 
> NID_NOT_FOUND (bit 4 set in the ATA error field) is referred to as IDNF
> per ATA6-ACS specification and onward, so I'll refer to it as that.
> (I've always wondered why FreeBSD calls this NID_NOT_FOUND; IDFN stands
> for ID Not Found, so what's with the extra "N"?  I've always felt this
> is a typo...)
> 
> Using the ATA8-ACS specification working draft (2007/05/21), since it's
> more recent, we see the following:
> 
>   Section 6.2 - Error field
>   Section 6.2.4 - ID Not Found (IDNF) bit
> 
>   Error bit 4. The IDNF bit shall be set to one if a user-accessible
>   address was not found. The IDNF bit shall be set to one if an
>   address outside of the range of user-accessible addresses is
>   requested when command aborted is not returned (see 4.11.3 and
>   6.2.1).
> 
>   Section 4.11 - Host Protected Area (HPA) feature set
>   Section 4.11.3 - 28-bit and 48-bit HPA commands
> 
>   Any read or write command to an address above the maximum address
>   specified by the SET MAX ADDRESS or SET MAX ADDRESS EXT command shall
>   cause command completion with the IDNF bit set to one and ERR set to
>   one, or command aborted.
> 
> There's no definition of what "address" means in 6.2.4, but the most
> logical (pun intended) guess is an LBA.  This error is returned by the
> disk (e.g. not a controller-induced error).  I've mentioned this problem
> in the past:
> 
> http://wiki.freebsd.org/JeremyChadwick/ATA_issues_and_troubleshooting
> 
> I've always read IDNF to mean "OS requested access (read or write) to an
> LBA which is out of bounds", where "out of bounds" means "not between 0
> and ".  How exactly is that possible?  Alexander, do you have
> any familiarity with this error code per ATA spec?
> 
> Matthew, can you provide output from "atacontrol cap ad0"?  Thanks.

Sure thing. See below.
[r...@meshuga /home/matt]# atacontrol cap ad0

Protocol  SATA revision 2.x
device model  WDC WD3200AAKS-00VYA0
serial number WD-WCARW0164427
firmware revision 12.01B02
cylinders 16383
heads 16
sectors/track 63
lba supported 268435455 sectors
lba48 supported   625142448 sectors
dma supported
overlap not supported

Feature  Support  EnableValue   Vendor
write cacheyes  yes
read ahead yes  yes
Native Command Queuing (NCQ)   yes   -  31/0x1F
Tagged Command Queuing (TCQ)   no   no  31/0x1F
SMART  yes  yes
microcode download yes  yes
security   yes  no
power management   yes  yes
advanced power management  no   no  0/0x00
automatic acoustic management  yes  no  254/0xFE128/0x80
[r...@meshuga /home/matt]# 


> 
> Now regarding the LBA tests -- "smartctl -t select,start-end" will do
> the trick.  start should be a starting LBA, end should be an ending LBA.
> The OS claims that LBA 395032335 is what was requested to be accessed
> when the failure happened, so I would recommend picking start/end ranges
> around that area.  Remember that a single sector encapsulates a very
> large number of blocks (especially given sizes of disks today), so it's
> wise to pick a very large range of LBAs.  I would recommend this in your
> case:
> 
> smartctl -t select,39000,41000 /dev/ad0

[r...@meshuga /home/matt]# smartctl -t
select,39000-41000 /dev/ad0
smartctl 5.39.1 2010-01-28 r3054 [FreeBSD 7.2-RELEASE-p4 i386] (local
build)
Copyright (C) 2002-10 by Bruce Allen,
http://smartmontools.sourceforge.net

=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART Selective self-test routine immediately
in off-line mode".
SPAN STARTIN

Re: kernel panic on 8.1-rc1 amd64

2010-06-18 Thread Nenhum_de_Nos


On Thu, June 17, 2010 23:49, Alexandre \"Sunny\" Kovalenko wrote:
> Not much -- this is just panic I had in the same timeframe and solved it
> by disabling and, consequently rebuilding, Virtual Box kernel modules. I
> assume you either not loading or did disable the remaining two:
> vboxnetflt.ko and vboxnetadp.ko.
>
> I guess any third-party modules (webcam, qemu, etc.) would be good
> candidates for being disabled -- it seams like there were changes in the
> kernel which required rebuild of them.
>
> Outside of that -- sorry, out of ideas.

thanks for the tip, for the record it was a module (or two), as I tried
disabling both nvidia and vboxdrv.

I'll try to update both now,

thanks,

matheus

-- 
We will call you cygnus,
The God of balance you shall be

A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?

http://en.wikipedia.org/wiki/Posting_style
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: FreeBSD 8.1-RC1 Available...

2010-06-18 Thread Chip Camden

On Jun 17 2010 23:06, Ken Smith wrote:
> 
[snip]
> 
> If you are updating an already running machine the CVS branch
> tag is RELENG_8_1, or if you prefer SVN use "releng/8.1".
> 
[snap]

Forgive me if the answer to this question should be obvious.  I'm on
stable/8, so if I understand correctly all I should need to do is change
my supfile CVS tag to RELENG_8_1, csup, and rebuild?

-- 
Sterling (Chip) Camden
http://camdensoftware.com | http://chipstips.com | http://chipsquips.com
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: [Stable 7] CPIO breakage/

2010-06-18 Thread Xin LI

-BEGIN PGP SIGNED MESSAGE-
Hash: SHA256

On 2010/06/18 10:51, Sean Bruno wrote:
> On Thu, 2010-06-17 at 15:13 -0700, Xin LI wrote:
>> -BEGIN PGP SIGNED MESSAGE-
>> Hash: SHA256
>>
>> On 2010/06/17 13:53, Peter Jeremy wrote:
>>> On 2010-Jun-15 17:22:50 -0700, Xin LI  wrote:
 On 2010/06/15 17:05, Sean Bruno wrote:
> A little more background.  It looks like symlinks are getting stripped
> of their '/' which sucks.  Ideas?
>>> ...
> e.g. /home/foo/bar -> /opt/baz/blob
>
> becomes
>
> home/foo/bar -> opt/baz/blob   
>
> Yuck.

 This is a security measurement I think.
>>>
>>> Can someone please explain how stripping a leading '/' off the
>>> destination of a symlink enhances security?  The destination is
>>> not being written to.
>>>
 --absolute-filenames disables this behavior.
>>>
>>> This definitely reduces security and would seem to be far more
>>> dangerous than being able to create symlinks to absolute pathnames.
>>
>> Sorry I have misunderstood the original issue.  It's the link target
>> being mangled and doesn't seem right to me.  I'll ask the author about this.
>>
>> The attached patch should restore the old behavior.
>>
>> Cheers,
>> - -- 
>> Xin LI  http://www.delphij.net/
>> FreeBSD - The Power to Serve!   Live free or die
> 
> Yep, *this* patches seems to make things much happier.  I'll integrate
> cpio 2.8 back into the Yahoo tree when this is merged in.  

Thanks for testing, I have committed the patch as r209311 and sorry for
the breakage.

Cheers,
- -- 
Xin LI http://www.delphij.net/
FreeBSD - The Power to Serve!  Live free or die
-BEGIN PGP SIGNATURE-
Version: GnuPG v2.0.14 (FreeBSD)

iQEcBAEBCAAGBQJMG7lxAAoJEATO+BI/yjfBBhEIAJRbR1ZniY+JQ4Ak+He+FWKw
jRXb/lTC1PfCDIi5Vm+j0NGAZP2hNBzt9k7uouDyguXcHKvPNXKFhjvaJtdDK40Y
e2Pr2PNeXzwBGaL27eDPdjt2gxZ16GbzQe47d2jyT3nQRUYBGehJcLzJl7chrLZn
0PJmztmZt8Uc6oeQo427PzhgqcCFG5Edrc7dtiFZ1rvdaXGXd64mu30oArLV3MCA
c//wgx+qK2wQ1AGeylZGOqbJmtgdxY3+g961a/G9c0Y/Bt+muWBY5xDK1LpA7IcN
/s8l39g6TMzjp4YYlRUG5flhv3xtFACZxxAnkDYA+02cR2euNpt1irjdxj7jwvI=
=V3yO
-END PGP SIGNATURE-
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: em(4) duplex problems with 82541EI on RELENG_8, -CURRENT on PowerEdge 1850

2010-06-18 Thread Jack Vogel

Yes, the commits today are slated to get into 8.1, at least that's my
understanding.

Jack


On Fri, Jun 18, 2010 at 10:46 AM, Brandon Gooch  wrote:

> On Tue, Jun 1, 2010 at 2:37 PM, Jeremy Chadwick
>  wrote:
> > On Tue, Jun 01, 2010 at 03:18:39PM -0400, Brian A. Seklecki wrote:
> >> = Re-posted from freebsd-hardware@, since this is more of a bug
> >>  report than a hardware comparability inquiry / buying strategy
> >>  discussion. ==
> >>
> >> All:
> >>
> >>   Has anyone upgraded their PowerEdge 1850s to 8.0-PL or
> >>   RELENG_8 -stable?  We're seeing problems where 7.2-PL and
> >>   6.3-PL were not affected on the same hardware.
> >>
> >>   The problem is that forcing the duplex 100/full on both
> >>   sides no longer functions.
> >>
> >>   Configuration:
> >>
> >>- A variety of Cisco L2/L3 switches over the last decade:
> >>-- 2848G-L3
> >>-- 2950
> >>-- 2960s
> >>-- 3550-12Ts
> >>-- 3550XLs
> >>-- Duplex forced 100/full on Cisco side
> >>- FreeBSD/amd64 RELENG_8 or 9-CURRENT with duplex
> >>  forced '100baseTX mediaopt full-duplex',
> >>- This configuration has worked since FreeBSD 5.4
> >>
> >>   When connected to PowerEdge 1850r1/r2, with the onboard Intel
> >>   82541EI, the parenthesis show an actual media speed/duplex of:
> >>
> >>   media: Ethernet 100baseTX   (100baseTX )
> >>
> >>   The same configuration using a Dell-sold Intel dual port
> >>   82546EB, in the same system, on the same switch, works fine.
> >>
> >>
> >> -
> >> ifconfig(8):
> >> -
> >> em3: flags=8843 >>  MULTICAST> metric 0 mtu 1500
> >> options=9b
> >> ether 00:13:72:4f:70:81
> >> inet 192.168.97.20 netmask 0xff80 broadcast 192.168.97.127
> >> media: Ethernet 100baseTX  (100baseTX )
> >> status: active
> >> -
> >> em0: flags=8843 >>  MULTICAST> metric 0 mtu 1500>
> >> options=9b
> >> ether 00:04:23:c8:fe:ac
> >> media: Ethernet 100baseTX 
> >> status: active
> >> -
> >> -
> >> pciconf(8):
> >> -
> >> e...@pci0:7:8:0:   class=0x02 card=0x016d1028 chip=0x10768086
> >> rev=0x05 hdr=0x00
> >> vendor = 'Intel Corporation'
> >> device = 'Gigabit Ethernet Controller (82541EI)'
> >> class  = network
> >> subclass   = ethernet
> >> e...@pci0:3:11:0:  class=0x02 card=0x10128086 chip=0x10108086
> rev=0x01
> >> hdr=0x00
> >>vendor = 'Intel Corporation'
> >>device  = 'Dual Port Gigabit Ethernet Controller (Copper) (82546EB)'
> >>class  = network
> >>subclass   = ethernet
> >>
> >> -
> >>
> >> rc.conf(5) for shits & giggles:
> >>
> >> ifconfig_em0="inet X netmask Y media 100baseTX mediaopt full-duplex"
> >> ifconfig_em3="inet Z netmask F media 100baseTX mediaopt full-duplex"
> >>
> >> 
> >>
> >> Example IOS switch config:
> >>  interface FastEthernet0/39
> >>   description I hate Dell
> >>   switchport access vlan 100
> >>   switchport mode access
> >>   speed 100
> >>   duplex full
> >>   spanning-tree portfast
> >>  end
> >> 
> >>
> >> I've been clearing interface counters on the switch side, but I'll send
> >> 'netstat -i', 'show interface counters', and 'sudo sysctl -w
> >> dev.em.3.stats=1' ASAP to illustrate connectivity errors soon.
> >>
> >> Are we being punished for patronizing Dell?
> >>
> >> Is it possible that ifconfig(8) output has simply changed?  Are the
> >> values in the parenthesis on the right the Ethernet auto-sense desired
> >> values where as outside the parenthesis the current active values?
> >>
> >> In 6.3/7.2, once you forced a speed/duplex, the values in parenthesis
> >> went away entirely.
> >>
> >> The only way I've been able to make that happen is to #define in
> >> src/sys/dev/e1000/if_em.h:
> >>
> >>   #define DO_AUTO_NEG 0
> >>   /*
> >>* This parameter control whether or not the driver will wait for
> >>* autonegotiation to complete.
> >>* 1 - Wait for autonegotiation to complete
> >>* 0 - Don't wait for autonegotiation to complete
> >>   */
> >>
> >> Also seems odd that some ICs are affected but not others.
> >>
> >> Its also possible that my problems are pf(4) + setfib(8) related and I
> >> that this is a separate issue.
> >>
> >> Two new notes since the original post:
> >>
> >>  - I have confirmed this problem on two revisions of the Dell
> >>8th gen hardware in two different datacenters
> >>  - The problem persists on -CURRENT from 05/2010
> >>  - RELENG_7 does not seem to be impacted
> >>  - More stats below.
> >>
> >>
> >> Thanks,
> >> ~BAS
> >>
> >> ---
> >>
> >>
> >>
> >> em1: link state changed to DOWN
> >> em1: link state changed to UP
> >> em1: link state changed to DOWN
> >> em1: link state changed to UP
> >> em1: link state changed to DOWN
> >> em1: link state changed to UP
> >> em1: link state changed to DOWN
> >> em1: link state changed to UP
> >> em1: link state changed t

Re: [Stable 7] CPIO breakage/

2010-06-18 Thread Sean Bruno

On Thu, 2010-06-17 at 15:13 -0700, Xin LI wrote:
> -BEGIN PGP SIGNED MESSAGE-
> Hash: SHA256
> 
> On 2010/06/17 13:53, Peter Jeremy wrote:
> > On 2010-Jun-15 17:22:50 -0700, Xin LI  wrote:
> >> On 2010/06/15 17:05, Sean Bruno wrote:
> >>> A little more background.  It looks like symlinks are getting stripped
> >>> of their '/' which sucks.  Ideas?
> > ...
> >>> e.g. /home/foo/bar -> /opt/baz/blob
> >>>
> >>> becomes
> >>>
> >>> home/foo/bar -> opt/baz/blob   
> >>>
> >>> Yuck.
> >>
> >> This is a security measurement I think.
> > 
> > Can someone please explain how stripping a leading '/' off the
> > destination of a symlink enhances security?  The destination is
> > not being written to.
> > 
> >> --absolute-filenames disables this behavior.
> > 
> > This definitely reduces security and would seem to be far more
> > dangerous than being able to create symlinks to absolute pathnames.
> 
> Sorry I have misunderstood the original issue.  It's the link target
> being mangled and doesn't seem right to me.  I'll ask the author about this.
> 
> The attached patch should restore the old behavior.
> 
> Cheers,
> - -- 
> Xin LI   http://www.delphij.net/
> FreeBSD - The Power to Serve!Live free or die



Yep, *this* patches seems to make things much happier.  I'll integrate
cpio 2.8 back into the Yahoo tree when this is merged in.  

Thanks for your patience and work on -stable.

Sean

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: em(4) duplex problems with 82541EI on RELENG_8, -CURRENT on PowerEdge 1850

2010-06-18 Thread Brandon Gooch

On Tue, Jun 1, 2010 at 2:37 PM, Jeremy Chadwick
 wrote:
> On Tue, Jun 01, 2010 at 03:18:39PM -0400, Brian A. Seklecki wrote:
>> = Re-posted from freebsd-hardware@, since this is more of a bug
>>  report than a hardware comparability inquiry / buying strategy
>>  discussion. ==
>>
>> All:
>>
>>   Has anyone upgraded their PowerEdge 1850s to 8.0-PL or
>>   RELENG_8 -stable?  We're seeing problems where 7.2-PL and
>>   6.3-PL were not affected on the same hardware.
>>
>>   The problem is that forcing the duplex 100/full on both
>>   sides no longer functions.
>>
>>   Configuration:
>>
>>    - A variety of Cisco L2/L3 switches over the last decade:
>>    -- 2848G-L3
>>    -- 2950
>>    -- 2960s
>>    -- 3550-12Ts
>>    -- 3550XLs
>>    -- Duplex forced 100/full on Cisco side
>>    - FreeBSD/amd64 RELENG_8 or 9-CURRENT with duplex
>>      forced '100baseTX mediaopt full-duplex',
>>    - This configuration has worked since FreeBSD 5.4
>>
>>   When connected to PowerEdge 1850r1/r2, with the onboard Intel
>>   82541EI, the parenthesis show an actual media speed/duplex of:
>>
>>   media: Ethernet 100baseTX   (100baseTX )
>>
>>   The same configuration using a Dell-sold Intel dual port
>>   82546EB, in the same system, on the same switch, works fine.
>>
>>
>> -
>> ifconfig(8):
>> -
>> em3: flags=8843>                  MULTICAST> metric 0 mtu 1500
>> options=9b
>> ether 00:13:72:4f:70:81
>> inet 192.168.97.20 netmask 0xff80 broadcast 192.168.97.127
>> media: Ethernet 100baseTX  (100baseTX )
>> status: active
>> -
>> em0: flags=8843>                  MULTICAST> metric 0 mtu 1500>
>> options=9b
>> ether 00:04:23:c8:fe:ac
>> media: Ethernet 100baseTX 
>> status: active
>> -
>> -
>> pciconf(8):
>> -
>> e...@pci0:7:8:0:       class=0x02 card=0x016d1028 chip=0x10768086
>>                 rev=0x05 hdr=0x00
>>     vendor     = 'Intel Corporation'
>>     device     = 'Gigabit Ethernet Controller (82541EI)'
>>     class      = network
>>     subclass   = ethernet
>> e...@pci0:3:11:0:      class=0x02 card=0x10128086 chip=0x10108086 
>> rev=0x01
>> hdr=0x00
>>    vendor     = 'Intel Corporation'
>>    device  = 'Dual Port Gigabit Ethernet Controller (Copper) (82546EB)'
>>    class      = network
>>    subclass   = ethernet
>>
>> -
>>
>> rc.conf(5) for shits & giggles:
>>
>> ifconfig_em0="inet X netmask Y media 100baseTX mediaopt full-duplex"
>> ifconfig_em3="inet Z netmask F media 100baseTX mediaopt full-duplex"
>>
>> 
>>
>> Example IOS switch config:
>>  interface FastEthernet0/39
>>   description I hate Dell
>>   switchport access vlan 100
>>   switchport mode access
>>   speed 100
>>   duplex full
>>   spanning-tree portfast
>>  end
>> 
>>
>> I've been clearing interface counters on the switch side, but I'll send
>> 'netstat -i', 'show interface counters', and 'sudo sysctl -w
>> dev.em.3.stats=1' ASAP to illustrate connectivity errors soon.
>>
>> Are we being punished for patronizing Dell?
>>
>> Is it possible that ifconfig(8) output has simply changed?  Are the
>> values in the parenthesis on the right the Ethernet auto-sense desired
>> values where as outside the parenthesis the current active values?
>>
>> In 6.3/7.2, once you forced a speed/duplex, the values in parenthesis
>> went away entirely.
>>
>> The only way I've been able to make that happen is to #define in
>> src/sys/dev/e1000/if_em.h:
>>
>>   #define DO_AUTO_NEG 0
>>   /*
>>    * This parameter control whether or not the driver will wait for
>>    * autonegotiation to complete.
>>    * 1 - Wait for autonegotiation to complete
>>    * 0 - Don't wait for autonegotiation to complete
>>   */
>>
>> Also seems odd that some ICs are affected but not others.
>>
>> Its also possible that my problems are pf(4) + setfib(8) related and I
>> that this is a separate issue.
>>
>> Two new notes since the original post:
>>
>>  - I have confirmed this problem on two revisions of the Dell
>>    8th gen hardware in two different datacenters
>>  - The problem persists on -CURRENT from 05/2010
>>  - RELENG_7 does not seem to be impacted
>>  - More stats below.
>>
>>
>> Thanks,
>>     ~BAS
>>
>> ---
>>
>>
>>
>> em1: link state changed to DOWN
>> em1: link state changed to UP
>> em1: link state changed to DOWN
>> em1: link state changed to UP
>> em1: link state changed to DOWN
>> em1: link state changed to UP
>> em1: link state changed to DOWN
>> em1: link state changed to UP
>> em1: link state changed to DOWN
>> em1: link state changed to UP
>> em1: link state changed to DOWN
>>
>> em0: Excessive collisions = 0
>> em0: Sequence errors = 0
>> em0: Defer count = 0
>> em0: Missed Packets = 0
>> em0: Receive No Buffers = 0
>> em0: Receive Length Errors = 0
>> em0: Receive errors = 0
>> em0: Crc errors = 0
>> em0: Alignment errors = 0
>> em0: Collision/Carrier extension errors = 0
>> em0: RX overruns = 0
>> em0: watchdog timeouts = 0
>> em

Re: 7.2-RELEASE-p4, IO errors & RAID1 failure

2010-06-18 Thread Jeremy Chadwick

On Fri, Jun 18, 2010 at 04:47:11PM +0100, Matthew Lear wrote:
> Hello Jeremy,
> Thanks very much for the feedback.
> 
> [snip]
> > Could you please provide the full output from "smartctl -a /dev/ad0"
> > here?  Your drive may be completely fine and you may not have to swap it
> > at all; hard to say.
> 
> Sure. See below:
> {snip}

Your SMART statistics look completely OK.  There's nothing there that
indicates there were any write failures or otherwise.  I'll explain near
the end of the Email how to test a range of LBAs "just in case".

I'll take a moment to point out that the error previously seen was a
timeout during a write transaction (WRITE_DMA48).  Recap:

> > > ad0: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=395032335
> > > ad0: FAILURE - WRITE_DMA48 status=51 
> > > error=10 LBA=395032335
> > > ar0: WARNING - mirror protection lost. RAID1 array in DEGRADED mode

The status codes shown (status=51 and error=10) are hexadecimal.  I'm
pointing this out because they aren't preceded by '0x' or '$' and it
clarifies my next point:

NID_NOT_FOUND (bit 4 set in the ATA error field) is referred to as IDNF
per ATA6-ACS specification and onward, so I'll refer to it as that.
(I've always wondered why FreeBSD calls this NID_NOT_FOUND; IDFN stands
for ID Not Found, so what's with the extra "N"?  I've always felt this
is a typo...)

Using the ATA8-ACS specification working draft (2007/05/21), since it's
more recent, we see the following:

  Section 6.2 - Error field
  Section 6.2.4 - ID Not Found (IDNF) bit

  Error bit 4. The IDNF bit shall be set to one if a user-accessible
  address was not found. The IDNF bit shall be set to one if an
  address outside of the range of user-accessible addresses is
  requested when command aborted is not returned (see 4.11.3 and
  6.2.1).

  Section 4.11 - Host Protected Area (HPA) feature set
  Section 4.11.3 - 28-bit and 48-bit HPA commands

  Any read or write command to an address above the maximum address
  specified by the SET MAX ADDRESS or SET MAX ADDRESS EXT command shall
  cause command completion with the IDNF bit set to one and ERR set to
  one, or command aborted.

There's no definition of what "address" means in 6.2.4, but the most
logical (pun intended) guess is an LBA.  This error is returned by the
disk (e.g. not a controller-induced error).  I've mentioned this problem
in the past:

http://wiki.freebsd.org/JeremyChadwick/ATA_issues_and_troubleshooting

I've always read IDNF to mean "OS requested access (read or write) to an
LBA which is out of bounds", where "out of bounds" means "not between 0
and ".  How exactly is that possible?  Alexander, do you have
any familiarity with this error code per ATA spec?

Matthew, can you provide output from "atacontrol cap ad0"?  Thanks.

Now regarding the LBA tests -- "smartctl -t select,start-end" will do
the trick.  start should be a starting LBA, end should be an ending LBA.
The OS claims that LBA 395032335 is what was requested to be accessed
when the failure happened, so I would recommend picking start/end ranges
around that area.  Remember that a single sector encapsulates a very
large number of blocks (especially given sizes of disks today), so it's
wise to pick a very large range of LBAs.  I would recommend this in your
case:

smartctl -t select,39000,41000 /dev/ad0

I would highly recommend doing this with the disk not doing any I/O,
though it won't hurt it (it'll just delay the scan).  "smartctl -a" will
show the state of things in the "SMART Selective self-test log" at the
bottom, or somewhere else within the output (depends on the drive).

This should, in my opinion, rule out whether or not there's a bad block
or something along those lines within said range.  Given what I believe
IDNF represents, I would say your scan will probably come back clean.
Also remember that the scan performed here is a *disk-level scan*; the
disk firmware itself is doing it (the OS isn't involved).  This helps
rule out any sort of "weird" issues that the OS may be reporting ("hey
man, LBA 8943943983492893428932489324 is bad!"  "Yeah sure it is").

> The two devices in the array are on channels 0 and 1. There is indeed a
> second drive on channel 0 (160G). As I said above, I use that as an
> additional back up device but it's not part of the array.

Okay, so executing "atacontrol detach ata0" will cause you to lose both
ad0 and ad1.  If you can live with that, then cool.

> > What motherboard is this?  Can you change the setting to either
> > "Native", "Enhanced", or (even better) "AHCI"?  I've seen some systems
> > where the Serial ATA option in the BIOS has an "Auto" option, which does
> > totally bizarre things at times.
> 
> I think this has been covered in subsequent postings. I could try it but
> as you say below, I'd like to resolve the disk issue first.
> ...
> > The atacontrol man page covers your situation:
> > ...
> I don't think this is the case for me since ad0 and ad2 are on seperate
> ata channels.
> ...
>

Re: qbittorrent 2.2.9 8.0-STABLE Amd64

2010-06-18 Thread Andriy Gapon

on 18/06/2010 18:51 Жиндарев Алексей said the following:
> Jun 18 19:33:54  last message repeated 371 times
> Jun 18 19:41:31  last message repeated 1359 times
> Jun 18 19:43:29  kernel: WARNING pid 31369 (qbittorrent): ioctl 
> sign-extension ioctl 8004667e
> Jun 18 19:44:00  last message repeated 545 times
> Jun 18 19:45:45  last message repeated 1751 times
> Jun 18 19:45:46  kernel: WARNING pid 31369 (qbittorrent): ioctl 
> sign-extension ioctl 8004667e
> Jun 18 19:46:17  last message repeated 481 times
> 
> Manifested after the new port, possibly after updating QT

This is FIONBIO ioctl.  Look through the code where this is passed via a 
variable
of incorrect type.  Correct type for ioctl request should be unsigned long.

-- 
Andriy Gapon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: 7.2-RELEASE-p4, IO errors & RAID1 failure

2010-06-18 Thread Matthew Lear

Hello Jeremy,
Thanks very much for the feedback.

[snip]
> Could you please provide the full output from "smartctl -a /dev/ad0"
> here?  Your drive may be completely fine and you may not have to swap it
> at all; hard to say.

Sure. See below:

smartctl 5.39.1 2010-01-28 r3054 [FreeBSD 7.2-RELEASE-p4 i386] (local
build)
Copyright (C) 2002-10 by Bruce Allen,
http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family: Western Digital Caviar Blue Serial ATA family
Device Model: WDC WD3200AAKS-00VYA0
Serial Number:WD-WCARW0164427
Firmware Version: 12.01B02
User Capacity:320,072,933,376 bytes
Device is:In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:Fri Jun 18 16:27:54 2010 BST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status:  (   0) The previous self-test routine
completed
without error or no self-test has ever 
been run.
Total time to complete Offline 
data collection: (8400) seconds.
Offline data collection
capabilities:(0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off 
support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities:(0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability:(0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine 
recommended polling time:(   2) minutes.
Extended self-test routine
recommended polling time:( 100) minutes.
Conveyance self-test routine
recommended polling time:(   5) minutes.
SCT capabilities:  (0x303f) SCT Status supported.
SCT Feature Control supported.
SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG VALUE WORST THRESH TYPE
UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate 0x000f   200   200   051Pre-fail  Always
-   0
  3 Spin_Up_Time0x0003   218   150   021Pre-fail  Always
-   2100
  4 Start_Stop_Count0x0032   100   100   000Old_age   Always
-   118
  5 Reallocated_Sector_Ct   0x0033   200   200   140Pre-fail  Always
-   0
  7 Seek_Error_Rate 0x000e   200   200   051Old_age   Always
-   0
  9 Power_On_Hours  0x0032   088   088   000Old_age   Always
-   9316
 10 Spin_Retry_Count0x0012   100   100   051Old_age   Always
-   0
 11 Calibration_Retry_Count 0x0012   100   100   051Old_age   Always
-   0
 12 Power_Cycle_Count   0x0032   100   100   000Old_age   Always
-   116
192 Power-Off_Retract_Count 0x0032   200   200   000Old_age   Always
-   115
193 Load_Cycle_Count0x0032   200   200   000Old_age   Always
-   118
194 Temperature_Celsius 0x0022   109   103   000Old_age   Always
-   38
196 Reallocated_Event_Count 0x0032   200   200   000Old_age   Always
-   0
197 Current_Pending_Sector  0x0012   200   200   000Old_age   Always
-   0
198 Offline_Uncorrectable   0x0010   200   200   000Old_age
Offline  -   0
199 UDMA_CRC_Error_Count0x003e   200   200   000Old_age   Always
-   0
200 Multi_Zone_Error_Rate   0x0008   200   200   051Old_age
Offline  -   0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_DescriptionStatus  Remaining
LifeTime(hours)  LBA_of_first_error
# 1  Extended offlineCompleted without error   00%  9299
-
# 2  Short offline   Completed without error   00%  9298
-

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
1

qbittorrent 2.2.9 8.0-STABLE Amd64

2010-06-18 Thread Жиндарев Алексей

Jun 18 19:33:54  last message repeated 371 times
Jun 18 19:41:31  last message repeated 1359 times
Jun 18 19:43:29  kernel: WARNING pid 31369 (qbittorrent): ioctl sign-extension 
ioctl 8004667e
Jun 18 19:44:00  last message repeated 545 times
Jun 18 19:45:45  last message repeated 1751 times
Jun 18 19:45:46  kernel: WARNING pid 31369 (qbittorrent): ioctl sign-extension 
ioctl 8004667e
Jun 18 19:46:17  last message repeated 481 times

Manifested after the new port, possibly after updating QT

-- 
С Уважением,Алексей Жиндарев.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: [HEADSUP]: Ports feature freeze for 8.1 now in effect

2010-06-18 Thread Ion-Mihai Tetcu

On Fri, 18 Jun 2010 14:10:28 +0200
Erwin Lansing  wrote:

> In preparation for 8.1-RELEASE, the ports tree is now in feature
> freeze.
> 
> Normal upgrade, new ports, and changes that only affect other branches
> are allowed without prior approval but with the extra Feature safe:
> yes tag in the commit message. Any commit that is sweeping, i.e.
> touches a large number of ports, infrastructural changes, commits to
> ports with unusually high number of dependent ports, and any other
> commit that requires the rebuilding of many packages is not allowed
> without prior explicit approval from portmgr after that date.
> 
> When in doubt, please do not hesitate to contact portmgr.


 "any commit that requires the rebuilding of many packages"

And this time we will ask for instant back-out of everything that
should had not been committed in the first place.


If you have time, you can always help with unmaintained ports:
http://qat.tecnik93.com/index.php?action=failed_buildports&maintainer=ports%40freebsd.org&;
or even maintained ones:
http://qat.tecnik93.com/index.php?action=failed_buildports


Help us getting a good, stable package set for the release please,

-- 
IOnut - Un^d^dregistered ;) FreeBSD "user"
  "Intellectual Property" is   nowhere near as valuable   as "Intellect"
FreeBSD committer -> ite...@freebsd.org, PGP Key ID 057E9F8B493A297B


signature.asc
Description: PGP signature

Re: 7.2-RELEASE-p4, IO errors & RAID1 failure

2010-06-18 Thread Alexander Motin

Jeremy Chadwick wrote:
> On Fri, Jun 18, 2010 at 01:36:53PM +0200, Miroslav Lachman wrote:
>> Jeremy Chadwick wrote:
>>> On Fri, Jun 18, 2010 at 08:08:24AM +0100, Matthew Lear wrote:
>> [...]
>>
 The drives in the RAID exist on two seperate ATA channels:
 [r...@meshuga /home/matt]# atacontrol list
 ATA channel 0:
 Master:  ad0  SATA revision 2.x
 Slave:   ad1  SATA revision 1.x
 ATA channel 1:
 Master:  ad2  SATA revision 2.x
 Slave:   no device present
 ATA channel 2:
 Master: acd0  SATA revision 1.x
 Slave:   no device present
 ATA channel 3:
 Master:  no device present
 Slave:   no device present

 ad1 is a third 160G drive that I periodically back up to using cron.
>>> So your RAID-1 array consists of ad0 and ad2?  You didn't provide
>>> "atacontrol status" output so I'm going to assume that's the case.
>>>
>>> What's odd to me is that you somehow have two disks on a single ATA
>>> channel -- look closely at channel 0.  SATA has a 1:1 device-to-channel
>>> mapping, so I'm a little surprised to see there's two devices on channel
>>> 0.  To me, this indicates your system BIOS is configured to run in
>>> "Emulation" mode -- where the ATA controller pretends to be a PATA/IDE
>>> controller, thus SATA-0 and SATA-1 devices appear as primary master and
>>> primary slave, respectively.
>>>
>>> What motherboard is this?  Can you change the setting to either
>>> "Native", "Enhanced", or (even better) "AHCI"?  I've seen some systems
>>> where the Serial ATA option in the BIOS has an "Auto" option, which does
>>> totally bizarre things at times.
>>>
>>> But before changing the setting, I would recommend dealing with the disk
>>> problem first.  Changing the SATA controller operation mode will almost
>>> certainly change all of your device names (you'll have to go into
>>> single-user mode, mount filesystems by hand, fix /etc/fstab, etc.).
>> [...]
>>
>> It is "normal" on HP G5 series. I have ProLiant ML 110 G5. I tried
>> all type of settings in BIOS, but all of them shows two disks on one
>> ATA channel:
>>
>> HP ProLiant ML 110 G5
>>
>> FreeBSD 7.2-RELEASE-p4 amd64 GENERIC
>>
>> r...@kiwi ~/# atacontrol list
>> ATA channel 0:
>> Master:  ad0  SATA revision 2.x
>> Slave:   ad1  SATA revision 2.x
>> ATA channel 1:
>> Master:  ad2  SATA revision 2.x
>> Slave:   ad3  SATA revision 2.x
>> ATA channel 2:
>> Master: acd0  SATA revision 1.x
>> Slave:   no device present
>> ATA channel 3:
>> Master:  no device present
>> Slave:   no device present
>>
>>
>>
>> atapci0:  port
>> 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x1c10-0x1c1f,0x1c00-0x1c0f at
>> device 31.2 on pci0
>> ata0:  on atapci0
>> ata0: [ITHREAD]
>> ata1:  on atapci0
>> ata1: [ITHREAD]
>> pci0:  at device 31.3 (no driver attached)
>> atapci1:  port 
>> 0x1c68-0x1c6f,0x1c5c-0x1c5f,0x1c60-0x1c67,0x1c58-0x1c5b,0x1c30-0x1c3f,0x1c20-0x1c2f
>> irq 18 at device 31.5 on pci0
>> atapci1: [ITHREAD]
>> ata2:  on atapci1
>> ata2: [ITHREAD]
>> ata3:  on atapci1
>> ata3: [ITHREAD]
>>
>>
>> pciconf -lv
>> atap...@pci0:0:31:2:class=0x01018a card=0x31f4103c
>> chip=0x29208086 rev=0x02 hdr=0x00
>> vendor = 'Intel Corporation'
>> device = '82801IB/IR/IH (ICH9 Family) 4 port Serial ATA
>> Storage Controller 1'
>> class  = mass storage
>> subclass   = ATA
>>
>> atap...@pci0:0:31:5:class=0x010185 card=0x31f4103c
>> chip=0x29268086 rev=0x02 hdr=0x00
>> vendor = 'Intel Corporation'
>> device = '82801IB/IR/IH (ICH9 Family) 2 port Serial ATA
>> Storage Controller 2'
>> class  = mass storage
>> subclass   = ATA
>>
>>
>>
>> ad0: 953869MB  at ata0-master SATA300
>> ad1: 953869MB  at ata0-slave SATA300
>> ad2: 953869MB  at ata1-master SATA300
>> ad3: 953869MB  at ata1-slave SATA300
>> da0 at umass-sim0 bus 0 target 0 lun 0
>> da0:  Removable Direct Access SCSI-2 device
>> da0: 40.000MB/s transfers
>> da0: 1928MB (3948544 512 byte sectors: 255H 63S/T 245C)
>> acd0: DVDR  at ata2-master SATA150
>>
>>
>> I am using this machine as storage for backups with ZFS RAIDZ
>> without any timeouts so I think that two disks on one channel is not
>> causing the timeouts (only little slowdown)
> 
> Wow, that's really... interesting.  :-)  What this indicates is that the
> controller is running in Native/Enhanced mode yet devices attached to
> SATA ports #0/#1 are master/slave on channel 0, and ports #2/#3 are
> master/slave on channel 1.

Except AHCI, all other modes are just variations of PATA emulation.
"subclass   = ATA" means that there is no AHCI enabled. PATA emulation
itself should not be a problem, but it is definitely not good from
performance and hot-swap points.

As already told, ata(4) has very strict timeout values. It may happen,
that due to medium errors drive needs too much time co complete I/O. It
is theoretically possible that SMART may complete the test due to higher
timeo

Re: /usr/src fails to compile in 8.0 after cvsup to RELENG_8

2010-06-18 Thread William D. Colburn (Schlake)

On Thu, Jun 17, 2010 at 7:09 AM, Matthew Seaman
 wrote:
> Uh -- you don't need to compile emacs again.  You can package up what
> you already have installed under 8.0: e.g.

I just reinstalled from the new CD without saving the package.  Emacs
is compiling, 8.1 is running, and I went ahead and ordered 8T more of
disk to add to the machine to celebrate my success.

-- 
-- Schlake
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

[HEADSUP]: Ports feature freeze for 8.1 now in effect

2010-06-18 Thread Erwin Lansing

In preparation for 8.1-RELEASE, the ports tree is now in feature freeze.

Normal upgrade, new ports, and changes that only affect other branches
are allowed without prior approval but with the extra Feature safe: yes
tag in the commit message. Any commit that is sweeping, i.e. touches a
large number of ports, infrastructural changes, commits to ports with
unusually high number of dependent ports, and any other commit that
requires the rebuilding of many packages is not allowed without prior
explicit approval from portmgr after that date.

When in doubt, please do not hesitate to contact portmgr.

-- 
Erwin Lansing   http://droso.org
Prediction is very difficult
especially about the futureer...@freebsd.org


pgpjUJDoWTfnI.pgp
Description: PGP signature

Re: 7.2-RELEASE-p4, IO errors & RAID1 failure

2010-06-18 Thread Jeremy Chadwick

On Fri, Jun 18, 2010 at 01:36:53PM +0200, Miroslav Lachman wrote:
> Jeremy Chadwick wrote:
> >On Fri, Jun 18, 2010 at 08:08:24AM +0100, Matthew Lear wrote:
> 
> [...]
> 
> >>The drives in the RAID exist on two seperate ATA channels:
> >>[r...@meshuga /home/matt]# atacontrol list
> >>ATA channel 0:
> >> Master:  ad0  SATA revision 2.x
> >> Slave:   ad1  SATA revision 1.x
> >>ATA channel 1:
> >> Master:  ad2  SATA revision 2.x
> >> Slave:   no device present
> >>ATA channel 2:
> >> Master: acd0  SATA revision 1.x
> >> Slave:   no device present
> >>ATA channel 3:
> >> Master:  no device present
> >> Slave:   no device present
> >>
> >>ad1 is a third 160G drive that I periodically back up to using cron.
> >
> >So your RAID-1 array consists of ad0 and ad2?  You didn't provide
> >"atacontrol status" output so I'm going to assume that's the case.
> >
> >What's odd to me is that you somehow have two disks on a single ATA
> >channel -- look closely at channel 0.  SATA has a 1:1 device-to-channel
> >mapping, so I'm a little surprised to see there's two devices on channel
> >0.  To me, this indicates your system BIOS is configured to run in
> >"Emulation" mode -- where the ATA controller pretends to be a PATA/IDE
> >controller, thus SATA-0 and SATA-1 devices appear as primary master and
> >primary slave, respectively.
> >
> >What motherboard is this?  Can you change the setting to either
> >"Native", "Enhanced", or (even better) "AHCI"?  I've seen some systems
> >where the Serial ATA option in the BIOS has an "Auto" option, which does
> >totally bizarre things at times.
> >
> >But before changing the setting, I would recommend dealing with the disk
> >problem first.  Changing the SATA controller operation mode will almost
> >certainly change all of your device names (you'll have to go into
> >single-user mode, mount filesystems by hand, fix /etc/fstab, etc.).
> 
> [...]
> 
> It is "normal" on HP G5 series. I have ProLiant ML 110 G5. I tried
> all type of settings in BIOS, but all of them shows two disks on one
> ATA channel:
> 
> HP ProLiant ML 110 G5
> 
> FreeBSD 7.2-RELEASE-p4 amd64 GENERIC
> 
> r...@kiwi ~/# atacontrol list
> ATA channel 0:
> Master:  ad0  SATA revision 2.x
> Slave:   ad1  SATA revision 2.x
> ATA channel 1:
> Master:  ad2  SATA revision 2.x
> Slave:   ad3  SATA revision 2.x
> ATA channel 2:
> Master: acd0  SATA revision 1.x
> Slave:   no device present
> ATA channel 3:
> Master:  no device present
> Slave:   no device present
> 
> 
> 
> atapci0:  port
> 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x1c10-0x1c1f,0x1c00-0x1c0f at
> device 31.2 on pci0
> ata0:  on atapci0
> ata0: [ITHREAD]
> ata1:  on atapci0
> ata1: [ITHREAD]
> pci0:  at device 31.3 (no driver attached)
> atapci1:  port 
> 0x1c68-0x1c6f,0x1c5c-0x1c5f,0x1c60-0x1c67,0x1c58-0x1c5b,0x1c30-0x1c3f,0x1c20-0x1c2f
> irq 18 at device 31.5 on pci0
> atapci1: [ITHREAD]
> ata2:  on atapci1
> ata2: [ITHREAD]
> ata3:  on atapci1
> ata3: [ITHREAD]
> 
> 
> pciconf -lv
> atap...@pci0:0:31:2:class=0x01018a card=0x31f4103c
> chip=0x29208086 rev=0x02 hdr=0x00
> vendor = 'Intel Corporation'
> device = '82801IB/IR/IH (ICH9 Family) 4 port Serial ATA
> Storage Controller 1'
> class  = mass storage
> subclass   = ATA
> 
> atap...@pci0:0:31:5:class=0x010185 card=0x31f4103c
> chip=0x29268086 rev=0x02 hdr=0x00
> vendor = 'Intel Corporation'
> device = '82801IB/IR/IH (ICH9 Family) 2 port Serial ATA
> Storage Controller 2'
> class  = mass storage
> subclass   = ATA
> 
> 
> 
> ad0: 953869MB  at ata0-master SATA300
> ad1: 953869MB  at ata0-slave SATA300
> ad2: 953869MB  at ata1-master SATA300
> ad3: 953869MB  at ata1-slave SATA300
> da0 at umass-sim0 bus 0 target 0 lun 0
> da0:  Removable Direct Access SCSI-2 device
> da0: 40.000MB/s transfers
> da0: 1928MB (3948544 512 byte sectors: 255H 63S/T 245C)
> acd0: DVDR  at ata2-master SATA150
> 
> 
> I am using this machine as storage for backups with ZFS RAIDZ
> without any timeouts so I think that two disks on one channel is not
> causing the timeouts (only little slowdown)

Wow, that's really... interesting.  :-)  What this indicates is that the
controller is running in Native/Enhanced mode yet devices attached to
SATA ports #0/#1 are master/slave on channel 0, and ports #2/#3 are
master/slave on channel 1.

I don't have an HP box to tinker with, but when I get home in a few
hours I'll tinker around with BIOS settings on my Supermicro ICH7-based
test box and see if I can reproduce it.

I'm curious: does the same behaviour (see first paragraph) happen on
RELENG_8 as well?

Adding mav@ for some insights -- you ever seen this, Alexander?

-- 
| Jeremy Chadwick   j...@parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for

Re: 7.2-RELEASE-p4, IO errors & RAID1 failure

2010-06-18 Thread Miroslav Lachman


Jeremy Chadwick wrote:

On Fri, Jun 18, 2010 at 08:08:24AM +0100, Matthew Lear wrote:


[...]


The drives in the RAID exist on two seperate ATA channels:
[r...@meshuga /home/matt]# atacontrol list
ATA channel 0:
 Master:  ad0  SATA revision 2.x
 Slave:   ad1  SATA revision 1.x
ATA channel 1:
 Master:  ad2  SATA revision 2.x
 Slave:   no device present
ATA channel 2:
 Master: acd0  SATA revision 1.x
 Slave:   no device present
ATA channel 3:
 Master:  no device present
 Slave:   no device present

ad1 is a third 160G drive that I periodically back up to using cron.


So your RAID-1 array consists of ad0 and ad2?  You didn't provide
"atacontrol status" output so I'm going to assume that's the case.

What's odd to me is that you somehow have two disks on a single ATA
channel -- look closely at channel 0.  SATA has a 1:1 device-to-channel
mapping, so I'm a little surprised to see there's two devices on channel
0.  To me, this indicates your system BIOS is configured to run in
"Emulation" mode -- where the ATA controller pretends to be a PATA/IDE
controller, thus SATA-0 and SATA-1 devices appear as primary master and
primary slave, respectively.

What motherboard is this?  Can you change the setting to either
"Native", "Enhanced", or (even better) "AHCI"?  I've seen some systems
where the Serial ATA option in the BIOS has an "Auto" option, which does
totally bizarre things at times.

But before changing the setting, I would recommend dealing with the disk
problem first.  Changing the SATA controller operation mode will almost
certainly change all of your device names (you'll have to go into
single-user mode, mount filesystems by hand, fix /etc/fstab, etc.).


[...]

It is "normal" on HP G5 series. I have ProLiant ML 110 G5. I tried all 
type of settings in BIOS, but all of them shows two disks on one ATA 
channel:


HP ProLiant ML 110 G5

FreeBSD 7.2-RELEASE-p4 amd64 GENERIC

r...@kiwi ~/# atacontrol list
ATA channel 0:
Master:  ad0  SATA revision 2.x
Slave:   ad1  SATA revision 2.x
ATA channel 1:
Master:  ad2  SATA revision 2.x
Slave:   ad3  SATA revision 2.x
ATA channel 2:
Master: acd0  SATA revision 1.x
Slave:   no device present
ATA channel 3:
Master:  no device present
Slave:   no device present



atapci0:  port 
0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x1c10-0x1c1f,0x1c00-0x1c0f at 
device 31.2 on pci0

ata0:  on atapci0
ata0: [ITHREAD]
ata1:  on atapci0
ata1: [ITHREAD]
pci0:  at device 31.3 (no driver attached)
atapci1:  port 
0x1c68-0x1c6f,0x1c5c-0x1c5f,0x1c60-0x1c67,0x1c58-0x1c5b,0x1c30-0x1c3f,0x1c20-0x1c2f 
irq 18 at device 31.5 on pci0

atapci1: [ITHREAD]
ata2:  on atapci1
ata2: [ITHREAD]
ata3:  on atapci1
ata3: [ITHREAD]


pciconf -lv
atap...@pci0:0:31:2:class=0x01018a card=0x31f4103c chip=0x29208086 
rev=0x02 hdr=0x00

vendor = 'Intel Corporation'
device = '82801IB/IR/IH (ICH9 Family) 4 port Serial ATA Storage 
Controller 1'

class  = mass storage
subclass   = ATA

atap...@pci0:0:31:5:class=0x010185 card=0x31f4103c chip=0x29268086 
rev=0x02 hdr=0x00

vendor = 'Intel Corporation'
device = '82801IB/IR/IH (ICH9 Family) 2 port Serial ATA Storage 
Controller 2'

class  = mass storage
subclass   = ATA



ad0: 953869MB  at ata0-master SATA300
ad1: 953869MB  at ata0-slave SATA300
ad2: 953869MB  at ata1-master SATA300
ad3: 953869MB  at ata1-slave SATA300
da0 at umass-sim0 bus 0 target 0 lun 0
da0:  Removable Direct Access SCSI-2 device
da0: 40.000MB/s transfers
da0: 1928MB (3948544 512 byte sectors: 255H 63S/T 245C)
acd0: DVDR  at ata2-master SATA150


I am using this machine as storage for backups with ZFS RAIDZ without 
any timeouts so I think that two disks on one channel is not causing the 
timeouts (only little slowdown)


Miroslav Lachman
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: network deamons starting before network!

2010-06-18 Thread Alfred Bartsch

Mark Stapper schrieb:
> Hello,
> 
> Since updating to 8.X I noticed that network services were started
> before the network was up!
> I use lagg failover configuration on both my FreeBSD boxes.
> First, boot fails on mounting my nfs-shares.
> After entering and exiting the "rescue" shell, the system boots as normal.
> 
> uname -a
> FreeBSD mario 8.1-PRERELEASE FreeBSD 8.1-PRERELEASE #4: Fri Jun 18
> 07:46:01 CEST 2010 *...@mario:/usr/obj/usr/src/sys/mario  amd64
> What could I do to fix this?
> 
> Here's an exerpt from /var/log/messages:
> 
> Jun 18 09:10:25  ntpd[1376]: ntpd 4.2.4p5-a (1)
> Jun 18 09:10:25  kernel: fuse4bsd: version 0.3.9-pre1, FUSE ABI 7.8
> Jun 18 09:10:27  ntpd_initres[1412]: host name not found: yoshi
> Jun 18 09:10:27  kernel: nfe0: link state changed to UP
> Jun 18 09:10:27  kernel: lagg0: link state changed to UP
> Jun 18 09:10:27  kernel: nfe1: link state changed to UP
> Jun 18 09:10:27  ntpd_initres[1412]: couldn't resolve `yoshi', giving up
> on it
> Jun 18 09:10:28  dhclient: New Hostname (lagg0): mario
> Jun 18 09:10:28  dhclient: New IP Address (lagg0): 10.58.235.6
> Jun 18 09:10:28  dhclient: New Subnet Mask (lagg0): 255.255.255.0
> Jun 18 09:10:28  dhclient: New Broadcast Address (lagg0): 10.58.235.255
> Jun 18 09:10:28  dhclient: New Routers (lagg0): 10.58.235.1
> 
> 

Hello,

adding:
synchronous_dhclient="YES"
to /etc/rc.conf solved some similar issues for me.
The default behaviour of getting an IP via dhcp has changed.
-- 
Alfred Bartsch
mailto:bart...@dssgmbh.de
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: network deamons starting before network!

2010-06-18 Thread Jeremy Chadwick

On Fri, Jun 18, 2010 at 10:47:34AM +0200, Mark Stapper wrote:
> On 18/06/2010 10:26, Jeremy Chadwick wrote:
> > On Fri, Jun 18, 2010 at 09:34:24AM +0200, Mark Stapper wrote:
> >   
> >> Hello,
> >>
> >> Since updating to 8.X I noticed that network services were started
> >> before the network was up!
> >> 
> > This is common/normal.  I wrote a script (still working on getting it
> > added to the base system) to deal with this issue.  Please see:
> >
> > http://jdc.parodius.com/freebsd/netwait
> >
> > Drop this into /usr/local/etc/rc.d, chmod it 755, and read the
> > descriptions of the variables and add the appropriate settings to
> > rc.conf.
> >
> > Please be aware netwait_if only takes a single interface; you're using
> > lagg, so I hope that when your system boots whichever interface you pick
> > in netwait_if the one which you default to using.
> >
> >   
> couldn't I use lagg0 on this parameter?

You could, but it won't necessarily work in the way you think it will.

netwait explicitly looks for link interface state ("status: no carrier")
in ifconfig output.  This methodology is used in other base system
scripts.

If lagg devices offer that line in ifconfig, then the script will work
but won't necessarily do what you expect.  It's very common on FreeBSD
for drivers of this sort to ""lie"" (please note the quotes) about the
state of link.  Meaning, lagg0 might show link, but maybe that means
"Yeah! I'm configured!" -- but that doesn't mean the *physical network
interfaces lagg0 is associated with* are actually functional.

See where I'm going with this?

In your situation, assuming lagg0 provides a status line in ifconfig,
then yes you can use that -- but *ultimately* you're going to be reliant
entirely upon the netwait_ip ping tests to verify network connections
are working.  That's just the nature of the beast.

The bottom line is that the netwait script *cannot* be reliably used in
"complex network situations" (and this may be one of those).  Below is
my original mail to -stable about the script (then called
"waitnetwork"); see "CAVEATS / POINTS OF INTEREST".  There is truly no
universal way that covers everyone's configuration/setup.

http://www.mail-archive.com/freebsd-stable@freebsd.org/msg109853.html

-- 
| Jeremy Chadwick   j...@parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: network deamons starting before network!

2010-06-18 Thread Mark Stapper

On 18/06/2010 10:26, Jeremy Chadwick wrote:
> On Fri, Jun 18, 2010 at 09:34:24AM +0200, Mark Stapper wrote:
>   
>> Hello,
>>
>> Since updating to 8.X I noticed that network services were started
>> before the network was up!
>> 
> This is common/normal.  I wrote a script (still working on getting it
> added to the base system) to deal with this issue.  Please see:
>
> http://jdc.parodius.com/freebsd/netwait
>
> Drop this into /usr/local/etc/rc.d, chmod it 755, and read the
> descriptions of the variables and add the appropriate settings to
> rc.conf.
>
> Please be aware netwait_if only takes a single interface; you're using
> lagg, so I hope that when your system boots whichever interface you pick
> in netwait_if the one which you default to using.
>
>   
couldn't I use lagg0 on this parameter?
thx btw :-)
Regards,
Mark



signature.asc
Description: OpenPGP digital signature

Re: network deamons starting before network!

2010-06-18 Thread Jeremy Chadwick

On Fri, Jun 18, 2010 at 09:34:24AM +0200, Mark Stapper wrote:
> Hello,
> 
> Since updating to 8.X I noticed that network services were started
> before the network was up!

This is common/normal.  I wrote a script (still working on getting it
added to the base system) to deal with this issue.  Please see:

http://jdc.parodius.com/freebsd/netwait

Drop this into /usr/local/etc/rc.d, chmod it 755, and read the
descriptions of the variables and add the appropriate settings to
rc.conf.

Please be aware netwait_if only takes a single interface; you're using
lagg, so I hope that when your system boots whichever interface you pick
in netwait_if the one which you default to using.

-- 
| Jeremy Chadwick   j...@parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: 7.3-STABLE acpi and mpt problems?

2010-06-18 Thread Andriy Gapon

on 18/06/2010 10:06 Lystopad Olexandr said the following:
>  Hello, John Baldwin!
> 
> On Wed, Jun 16, 2010 at 07:55:27AM -0400
> j...@freebsd.org wrote about "Re: 7.3-STABLE acpi and mpt problems?":
>> On Wednesday 16 June 2010 7:14:09 am Lystopad Olexandr wrote:
>>> Hi!
>>>
>>> I have FreeBSD 7.3-STABLE on Biostar 945GC-M4 motherboard.
>>> When I boot with custom kernel it fail to boot on ACPI. FreeBSD box
>>> do not respond to any key. But it successfully boots with button 2
>>> from boot menu (without acpi).
>>>
>>> Then I try to boot with GENERIC. Success! Ok, I try comment
>>> out one-by-one devices in GENERIC, rebuild and install kernel,
>>> and try to boot. My custom kernel boot fine until I comment
>>> out mpt device in kernel, but it successfully boots without
>>> acpi (button 2 on boot menu). I have no mpt devices in
>>> this box. When I insert "device mpt" back to the kernel it boots
>>> successfully.
>>>
>>> Why acpi depends on device mpt? Why this server do not boot without
>>> this device?
>> How does it fail to boot in the non-mpt case?  Does it hang, does it panic?  
>> If it hangs, can you break into DDB and capture the output of 'ps' and a 
>> stack 
>> trace?  (A serial console is probably useful for this.)
> 
> John, thank you very much for your answer.
> Sorry for delay.
> 
> My freebsd box fails to boot after detecting CPU and memory. It
> simply hang. I think this is too early to run ddb or somethig else.
> 
> Also, similar problem with this device I detect on two remote bsd
> servers in production. After insert mpt device into kernel, that
> servers begin boots fine.

Can you please produce _verbose_ dmesg for booting with acpi+mpt and no acpi, no
mpt?  And also, _verbose_ dmesg of acpi and !mpt.
You can capture the latter with serial console.
If you don't have it, try to get screenshots.


-- 
Andriy Gapon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: 7.2-RELEASE-p4, IO errors & RAID1 failure

2010-06-18 Thread Jeremy Chadwick

On Fri, Jun 18, 2010 at 08:08:24AM +0100, Matthew Lear wrote:
> Hi there,
> 
> I'm running 7.2-RELEASE-p4 on an i386 HP server (ML G5) in RAID1
> configuration. Very recently, I've seen IO errors such as:
> 
> ad0: TIMEOUT - READ_DMA retrying (1 retry left) LBA=20472527
> 
> reported and the RAID mirror is now offline.
> 
> ad0: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=395032335
> ad0: FAILURE - WRITE_DMA48 status=51
> error=10 LBA=395032335
> ar0: WARNING - mirror protection lost. RAID1 array in DEGRADED mode
> 
> Strangely, I've ran some SMART tests on the device and no error has been
> recorded. Health checks pass. Running a long test on the device doesn't
> show any problem. While SMART can be manufacturer specific I at least
> expected to see something which looked to be suspicious.

Could you please provide the full output from "smartctl -a /dev/ad0"
here?  Your drive may be completely fine and you may not have to swap it
at all; hard to say.

> The drives in the RAID exist on two seperate ATA channels:
> [r...@meshuga /home/matt]# atacontrol list
> ATA channel 0:
> Master:  ad0  SATA revision 2.x
> Slave:   ad1  SATA revision 1.x
> ATA channel 1:
> Master:  ad2  SATA revision 2.x
> Slave:   no device present
> ATA channel 2:
> Master: acd0  SATA revision 1.x
> Slave:   no device present
> ATA channel 3:
> Master:  no device present
> Slave:   no device present
> 
> ad1 is a third 160G drive that I periodically back up to using cron.

So your RAID-1 array consists of ad0 and ad2?  You didn't provide
"atacontrol status" output so I'm going to assume that's the case.

What's odd to me is that you somehow have two disks on a single ATA
channel -- look closely at channel 0.  SATA has a 1:1 device-to-channel
mapping, so I'm a little surprised to see there's two devices on channel
0.  To me, this indicates your system BIOS is configured to run in
"Emulation" mode -- where the ATA controller pretends to be a PATA/IDE
controller, thus SATA-0 and SATA-1 devices appear as primary master and
primary slave, respectively.

What motherboard is this?  Can you change the setting to either
"Native", "Enhanced", or (even better) "AHCI"?  I've seen some systems
where the Serial ATA option in the BIOS has an "Auto" option, which does
totally bizarre things at times.

But before changing the setting, I would recommend dealing with the disk
problem first.  Changing the SATA controller operation mode will almost
certainly change all of your device names (you'll have to go into
single-user mode, mount filesystems by hand, fix /etc/fstab, etc.).

Also, can you please provide output from "dmesg | grep -i ata"?

> I've seen the thread below but I'm not using ZFS. This seems similar to
> what I'm experiencing.
> http://freebsd.monkey.org/freebsd-stable/200801/msg00617.html
> 
> I'm using software RAID with atacontrol but the drives are not hot-swap.

When you say "software RAID", I'm assuming you're referring to ata(4)'s
native OS-level RAID (as in "atacontrol create RAID1 ad0 ad1").  Or are
you using something like Intel MatrixRAID?

> Therefore I expect that I need to detach ad0 from the RAID, power down
> the unit, replace the drive, power on the unit and rebuild the array in
> order to fix things. Trouble is, I'm struggling to find out if this can
> be done safely with atacontrol and the hw configuration I have, and if
> so, how best to do it?

The atacontrol man page covers your situation:

  It is NOT recommended to create such arrays on a primary/secondary pair
  on a SINGLE channel since the throughput of the mirror would be severely
  compromised, the ability to rebuild the array in the event of a disk
  failure would be greatly complicated, and if a disk controller
  electronics failed it could wedge the channel and take both disks in the
  mirror offline.  (which would defeat the purpose of having a mirror in
  the first place)

I realise ad0 is on channel 0 and ad2 is on channel 1, but you have a
"mystery device" as a Slave on channel 0, which is going to be impacted.

You really need AHCI to be able to hot-swap effectively.  The procedure
I've followed for years -- without ZFS in the picture (that should just
add a few extra commands to the picture) -- relies on AHCI and a proper
hot-swap bay/backplane.  Hot-swapping disks without such a backplane,
in my experience, results in the system powering off suddenly.  Anyway,
this is the procedure:

- atacontrol detach ataX   (where ataX = channel disk is attached to)
- Physically remove the bad disk
- Physically insert a new disk
- Wait 15 seconds for drive to settle
- atacontrol attach ataX

The new disk should appear automatically, and should appear as the same
device name (adX) that it did before.  At least that's my experience
when using AHCI with ataahci.ko (I haven't tried when using ahci.ko,
which uses CAM).  We can discuss the details/differences later.

If the disk doesn't reappear ("atacontrol list" shows

Re: 7.2-RELEASE-p4, IO errors & RAID1 failure

2010-06-18 Thread Pieter de Boer


Hi Matthew,


I'm running 7.2-RELEASE-p4 on an i386 HP server (ML G5) in RAID1
configuration. Very recently, I've seen IO errors such as:

ad0: TIMEOUT - READ_DMA retrying (1 retry left) LBA=20472527

reported and the RAID mirror is now offline.

ad0: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=395032335
ad0: FAILURE - WRITE_DMA48 status=51
error=10  LBA=395032335
ar0: WARNING - mirror protection lost. RAID1 array in DEGRADED mode

I had more or less the same timeout issues on my 8.0-RELEASE box on a 
Dell R300 with SATA disks. What I did was raise the ata timeout from 5 
seconds to 20. I did this by patching the kernel code while running, but 
I'm not sure you'd like that approach ;)


In http://www.freebsd.org/cgi/query-pr.cgi?pr=111023 a patch is 
presented that raises the timeouts by patching a few ATA kernel source 
files. This has been committed to RELENG_7 as well, so by upgrading your 
7.2-install to the latest RELENG_7 (or RELENG_8), you'll have that 
timeout fix.


Why ATA commands can take longer than 5 seconds although the disks 
appear to be fine.. wouldn't know ..


--
Pieter



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

7.2-RELEASE-p4, IO errors & RAID1 failure

2010-06-18 Thread Matthew Lear

Hi there,

I'm running 7.2-RELEASE-p4 on an i386 HP server (ML G5) in RAID1
configuration. Very recently, I've seen IO errors such as:

ad0: TIMEOUT - READ_DMA retrying (1 retry left) LBA=20472527

reported and the RAID mirror is now offline.

ad0: TIMEOUT - WRITE_DMA48 retrying (1 retry left) LBA=395032335
ad0: FAILURE - WRITE_DMA48 status=51
error=10 LBA=395032335
ar0: WARNING - mirror protection lost. RAID1 array in DEGRADED mode

Strangely, I've ran some SMART tests on the device and no error has been
recorded. Health checks pass. Running a long test on the device doesn't
show any problem. While SMART can be manufacturer specific I at least
expected to see something which looked to be suspicious.

The drives in the RAID exist on two seperate ATA channels:
[r...@meshuga /home/matt]# atacontrol list
ATA channel 0:
Master:  ad0  SATA revision 2.x
Slave:   ad1  SATA revision 1.x
ATA channel 1:
Master:  ad2  SATA revision 2.x
Slave:   no device present
ATA channel 2:
Master: acd0  SATA revision 1.x
Slave:   no device present
ATA channel 3:
Master:  no device present
Slave:   no device present

ad1 is a third 160G drive that I periodically back up to using cron.

I've seen the thread below but I'm not using ZFS. This seems similar to
what I'm experiencing.
http://freebsd.monkey.org/freebsd-stable/200801/msg00617.html

I'm using software RAID with atacontrol but the drives are not hot-swap.
Therefore I expect that I need to detach ad0 from the RAID, power down
the unit, replace the drive, power on the unit and rebuild the array in
order to fix things. Trouble is, I'm struggling to find out if this can
be done safely with atacontrol and the hw configuration I have, and if
so, how best to do it?

It may well be a case of RTFM (again) but I just wanted to run this by
the community to get some feedback. Loosing data is not an option here
so hopefully I can get the machine back up on its feet soon.

Any help or feedback much appreciated.
Thanks,
--  Matt


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

network deamons starting before network!

2010-06-18 Thread Mark Stapper

Hello,

Since updating to 8.X I noticed that network services were started
before the network was up!
I use lagg failover configuration on both my FreeBSD boxes.
First, boot fails on mounting my nfs-shares.
After entering and exiting the "rescue" shell, the system boots as normal.

uname -a
FreeBSD mario 8.1-PRERELEASE FreeBSD 8.1-PRERELEASE #4: Fri Jun 18
07:46:01 CEST 2010 *...@mario:/usr/obj/usr/src/sys/mario  amd64
What could I do to fix this?

Here's an exerpt from /var/log/messages:

Jun 18 09:10:25  ntpd[1376]: ntpd 4.2.4p5-a (1)
Jun 18 09:10:25  kernel: fuse4bsd: version 0.3.9-pre1, FUSE ABI 7.8
Jun 18 09:10:27  ntpd_initres[1412]: host name not found: yoshi
Jun 18 09:10:27  kernel: nfe0: link state changed to UP
Jun 18 09:10:27  kernel: lagg0: link state changed to UP
Jun 18 09:10:27  kernel: nfe1: link state changed to UP
Jun 18 09:10:27  ntpd_initres[1412]: couldn't resolve `yoshi', giving up
on it
Jun 18 09:10:28  dhclient: New Hostname (lagg0): mario
Jun 18 09:10:28  dhclient: New IP Address (lagg0): 10.58.235.6
Jun 18 09:10:28  dhclient: New Subnet Mask (lagg0): 255.255.255.0
Jun 18 09:10:28  dhclient: New Broadcast Address (lagg0): 10.58.235.255
Jun 18 09:10:28  dhclient: New Routers (lagg0): 10.58.235.1






signature.asc
Description: OpenPGP digital signature

Re: 7.3-STABLE acpi and mpt problems?

2010-06-18 Thread Lystopad Olexandr

 Hello, John Baldwin!

On Wed, Jun 16, 2010 at 07:55:27AM -0400
j...@freebsd.org wrote about "Re: 7.3-STABLE acpi and mpt problems?":
> On Wednesday 16 June 2010 7:14:09 am Lystopad Olexandr wrote:
> > 
> > Hi!
> > 
> > I have FreeBSD 7.3-STABLE on Biostar 945GC-M4 motherboard.
> > When I boot with custom kernel it fail to boot on ACPI. FreeBSD box
> > do not respond to any key. But it successfully boots with button 2
> > from boot menu (without acpi).
> > 
> > Then I try to boot with GENERIC. Success! Ok, I try comment
> > out one-by-one devices in GENERIC, rebuild and install kernel,
> > and try to boot. My custom kernel boot fine until I comment
> > out mpt device in kernel, but it successfully boots without
> > acpi (button 2 on boot menu). I have no mpt devices in
> > this box. When I insert "device mpt" back to the kernel it boots
> > successfully.
> > 
> > Why acpi depends on device mpt? Why this server do not boot without
> > this device?
> 
> How does it fail to boot in the non-mpt case?  Does it hang, does it panic?  
> If it hangs, can you break into DDB and capture the output of 'ps' and a 
> stack 
> trace?  (A serial console is probably useful for this.)

John, thank you very much for your answer.
Sorry for delay.

My freebsd box fails to boot after detecting CPU and memory. It
simply hang. I think this is too early to run ddb or somethig else.

Also, similar problem with this device I detect on two remote bsd
servers in production. After insert mpt device into kernel, that
servers begin boots fine.


-- 
 Olexandr Lystopad
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

39 matches

Mail list logo