Re: FreeBSD 7.0 Questions

2008-02-27 Thread Jeremy Chadwick
On Wed, Feb 27, 2008 at 11:15:30PM -0800, Doug Hardie wrote:
> I have just installed 7.0 on some new hardware.  Have never tried earlier 
> versions.  There are a couple of unexpected items that I do not understand.
>
> 2.  I have 2 SATA drives in the system.  The first is recognized as ad10 
> and the second as ad12.  I expected to see ad0 and ad1.

You shouldn't "expect" this in any way shape or form.  The device
numbers are not consistent, and are known to change depending upon lots
of reasons (AHCI disabled/enabled, another ATA controller in place, SATA
in "compatible" mode or "enhanced" mode, etc. etc.).

Thus, never expect the adX devices to "start with 0".

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


FreeBSD 7.0 Questions

2008-02-27 Thread Doug Hardie
I have just installed 7.0 on some new hardware.  Have never tried  
earlier versions.  There are a couple of unexpected items that I do  
not understand.


1.  When booting there are a few messages like:
acd0: TIMEOUT - READ_BIG retrying

When the system is booted off CD (like installation disk), it  
takes about 10 minutes
after these messages first appear and then it finally starts up  
sysinstall properly.


After the system is installed on disk, it reboots and just  
continues on past those
messages with no delays.  Sometimes those messages do not appear  
in dmesg.boot.


Why are there long delays when booting from CD?

2.  I have 2 SATA drives in the system.  The first is recognized as  
ad10 and the second as
ad12.  I expected to see ad0 and ad1. 
 
___

freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: fsck_ufs: cannot alloc 94208 bytes for inoinfo

2008-02-27 Thread Peter Jeremy
On Wed, Feb 27, 2008 at 11:53:01AM -0800, Matthew Dillon wrote:
> You can also reduce fsck time by reducing the number of cylinder
> groups on the disk.  I usually max them out (-c 999 and newfs then
> sets it to the maximum, usually in the 50-80 range).  This will
> improve performance but not reduce the memory required.

Note that this advice is relevant for UFS1 only.  In UFS2, '-c'
specifies the cylinder group size in _BLOCKS_ not cylinders and
defaults to the maximum size for the given blocksize and IPG etc.

-- 
Peter Jeremy
Please excuse any delays as the result of my ISP's inability to implement
an MTA that is either RFC2821-compliant or matches their claimed behaviour.


pgpHRuuGnCnwn.pgp
Description: PGP signature


Re: FreeBSD 7.0-RELEASE Available

2008-02-27 Thread Mark Kirkwood

Ken Smith wrote:

Just in case some interested parties are not subscribed to the
freebsd-announce mailing list...  FreeBSD 7.0-RELEASE has been formally
released.  If you would like to see the release announcement it's here:

  http://www.freebsd.org/releases/7.0R/announce.html

On behalf of the FreeBSD Project thanks for your interest in FreeBSD.
We hope you enjoy the new release.

  
Thanks guys - much appreciated by those of us that use it as our 
everyday os!


... upgrading from 6.3 stable as we speak...

regards

Mark
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: Problems with promise SATA300 TX2Plus

2008-02-27 Thread Andrey V. Elsukov

Jisakiel wrote:

Is there any way to make this card work on FreeBSD 7, or anything
else that I could try? 


Did you find any new BIOS versions for you motherboard?

--
WBR, Andrey V. Elsukov
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: ad0 READ_DMA TIMEOUT errors on install of 7.0-RELEASE

2008-02-27 Thread Stephen Hurd

Scott Long wrote:

I'd like to attack these driver problems.  What I need is to spend a
couple of days with an affected system that can reliably reproduce the
problem, instrumenting and testing the driver.  I have a number of
theories about what might be going wrong, but nothing that I'm
definitely sure about.  If you are willing to set up your system with
remote power and remote serial, and if we knew a reliable way to
reproduce the problem, I could probably have the problem identified and
fixed pretty quickly.


So, it turns out that I can't have multiple PPPoE sessions at the same 
time.  :-(


I should be able to hack together something though if the rest of the 
stuff I mentioned is fine.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: ad0 READ_DMA TIMEOUT errors on install of 7.0-RELEASE

2008-02-27 Thread Stephen Hurd

Scott Long wrote:

I'm betting that it's a driver problem and not
a hardware problem, though you should probably think about migrating
your data off to a new drive sometime soon.


Yeah, ordered a replacement drive today.


I'd like to attack these driver problems.  What I need is to spend a
couple of days with an affected system that can reliably reproduce the
problem, instrumenting and testing the driver.  I have a number of
theories about what might be going wrong, but nothing that I'm
definitely sure about.  If you are willing to set up your system with
remote power and remote serial, and if we knew a reliable way to
reproduce the problem, I could probably have the problem identified and
fixed pretty quickly.


Hrm... if my ISP allows multiple PPPoE sessions, the remote serial 
shouldn't be a problem.  Remote power though... would my idling in IRC 
and you poking me whenever you want the power state changed work?


I'm not completely certain it's even possible to turn the system in 
question off short of the physical power switch and even if it is, I'm 
positive that it's not possible to turn it back on again since there is 
no power switch header on the motherboard.


If I *can* get multiple PPPoE sessions running, the setup would be 
SSHing to a FreeBSD-sparc64 system with a serial and network connection 
to the affected system.  I could give you root access on one or both 
boxes as needed.


I would want you to make me feel a bit more comfortable about letting 
someone play with the ata driver on a system with a known flakey HD.  
Some kinda of "It's super-duper unlikely that I would thrash the FS" 
thing since I won't be able to put the new HD in for a week or two and I 
have the usual home system backup plan in place (that is, I plan on 
backing up someday...)


I'll try setting up the extra PPPoE session now (since I'm curious about 
it anyways) and get back to you on that detail.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 7.0-RELEASE Available

2008-02-27 Thread Dirk Arlt
Ken Smith <[EMAIL PROTECTED]> writes:
[...]
>   http://www.freebsd.org/releases/7.0R/announce.html
>
> On behalf of the FreeBSD Project thanks for your interest in FreeBSD.
> We hope you enjoy the new release.
>
Thanks for all your work (done and to be done).

Dirk
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 7, Razer Copperhead mouse patch

2008-02-27 Thread Oliver Herold
Hi

this patch works (console, mousepointer) but it doesn't work in X.

ums0:  on uhub0
ums0: 7 buttons and Z dir.
ukbd1:  o
n uhub0
kbd3 at ukbd1

---

AUDIT: Wed Feb 27 23:36:12 2008: 1149 X: client 1 rejected from local
host (uid
1001)
  Auth name: MIT-MAGIC-COOKIE-1 ID: -1
Xlib: connection to ":0.0" refused by server
Xlib: Invalid MIT-MAGIC-COOKIE-1 key

waiting for X server to begin accepting connections .
AUDIT: Wed Feb 27 23:36:14 2008: 1149 X: client 1 rejected from local
host (uid
1001)
  Auth name: MIT-MAGIC-COOKIE-1 ID: -1
Xlib: connection to ":0.0" refused by server
Xlib: Invalid MIT-MAGIC-COOKIE-1 key

---

So I cannot use the Razer (or any other mouse) in X with this patch.


--Oliver

John Baldwin <[EMAIL PROTECTED]> wrote:
> On Saturday 23 February 2008 03:32:41 pm Dominic Fandrey wrote:
> > Oliver Herold wrote:
> > > Hi
> > > 
> > > the Razer Copperhead mouse did work in FreeBSD 7 (current) for a long
> > > time, but after some period it stopped working. This patch from Uwe
> > > Grohnwaldt:
> > > 
> > > http://www.freebsd.org/cgi/query-pr.cgi?pr=usb/118670
> > > 
> > > fixes this wrong behaviour (it's detected as keyboard) and makes the
> > > mouse work in FreeBSD 7 again. Would be imho a nice addition for 
> > > RELENG_7 (stable) or even Release.
> > > 
> > > Cheers, Oliver
> > 
> > Being a Razer fan and user I totally agree.
> 
> Can you try this patch instead?
> 
> Index: ums.c
> ===
> RCS file: /usr/cvs/src/sys/dev/usb/ums.c,v
> retrieving revision 1.97
> diff -u -r1.97 ums.c
> --- ums.c 26 Dec 2007 14:31:16 -  1.97
> +++ ums.c 27 Feb 2008 21:40:48 -
> @@ -198,7 +198,10 @@
>   if (err)
>   return (UMATCH_NONE);
>  
> - if (id->bInterfaceClass == UICLASS_HID &&
> + if (hid_is_collection(desc, size,
> +   HID_USAGE2(HUP_GENERIC_DESKTOP, HUG_MOUSE)))
> + ret = UMATCH_IFACECLASS;
> + else if (id->bInterfaceClass == UICLASS_HID &&
>   id->bInterfaceSubClass == UISUBCLASS_BOOT &&
>   id->bInterfaceProtocol == UIPROTO_MOUSE)
>   ret = UMATCH_IFACECLASS;
> 
> -- 
> John Baldwin

-- 
When you have an efficient government, you have a dictatorship.
-- Harry S. Truman


pgpy3JWkhf3fb.pgp
Description: PGP signature


FreeBSD 7.0-RELEASE Available

2008-02-27 Thread Ken Smith

Just in case some interested parties are not subscribed to the
freebsd-announce mailing list...  FreeBSD 7.0-RELEASE has been formally
released.  If you would like to see the release announcement it's here:

  http://www.freebsd.org/releases/7.0R/announce.html

On behalf of the FreeBSD Project thanks for your interest in FreeBSD.
We hope you enjoy the new release.

-- 
Ken Smith
- From there to here, from here to  |   [EMAIL PROTECTED]
  there, funny things are everywhere.   |
  - Theodore Geisel |


signature.asc
Description: This is a digitally signed message part


Re: Problems with promise SATA300 TX2Plus

2008-02-27 Thread Michael Proto


Jisakiel wrote:
> Greetings. I was trying to install FreeBSD 7 on an old machine to make it a 
> fileserver. Machine is an AMD K7 1200 on an Abit AN-7 mobo (nforce2 400), 
> which has booted Freebsd 7.0 RC1 beforehand (for testing ZFS; only ACPI 
> didn't work as it hung it).
> 
> I recently bought a PCI SATA card on the cheap variety, Promise SATA300 
> TX2Plus, and a couple of 500GB Maxtors. Unfortunately, when any hard drive 
> (both of the 500's and an older 250 Maxtor which I use) is plugged to the 
> SATA ports of the pci card I get an instantaneous reboot when trying to boot 
> the bootonly cd (it doesn't reach the bootloader). If no disks are plugged it 
> works but hangs while booting (with ACPI disabled). 
> 
> Things I tried: 
> 
> - Booting another OS. Card works perfectly both in linux (2.6.24) and 
> windows, with the two hard drives visible. 
> - Leaving just one SATA drive connected to the whole system through the PCI 
> card. It doesn't matter which drive I let, both the 250G and the 500G make it 
> reboot. 
> - Swapping the power cable. 
> - Unplugging my former 2 drives to ease the load on the Enermax 365W power 
> supply. It works in Windows though, with everything on... 
> - Disabling both integrated SATA of the motherboard (SI3112 which works both 
> in linux and freebsd), and integrated firewire (just in case). 
> - Moving the PCI card to another slot. I only have another pci device, a 
> soundcard, and moving it changed the bios boot order (first the pci, then the 
> integrated sata or viceversa) with no change at all afterwards. 
> - Trying 6.3 livecd. Same insta-reboot. 
> - Updating the card's bios. There is no newer one, 1.0.0.34 is what came and 
> what's on the promise web. 
> 
> Is there any way to make this card work on FreeBSD 7, or anything else that I 
> could try? I bought it specifically for that; I might be in time to return 
> it, though I'd have to buy another one on the same price range then (perhaps 
> Promise FastTrak TX2300, HightPoint RocketRaid 1520 or Adaptec 1210SA which I 
> seem to recall that doesn't work too well on linux). Otherwise I'd be more 
> than willing to help debugging it ^^. 
> 
> Thanks everybody... 
> 
> 
> [EMAIL PROTECTED]
> 

For what its worth, I have a RocketRaid 1520 in my fileserver at home
that I've been using for over a year without any problems:

atapci0:  port
0xff00-0xff07,0xfe00-0xfe03
,0xfd00-0xfd07,0xfc00-0xfc03,0xec00-0xecff irq 17 at device 8.0 on pci0
ata2:  on atapci0
ata3:  on atapci0
...
ad4: 286168MB  at ata2-master UDMA133
ad6: 286168MB  at ata3-master UDMA133
ar0: 286168MB  status: READY


-Proto
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 7, Razer Copperhead mouse patch

2008-02-27 Thread John Baldwin
On Saturday 23 February 2008 03:32:41 pm Dominic Fandrey wrote:
> Oliver Herold wrote:
> > Hi
> > 
> > the Razer Copperhead mouse did work in FreeBSD 7 (current) for a long
> > time, but after some period it stopped working. This patch from Uwe
> > Grohnwaldt:
> > 
> > http://www.freebsd.org/cgi/query-pr.cgi?pr=usb/118670
> > 
> > fixes this wrong behaviour (it's detected as keyboard) and makes the
> > mouse work in FreeBSD 7 again. Would be imho a nice addition for 
> > RELENG_7 (stable) or even Release.
> > 
> > Cheers, Oliver
> 
> Being a Razer fan and user I totally agree.

Can you try this patch instead?

Index: ums.c
===
RCS file: /usr/cvs/src/sys/dev/usb/ums.c,v
retrieving revision 1.97
diff -u -r1.97 ums.c
--- ums.c   26 Dec 2007 14:31:16 -  1.97
+++ ums.c   27 Feb 2008 21:40:48 -
@@ -198,7 +198,10 @@
if (err)
return (UMATCH_NONE);
 
-   if (id->bInterfaceClass == UICLASS_HID &&
+   if (hid_is_collection(desc, size,
+ HID_USAGE2(HUP_GENERIC_DESKTOP, HUG_MOUSE)))
+   ret = UMATCH_IFACECLASS;
+   else if (id->bInterfaceClass == UICLASS_HID &&
id->bInterfaceSubClass == UISUBCLASS_BOOT &&
id->bInterfaceProtocol == UIPROTO_MOUSE)
ret = UMATCH_IFACECLASS;

-- 
John Baldwin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: ldconfig -R issue (Was: Problems with icu - 3.8)

2008-02-27 Thread John Baldwin
On Monday 18 February 2008 05:10:24 am Sergey Matveychuk wrote:
> +stable@
> 
> Yoshihiro Ota wrote:
> > Why are so many people are bitten by this?  Is that the jobs of 
port-upgrading
> > tool to safe copy these libraries to compat so that all programs using
> > the old libraries works?
> 
> Portupgrade preserves the libraries in /usr/local/lib/compat/pkg:
> % ls /usr/local/lib/compat/pkg/
> libicudata.so.36.0  libicule.so.36.0libicuuc.so.36.0
> libicui18n.so.36.0  libiculx.so.36.0
> libicuio.so.36.0libicutu.so.36.0
> 
> ldconfig knows about the directory:
> % ldconfig -r | head -2
> /var/run/ld-elf.so.hints:
>  search directories: 
> /lib:/usr/lib:/usr/lib/compat:/usr/X11R6/lib:/usr/local/lib:/usr/local/lib/mysql:/usr/local/lib/compat/pkg
> 
> But ldconfig -R don't rescan them:
> % ldconfig -R
> % ldconfig -r | grep libicudata
>  231:-licudata.38 => /usr/X11R6/lib/libicudata.so.38
>  386:-licudata.38 => /usr/local/lib/libicudata.so.38
> 
> Could anybody investigate it? I have no time now.

Drop the .0 from the filenames or add libfoo.so.36 symlinks.

-- 
John Baldwin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


RE: Tar regression from 6.2 to 6.3 with --strip-components

2008-02-27 Thread Jan Mikkelsen
Tim Kientzle wrote:
> Please file this in a PR so I won't lose
> track.  I don't have time to investigate this
> right now, but I should be able to get to it
> sometime next week.

Filed as PR bin/121158.

Thanks!

Jan.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Problems with promise SATA300 TX2Plus

2008-02-27 Thread Jisakiel
Greetings. I was trying to install FreeBSD 7 on an old machine to make it a 
fileserver. Machine is an AMD K7 1200 on an Abit AN-7 mobo (nforce2 400), which 
has booted Freebsd 7.0 RC1 beforehand (for testing ZFS; only ACPI didn't work 
as it hung it).

I recently bought a PCI SATA card on the cheap variety, Promise SATA300 
TX2Plus, and a couple of 500GB Maxtors. Unfortunately, when any hard drive 
(both of the 500's and an older 250 Maxtor which I use) is plugged to the SATA 
ports of the pci card I get an instantaneous reboot when trying to boot the 
bootonly cd (it doesn't reach the bootloader). If no disks are plugged it works 
but hangs while booting (with ACPI disabled). 

Things I tried: 

- Booting another OS. Card works perfectly both in linux (2.6.24) and windows, 
with the two hard drives visible. 
- Leaving just one SATA drive connected to the whole system through the PCI 
card. It doesn't matter which drive I let, both the 250G and the 500G make it 
reboot. 
- Swapping the power cable. 
- Unplugging my former 2 drives to ease the load on the Enermax 365W power 
supply. It works in Windows though, with everything on... 
- Disabling both integrated SATA of the motherboard (SI3112 which works both in 
linux and freebsd), and integrated firewire (just in case). 
- Moving the PCI card to another slot. I only have another pci device, a 
soundcard, and moving it changed the bios boot order (first the pci, then the 
integrated sata or viceversa) with no change at all afterwards. 
- Trying 6.3 livecd. Same insta-reboot. 
- Updating the card's bios. There is no newer one, 1.0.0.34 is what came and 
what's on the promise web. 

Is there any way to make this card work on FreeBSD 7, or anything else that I 
could try? I bought it specifically for that; I might be in time to return it, 
though I'd have to buy another one on the same price range then (perhaps 
Promise FastTrak TX2300, HightPoint RocketRaid 1520 or Adaptec 1210SA which I 
seem to recall that doesn't work too well on linux). Otherwise I'd be more than 
willing to help debugging it ^^. 

Thanks everybody... 


[EMAIL PROTECTED]

   
-

¿Con Mascota por primera vez? - Sé un mejor Amigo
Entra en Yahoo! Respuestas.
  
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: [solved/workaround?] Re: em very slow, shared irq... on 6.3p8

2008-02-27 Thread Holger Kipp
On Wed, Feb 27, 2008 at 12:16:27PM -0800, Jack Vogel wrote:
> 
> Hmmm, something is really broken here that POLL just is just bypassing,
> what was the adapter type exactly (pciconf -l). Sorry, but I must have
> missed this email earlier.

Problem is with em0-em13 that many share an IRQ 16/17, and because I was using
6.2-p8 and therefore did not have MSI enabled/available. Polling was also not
compiled into the kernel.

Polling alone will give times between 0.2 and 1.3 ms (with HZ=1000), and
MSI alone (after activated during boot within loader.conf) will give
best results (around 0.4 to 0.5 ms for a ping through firewall and back
across two lans). This was then with 6.3-p1

Still, 6.3-p1 without msi and polling is still very slow in handling IRQs from
the em-nics.

I still don't know exactly why this took up to 1.5 seconds (sometimes even
more) without MSI or POLLING. Please see my previous emails for dmesg.

pciconf -l gives:

[EMAIL PROTECTED]:4:0:   class=0x02 card=0x11998086 chip=0x10b58086 
rev=0x03 hdr=0x00
[EMAIL PROTECTED]:4:1:   class=0x02 card=0x11998086 chip=0x10b58086 
rev=0x03 hdr=0x00
[EMAIL PROTECTED]:6:0:   class=0x02 card=0x11998086 chip=0x10b58086 
rev=0x03 hdr=0x00
[EMAIL PROTECTED]:6:1:   class=0x02 card=0x11998086 chip=0x10b58086 
rev=0x03 hdr=0x00
[EMAIL PROTECTED]:0:0:  class=0x02 card=0x10a48086 chip=0x10a48086 rev=0x06 
hdr=0x00
[EMAIL PROTECTED]:0:1:  class=0x02 card=0x10a48086 chip=0x10a48086 rev=0x06 
hdr=0x00
[EMAIL PROTECTED]:0:0:  class=0x02 card=0x10a48086 chip=0x10a48086 rev=0x06 
hdr=0x00
[EMAIL PROTECTED]:0:1:  class=0x02 card=0x10a48086 chip=0x10a48086 rev=0x06 
hdr=0x00
[EMAIL PROTECTED]:0:0:  class=0x02 card=0x10a48086 chip=0x10a48086 rev=0x06 
hdr=0x00
[EMAIL PROTECTED]:0:1:  class=0x02 card=0x10a48086 chip=0x10a48086 rev=0x06 
hdr=0x00
[EMAIL PROTECTED]:0:0: class=0x02 card=0x10a48086 chip=0x10a48086 rev=0x06 
hdr=0x00
[EMAIL PROTECTED]:0:1: class=0x02 card=0x10a48086 chip=0x10a48086 rev=0x06 
hdr=0x00
[EMAIL PROTECTED]:0:0: class=0x02 card=0x108c15d9 chip=0x108c8086 rev=0x03 
hdr=0x00
[EMAIL PROTECTED]:0:0: class=0x02 card=0x109a15d9 chip=0x109a8086 rev=0x00 
hdr=0x00

We have one quad PCI-X 64bit card (the first four) and two quad PCI-Express 
cards. 
The last two devices are on-board ports.

device = '82546GB PRO/1000 GT Quad Port Server Adapter'
device = '82546GB PRO/1000 GT Quad Port Server Adapter'
device = '82546GB PRO/1000 GT Quad Port Server Adapter'
device = '82546GB PRO/1000 GT Quad Port Server Adapter'
device = '82571EB Gigabit Ethernet Controller'
device = '82571EB Gigabit Ethernet Controller'
device = '82571EB Gigabit Ethernet Controller'
device = '82571EB Gigabit Ethernet Controller'
device = '82571EB Gigabit Ethernet Controller'
device = '82571EB Gigabit Ethernet Controller'
device = '82571EB Gigabit Ethernet Controller'
device = '82571EB Gigabit Ethernet Controller'
device = '82573E Intel Corporation 82573E Gigabit Ethernet Controller 
(Copper)'
device = '82573L Intel PRO/1000 PL Network Adaptor'

Is this helpful? Please let me know if you need anything else. 

Best regards,
Holger Kipp
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: [solved/workaround?] Re: em very slow, shared irq... on 6.3p8

2008-02-27 Thread Jack Vogel
On Wed, Feb 27, 2008 at 11:50 AM, Holger Kipp <[EMAIL PROTECTED]> wrote:
> On Wed, Feb 27, 2008 at 09:50:16AM -0500, Mike Tancsa wrote:
>
>  more details below. as it currently is, polling seems to do
>  the trick, however handling several em-interfaces with the
>  same irq (mind you, it is pci) shouldn't cause delays of
>  up to 1.5 seconds for a simple ping... Therefore I consider
>  using polling for a nearly idle system more a workaround
>  than a solution to this problem :-(
>
>  > At 05:49 AM 2/27/2008, Holger Kipp wrote:
>  >
>  > >I therefore assume that the problem is between receiving the
>  > >irq from em and getting the data from the interface on the firewall
>  > >itself.
>  >
>  > I would try upgrading to 6.3R (there are several em driver bug fixes)
>  done. system is now 6.3-RELEASE-p1 which also gave me -c option for pciconf
>  and msi syscontrols (were missing in the old 6.2).
>
>
>  > and then try the box with
>  > % cat /boot/loader.conf
>  > hw.pci.enable_msi=1
>  >
>  > ...if the cards support msi.
>  >
>  > I think pciconf -lvc should tell you if the cards and slots support it or
>  > not.
>
>  pciconf -lvc says for all em:
>  cap 05[d0] = MSI supports 1 message, 64 bit
>  so I assume they do support MSI.
>
>  with msi disabled I get
>
>  38 packets transmitted, 38 packets received, 0% packet loss
>  round-trip min/avg/max/stddev = 4.833/228.022/1539.337/339.768 ms
>
>  with msi enabled (via sysctl) I get
>
>  33 packets transmitted, 33 packets received, 0% packet loss
>  round-trip min/avg/max/stddev = 1.865/156.421/1339.841/239.375 ms
>
>  so looks equally bad (I don't consider 30-40 packets a meaningful sample).
>  I don't know if it makes any differences if switched on directly in
>  loader.conf, though.
>
>  enabling polling (withous MSI) gives
>
>  30 packets transmitted, 30 packets received, 0% packet loss
>  round-trip min/avg/max/stddev = 0.366/0.790/1.339/0.290 ms
>
>  (maybe I should have used HZ=2000 to keep it below 0.6ms ;-)
>
>  > Also, if you dont need IPV6, use FAST_IPSEC. It does not need
>  > mpsafe.  If you do need IPSEC and IPV6, 7.0R got rid of that restriction.
>
>  I think this enough changes in one go for a production system ;-)

Hmmm, something is really broken here that POLL just is just bypassing,
what was the adapter type exactly (pciconf -l). Sorry, but I must have
missed this email earlier.

Jack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: ad0 READ_DMA TIMEOUT errors on install of 7.0-RELEASE

2008-02-27 Thread Scott Long

Stephen Hurd wrote:


This shows you've had 4 reallocated sectors, meaning your disk does in
fact have bad blocks.  In 90% of the cases out there, bad blocks
continue to "grow" over time, due to whatever reason (I remember reading
an article explaining it, but I can't for the life of me find the URL).
  


This is unusual now?  I've always "known" that a small number of bad 
blocks is normal.  Time to readjust my knowledge again?


Modern drives hide bad sectors by keeping a pool of spare tracks and
automatically remapping bad sectors to that pool.  The problem lies in
when the drive has aged enough that it's run out of spares.



194 Temperature_Celsius 0x0032   253   253   000Old_age   
Always   -   48



This is excessive, and may be attributing to problems.  A hard disk
running at 48C is not a good sign.  This should really be somewhere
between high 20s and mid 30s.
  


Yeah, this is a known problem with this drive... it's been running hot 
for years.  I always figured it was due to the rotational speed increase 
in commodity drives.


48C is high, but I wouldn't consider it excessive.  Drives that start 
generating "excessive" heat tend to fail shortly thereafter.  I do agree 
that the heat is probably shortening the lifespan on the drive.




Error 2 occurred at disk power-on lifetime: 5171 hours (215 days + 11 
hours)
  When the command that caused the error occurred, the device was in 
an unknown state.
Error 1 occurred at disk power-on lifetime: 5171 hours (215 days + 11 
hours)
  When the command that caused the error occurred, the device was in 
an unknown state.



These are automated SMART log entries confirming the DMA failures.  The
fact that SMART saw them means that the disk is also aware of said
issues.  These may have been caused by the reallocated sectors.  It's
also interesting that the LBAs are different than the ones FreeBSD
reported issues with.
  


If that power on lifetime is accurate, that was at least a year ago... 
but I can't find any documentation as to when the power-on lifetime 
wraps or what it actually indicates.  I'm assuming that it is total 
power on time since the drive was manufactured.  If it's total hours as 
a 16-bit integer, it shouldn't wrap.  Is there a way of getting the 
"current" power-on lifetime value that you're aware of?  That power on 
minutes is interesting, but its current value is lower than the value at 
the error (but higher than the power uptime of the system):
 9 Power_On_Minutes0x0032   219   219   000Old_age   
Always   -   1061h+40m


Also interesting is that after getting more errors from FreeBSD, I did 
not get more errors in smartctl.




The errors you're getting from FreeBSD have nothing to do directly with
SMART.  The driver thinks that commands are timing out and that the
drive is becoming unresponsive.  Whether they actually are is another
question.  Given that this problem changes behavior with the version of
FreeBSD that you're running (and even happens in completely virtual
environments like vmware) I'm betting that it's a driver problem and not
a hardware problem, though you should probably think about migrating
your data off to a new drive sometime soon.

I'd like to attack these driver problems.  What I need is to spend a
couple of days with an affected system that can reliably reproduce the
problem, instrumenting and testing the driver.  I have a number of
theories about what might be going wrong, but nothing that I'm
definitely sure about.  If you are willing to set up your system with
remote power and remote serial, and if we knew a reliable way to
reproduce the problem, I could probably have the problem identified and
fixed pretty quickly.

Scott
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: [solved!] Re: em very slow, shared irq... on 6.2p8

2008-02-27 Thread Mike Tancsa

At 03:09 PM 2/27/2008, Holger Kipp wrote:

On Wed, Feb 27, 2008 at 08:50:58PM +0100, Holger Kipp wrote:
> On Wed, Feb 27, 2008 at 09:50:16AM -0500, Mike Tancsa wrote:
>
> more details below. as it currently is, polling seems to do
> the trick, however handling several em-interfaces with the
> same irq (mind you, it is pci) shouldn't cause delays of
> up to 1.5 seconds for a simple ping... Therefore I consider
> using polling for a nearly idle system more a workaround
> than a solution to this problem :-(
[...]
> with msi enabled (via sysctl) I get
>
> 33 packets transmitted, 33 packets received, 0% packet loss
> round-trip min/avg/max/stddev = 1.865/156.421/1339.841/239.375 ms
>
> so looks equally bad (I don't consider 30-40 packets a meaningful sample).
> I don't know if it makes any differences if switched on directly in
> loader.conf, though.

have now activated msi in loader.conf and get very good results again.

38 packets transmitted, 38 packets received, 0% packet loss
round-trip min/avg/max/stddev = 0.291/0.425/0.595/0.067 ms

without polling activated. So it was msi after all I needed here.
Maybe this should go into docu for em or ifconfig?


Hi,
Yes, sorry I should have mentioned, you need to reboot.  But 
I strongly suggest upgrading to 6.3R as there are a number of em bugs 
that are fixed Perhaps some IRQ issues as well.  But for MSI in 
general, I think the Intel guy recommended running that way for the NIC.


---Mike 


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: [solved!] Re: em very slow, shared irq... on 6.2p8

2008-02-27 Thread Holger Kipp
On Wed, Feb 27, 2008 at 08:50:58PM +0100, Holger Kipp wrote:
> On Wed, Feb 27, 2008 at 09:50:16AM -0500, Mike Tancsa wrote:
> 
> more details below. as it currently is, polling seems to do
> the trick, however handling several em-interfaces with the
> same irq (mind you, it is pci) shouldn't cause delays of
> up to 1.5 seconds for a simple ping... Therefore I consider
> using polling for a nearly idle system more a workaround
> than a solution to this problem :-(
[...]
> with msi enabled (via sysctl) I get
> 
> 33 packets transmitted, 33 packets received, 0% packet loss
> round-trip min/avg/max/stddev = 1.865/156.421/1339.841/239.375 ms
> 
> so looks equally bad (I don't consider 30-40 packets a meaningful sample).
> I don't know if it makes any differences if switched on directly in
> loader.conf, though.

have now activated msi in loader.conf and get very good results again.

38 packets transmitted, 38 packets received, 0% packet loss
round-trip min/avg/max/stddev = 0.291/0.425/0.595/0.067 ms

without polling activated. So it was msi after all I needed here.
Maybe this should go into docu for em or ifconfig?

> enabling polling (withous MSI) gives
> 
> 30 packets transmitted, 30 packets received, 0% packet loss
> round-trip min/avg/max/stddev = 0.366/0.790/1.339/0.290 ms

this is still the same with msi activated in loader.conf

Best regards,
Holger Kipp
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: fsck_ufs: cannot alloc 94208 bytes for inoinfo

2008-02-27 Thread Matthew Dillon
 fsck's memory usage is directly related to the number of inodes and
 the number of directories in the filesystem.  Directories are
 particularly memory intensive.

 I've found on my backup system that a UFS1 filesystem with 40 million
 inodes is about the limit that can be fsck'd (at least with a 32 bit
 architecture).  My cron jobs keep my backup partition below that point.
 Even in a 64 bit environment you will be limited by swap and the sheer
 time it takes for fsck to run.  It takes well over 8 hours for my
 backup system to fsck.

 You can also reduce fsck time by reducing the number of cylinder
 groups on the disk.  I usually max them out (-c 999 and newfs then
 sets it to the maximum, usually in the 50-80 range).  This will
 improve performance but not reduce the memory required.

-Matt

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


[solved/workaround?] Re: em very slow, shared irq... on 6.3p8

2008-02-27 Thread Holger Kipp
On Wed, Feb 27, 2008 at 09:50:16AM -0500, Mike Tancsa wrote:

more details below. as it currently is, polling seems to do
the trick, however handling several em-interfaces with the
same irq (mind you, it is pci) shouldn't cause delays of
up to 1.5 seconds for a simple ping... Therefore I consider
using polling for a nearly idle system more a workaround
than a solution to this problem :-(

> At 05:49 AM 2/27/2008, Holger Kipp wrote:
> 
> >I therefore assume that the problem is between receiving the
> >irq from em and getting the data from the interface on the firewall
> >itself.
> 
> I would try upgrading to 6.3R (there are several em driver bug fixes) 
done. system is now 6.3-RELEASE-p1 which also gave me -c option for pciconf
and msi syscontrols (were missing in the old 6.2).


> and then try the box with
> % cat /boot/loader.conf
> hw.pci.enable_msi=1
> 
> ...if the cards support msi.
> 
> I think pciconf -lvc should tell you if the cards and slots support it or 
> not.

pciconf -lvc says for all em:
cap 05[d0] = MSI supports 1 message, 64 bit
so I assume they do support MSI.

with msi disabled I get

38 packets transmitted, 38 packets received, 0% packet loss
round-trip min/avg/max/stddev = 4.833/228.022/1539.337/339.768 ms

with msi enabled (via sysctl) I get

33 packets transmitted, 33 packets received, 0% packet loss
round-trip min/avg/max/stddev = 1.865/156.421/1339.841/239.375 ms

so looks equally bad (I don't consider 30-40 packets a meaningful sample).
I don't know if it makes any differences if switched on directly in
loader.conf, though.

enabling polling (withous MSI) gives

30 packets transmitted, 30 packets received, 0% packet loss
round-trip min/avg/max/stddev = 0.366/0.790/1.339/0.290 ms

(maybe I should have used HZ=2000 to keep it below 0.6ms ;-)

> Also, if you dont need IPV6, use FAST_IPSEC. It does not need 
> mpsafe.  If you do need IPSEC and IPV6, 7.0R got rid of that restriction.

I think this enough changes in one go for a production system ;-)

many thanks for the recommendations!

Best regards,
Holger
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: ad0 READ_DMA TIMEOUT errors on install of 7.0-RELEASE

2008-02-27 Thread Alex Zbyslaw

Stephen Hurd wrote:


Jeremy Chadwick wrote:


SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG VALUE WORST THRESH TYPE  
UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   253   253   063Pre-fail  
Always   -   4 


This shows you've had 4 reallocated sectors, meaning your disk does in
fact have bad blocks.  In 90% of the cases out there, bad blocks
continue to "grow" over time, due to whatever reason (I remember reading
an article explaining it, but I can't for the life of me find the URL).  



This is unusual now?  I've always "known" that a small number of bad 
blocks is normal.  Time to readjust my knowledge again?


I have bought disks where the value of Reallocated_Sector_Ct was not 0, 
at least by the time I looked at it with smartctl.  Nothing bad has 
happened to those disks in several years (hope that's not tempting fate).


I have always assumed that what matters is when this value *changes*.  
If it's not changing, who cares?  smartd will monitor disks and email 
you when certain attributes change (e.g. Pre-fail attributes like 
Reallocated_Sector_Ct).  If it changed, it would mean that an attempt to 
write data  had failed and that reallocation had happened.


e.g. from smartd.conf

/dev/ad4 -o on -S on -a -m root -M daily

If your Current_Pending_Sector were non-zero you'd be in trouble, I believe.

0.02, pinch of salt, not an expert, slippery when hot, long time since I 
read the specs, etc etc.


--Alex


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: Tar regression from 6.2 to 6.3 with --strip-components

2008-02-27 Thread Tim Kientzle

I've just noticed a regression in tar from 6.2 to 6.3:

Running this on 6.2 produces no output:

#!/bin/sh
mkdir -p a b output
touch a/file1 b/file2
tar cf test.tar a b
tar -x -C output --strip-components 1 -f test.tar

On 6.3, it produces this output:

: Invalid empty pathname
: Invalid empty pathname
tar: Error exit delayed from previous errors.


Please file this in a PR so I won't lose
track.  I don't have time to investigate this
right now, but I should be able to get to it
sometime next week.

Tim
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: ad0 READ_DMA TIMEOUT errors on install of 7.0-RELEASE

2008-02-27 Thread Jeremy Chadwick
On Wed, Feb 27, 2008 at 10:32:48AM -0800, Stephen Hurd wrote:
> Booting the 6.3-RELEASE CD seems to make the problem go away... possibly 
> 7.0 stresses the HD more?

We don't know.  The author of the ATA subsystem is somewhat MIA, likely
busy with real-life things (jobs, etc.).  My main point was that you're
not alone with DMA timeouts and other oddities, but the reallocated
sector count being non-zero doesn't permit me to say "Yeah, you're
experiencing what others are".

>>> SMART Attributes Data Structure revision number: 16
>>> Vendor Specific SMART Attributes with Thresholds:
>>> ID# ATTRIBUTE_NAME  FLAG VALUE WORST THRESH TYPE  UPDATED 
>>>  WHEN_FAILED RAW_VALUE
>>>   5 Reallocated_Sector_Ct   0x0033   253   253   063Pre-fail  Always  
>>>  -   4
>>
>> This shows you've had 4 reallocated sectors, meaning your disk does in
>> fact have bad blocks.  In 90% of the cases out there, bad blocks
>> continue to "grow" over time, due to whatever reason (I remember reading
>> an article explaining it, but I can't for the life of me find the URL).
>
> This is unusual now?  I've always "known" that a small number of bad blocks 
> is normal.  Time to readjust my knowledge again?

This isn't normal.  The realloc sector count in SMART, when a disk comes
out of the factory, is zero.  That number increases only when new
defects are found, and when those sectors are remapped to spares which
are available (there is a limited number of spares).  This is also
called a "grown defect list".

This isn't to be confused with what's called a "physical defect list",
which are known sectors/LBAs which are bad, straight out of the factory.
On ATA disks, the manufacturer stores the list in the drive and its not
modifiable via formatting or even a BIOS-based format (e.g. a SATA RAID
controller); some vendors do implement "low level formatting" via
undocumented ATA commands, which can erase that list, but that's besides
the point.  On SCSI disks, the physical defect list is readable and also
erasable via a low-level format, but SCSI disks also have a grown defect
list which is separate.

What I'm trying to say is that your disk already has 4 bad blocks that
the disk firmware itself is aware of, which means chances are there are
others which it hasn't figured out.  A high number of ECCs could
indicate that as well.

>>> 194 Temperature_Celsius 0x0032   253   253   000Old_age   Always  
>>>  -   48
>>
>> This is excessive, and may be attributing to problems.  A hard disk
>> running at 48C is not a good sign.  This should really be somewhere
>> between high 20s and mid 30s.
>
> Yeah, this is a known problem with this drive... it's been running hot for 
> years.  I always figured it was due to the rotational speed increase in 
> commodity drives.

7200rpm disks shouldn't be running at 48C.  None of my 7200rpm disks, in
my barely-cooled FreeBSD box at home (e.g. two 1100rpm fans and that's
it) get anywhere near that.  36C is the highest they've seen -- and
there's 4 stacked right on top of one another.

Heck, on my disks, the SMART warning threshold (set by the manufacturer,
which is Western Digital) is 45C.

10krpm disks probably run hotter, but are not commodity.

>>> Error 2 occurred at disk power-on lifetime: 5171 hours (215 days + 11 
>>> hours)
>>>   When the command that caused the error occurred, the device was in an 
>>> unknown state.
>>> Error 1 occurred at disk power-on lifetime: 5171 hours (215 days + 11 
>>> hours)
>>>   When the command that caused the error occurred, the device was in an 
>>> unknown state.
>>
>> These are automated SMART log entries confirming the DMA failures.  The
>> fact that SMART saw them means that the disk is also aware of said
>> issues.  These may have been caused by the reallocated sectors.  It's
>> also interesting that the LBAs are different than the ones FreeBSD
>> reported issues with.
>
> If that power on lifetime is accurate, that was at least a year ago... but 
> I can't find any documentation as to when the power-on lifetime wraps or 
> what it actually indicates.  I'm assuming that it is total power on time 
> since the drive was manufactured.

Correct: it indicates how many hours the drive itself has been powered
on as an aggregate total.  E.g. if powered on for 48 hours, then shut
off for 3 hours, then powered on for another 7, the stat would read 55
hours.

> If it's total hours as a 16-bit integer, it shouldn't wrap.  Is there a way
> of getting the "current" power-on lifetime value that you're aware of?

I would have to go look at the SMART extension to ATA/SATA and find out
how large the counter is.  It probably varies from vendor to vendor too,
as SMART, despite being a standard, has a lot of "loose ends" in the
specification which vendors take advantage of.

> That power on minutes is interesting, but its current value is lower
> than the value at the error (but higher than the power uptime of the
> system):
>  9 Power_On_Minutes

Re: Documentation of NO_* knobs

2008-02-27 Thread Patrick M. Hausen
Hello,

On Wed, Feb 27, 2008 at 09:18:31AM -0800, Jeremy Chadwick wrote:

> I think you're looking for all the WITHOUT knobs in src.conf(5).

I'm running RELENG_6_3 on production machines unlikely to change
any time real soon.

> Note that src.conf(5) does not apply to RELENG_6 or earlier, where the
> knobs are named NO_xxx.  You can get a list of those from
> /usr/share/examples/etc/make.conf.

I knew about that file. NO_SYSCONS and NO_EXAMPLES are neither
in this one nor in the manpage for make.conf.
Yet they are in the handbook example for NanoBSD builds.

That's why I'm asking for the "official" exhaustive list.

Thanks for taking the time to answer,
Patrick
-- 
punkt.de GmbH * Vorholzstr. 25 * 76137 Karlsruhe
Tel. 0721 9109 0 * Fax 0721 9109 100
[EMAIL PROTECTED]   http://www.punkt.de
Gf: Jürgen Egeling  AG Mannheim 108285
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: fsck_ufs: cannot alloc 94208 bytes for inoinfo

2008-02-27 Thread Oliver Fromme
Olivier Mueller wrote:
 > "fsck_ufs: cannot alloc 94208 bytes for inoinfo"
 > 
 > This is what I get after about one hour while trying a fsck on a large
 > (1.4TB) partition "broken" since a power outage.  
 > 
 > HW: HP DL380G5, under freebsd 6.1/i386, with 1GB of RAM and: 

Your fsck will need roughly 1 GB memory per 1 TB file
system size.  That formular was posted some time ago on
the -fs mailing list.  It only applies with the default
newfs parameters -- if you used other parameters (inode
density, bsize/fsize), fsck's memory requirements are
different.

 > Now I added this to the /boot/loader.conf:
 > kern.maxdsiz="1073741824" # 1GB
 > kern.dfldsiz="1073741824" # 1GB
 > and I'm trying the fsck again, but I'm not sure it will help.

Given the above formula, it's probably not enough, so you
might have to increase it further.  Also make sure that
you have enough swap space.

If you have a spare box, it might be helpful to transplant
some RAM from it so fsck doesn't have to swap.

Best regards
   Oliver

-- 
Oliver Fromme, secnetix GmbH & Co. KG, Marktplatz 29, 85567 Grafing b. M.
Handelsregister: Registergericht Muenchen, HRA 74606,  Geschäftsfuehrung:
secnetix Verwaltungsgesellsch. mbH, Handelsregister: Registergericht Mün-
chen, HRB 125758,  Geschäftsführer: Maik Bachmann, Olaf Erb, Ralf Gebhart

FreeBSD-Dienstleistungen, -Produkte und mehr:  http://www.secnetix.de/bsd

"If you think C++ is not overly complicated, just what is a protected
abstract virtual base pure virtual private destructor, and when was the
last time you needed one?"
-- Tom Cargil, C++ Journal
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: ad0 READ_DMA TIMEOUT errors on install of 7.0-RELEASE

2008-02-27 Thread Stephen Hurd

Jeremy Chadwick wrote:

And after the reboot, the READ_DMA timeouts were back.



You're not the only one seeing this behaviour.  There are too many posts
in the past reporting similar.  Here's the breakdown:

* Some have switched to alternate operating systems (usually Linux) for
  a short while and seen no sign of DMA timeouts.
  


Booting the 6.3-RELEASE CD seems to make the problem go away... possibly 
7.0 stresses the HD more?



However: in your case, your disk does look to have problems based on the
SMART output you provided.  It does not matter how new/old the disk is,
by the way.  I'll point out the problematic stats.  You need to replace
the disk ASAP.
  


Yeah, that's pretty much what I figured, the timing (ie: the moment I 
boot 7.0-RELEASE) is the only bit that seems fishy.  This HD has been 
powered on pretty much continuously for around three years.  Given that 
it's a Maxtor, I'm honestly a bit surprised that it's lasted as well as 
it has.



SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG VALUE WORST THRESH TYPE  UPDATED  
WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   253   253   063Pre-fail  Always   
-   4



This shows you've had 4 reallocated sectors, meaning your disk does in
fact have bad blocks.  In 90% of the cases out there, bad blocks
continue to "grow" over time, due to whatever reason (I remember reading
an article explaining it, but I can't for the life of me find the URL).
  


This is unusual now?  I've always "known" that a small number of bad 
blocks is normal.  Time to readjust my knowledge again?



194 Temperature_Celsius 0x0032   253   253   000Old_age   Always   
-   48



This is excessive, and may be attributing to problems.  A hard disk
running at 48C is not a good sign.  This should really be somewhere
between high 20s and mid 30s.
  


Yeah, this is a known problem with this drive... it's been running hot 
for years.  I always figured it was due to the rotational speed increase 
in commodity drives.



Error 2 occurred at disk power-on lifetime: 5171 hours (215 days + 11 hours)
  When the command that caused the error occurred, the device was in an unknown 
state.
Error 1 occurred at disk power-on lifetime: 5171 hours (215 days + 11 hours)
  When the command that caused the error occurred, the device was in an unknown 
state.



These are automated SMART log entries confirming the DMA failures.  The
fact that SMART saw them means that the disk is also aware of said
issues.  These may have been caused by the reallocated sectors.  It's
also interesting that the LBAs are different than the ones FreeBSD
reported issues with.
  


If that power on lifetime is accurate, that was at least a year ago... 
but I can't find any documentation as to when the power-on lifetime 
wraps or what it actually indicates.  I'm assuming that it is total 
power on time since the drive was manufactured.  If it's total hours as 
a 16-bit integer, it shouldn't wrap.  Is there a way of getting the 
"current" power-on lifetime value that you're aware of?  That power on 
minutes is interesting, but its current value is lower than the value at 
the error (but higher than the power uptime of the system):
 9 Power_On_Minutes0x0032   219   219   000Old_age   
Always   -   1061h+40m


Also interesting is that after getting more errors from FreeBSD, I did 
not get more errors in smartctl.



My advice to you is: replace the disk ASAP.  This problem will only get
worse.  Try another hard disk brand too (I don't have anything "against"
Maxtor, but usually its recommended to avoid a brand you have problems
with until the next time you have issues, then switch brands, etc.
etc...).  I'm very fond of Western Digital's SE16, RE, and RE2 series
currently.  But avoid Fujitsu and Samsung (both have a long track record
of having buggy drive firmwares, forcing vendors to make custom
workarounds for issues); stick with Seagate, Western Digital, or Maxtor.
  


Yeah, that's my plan... but I wanted to stake out some whining rights in 
advance so I can do the "But you said it was a bad HD or cable!  Now I'm 
out $x00 and my system still doesn't work!  Help me or I switch to 
DragonFly BSD/Desktop BSD/Linux which is perfect and has no problems!" 
thing.  Then go on Slashdot and post long rambling messages about how 
FreeBSD is dead and it doesn't matter than the manpages on any given 
Linux box are useless.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: Documentation of NO_* knobs

2008-02-27 Thread Jeremy Chadwick
On Wed, Feb 27, 2008 at 05:32:55PM +0100, Patrick M. Hausen wrote:
> is there an exhaustive list of all possible NO_* knobs
> for make.conf? While experimenting with NanoBSD I found
> e.g. that the handbook
> - just two out of many. Yet, these are not in the manpage of
> make.conf(5) on a 6.3-RELEASE. So, where did the handbook
> author find them?

I think you're looking for all the WITHOUT knobs in src.conf(5).

Starting with RELENG_7, all of the NO knobs for removal of features in
the base system were moved into /etc/src.conf and renamed to WITHOUT.
Some may also have changed names, so look closely.

Note that src.conf(5) does not apply to RELENG_6 or earlier, where the
knobs are named NO_xxx.  You can get a list of those from
/usr/share/examples/etc/make.conf.

Hope this helps.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Documentation of NO_* knobs

2008-02-27 Thread Patrick M. Hausen
Hi, all,

is there an exhaustive list of all possible NO_* knobs
for make.conf? While experimenting with NanoBSD I found
e.g. that the handbook

http://www.freebsd.org/doc/en_US.ISO8859-1/articles/nanobsd/index.html

mentions

NO_EXAMPLES
NO_SYSCONS
...

- just two out of many. Yet, these are not in the manpage of
make.conf(5) on a 6.3-RELEASE. So, where did the handbook
author find them?

Thanks,
Patrick
-- 
punkt.de GmbH * Vorholzstr. 25 * 76137 Karlsruhe
Tel. 0721 9109 0 * Fax 0721 9109 100
[EMAIL PROTECTED]   http://www.punkt.de
Gf: Jürgen Egeling  AG Mannheim 108285
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: 7.0-PRERELEASE Fatal Trap 12 with sysctl and acpi

2008-02-27 Thread Jim Pingle

Jim Pingle wrote:

Jim Pingle wrote:
I'm having some trouble with a SuperMicro SuperServer 6022L-6 that 
previously ran 7.0-BETA4 without problems. Today, I updated this 
machine to 7.0-PRERELEASE and now it will not fully boot unless I 
disable ACPI. A quick search of the PR database didn't turn up 
anything similar with sysctl and ACPI.


I wiped the machine, installed from the RC3 CD, and it did not crash. If I 
update to RELENG_7, the crash comes back. If I go back to RELENG_7_0, there 
is no crash.



Kernel config is GENERIC, with ULE scheduler and "options ASR_COMPAT"


This happens with GENERIC, with no extra options, as well as with my custom 
kernel.


If I get some time next week I might try a binary search of commits 
between BETA4 and now, to pinpoint where it stopped working.


As a buildworld/buildkernel takes about an hour and a half on this 
hardware (2x2GHz Xeon), I haven't fully narrowed this down yet. It is 
somewhere between 12/15/2007 (works) and 12/25/2007 (crashes). I glanced 
at the archives between those points but I didn't see any similar 
complaints. The only ACPI references I saw in the archives were 
referring to thermal zone problems, and a commit relating to those.


I'll return to this early next week to see if I can narrow this down 
more precisely.


I tried a binary search of the source tree to narrow down the crash. I found 
that one vector for the crash was introduced between 2007/12/19 20:00:00 and 
2007/12/19 23:59:00, which left me with only a handful of files to test.


By process of elimination, I found that if I backed some changes out in 
machdep.c, the crash stopped.


machdep.c v1.658 2007/08/09 njl - Boots OK
machdep.c v1.658.2.1 2007/12/19 rpaulo - Crashes

The confusing part (to me) is that my next step was to update all the way to 
RELENG_7 as of yesterday, then back out those same changes, but the crash 
still happened. So either I misidentified the cause of the crash -- which is 
quite possible -- or it was reintroduced in some other change (or both!).


I have a debug kernel built now, and I can generate vmcore files at will. 
Does anyone have any ideas? Is there some more information that I can gather 
that will help find the cause?


Now that I have some more solid information, I'll open a PR.

Jim
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: ntpd fails to synchronize on FreeBSD 6.3-STABLE

2008-02-27 Thread Jeremy Chadwick
On Wed, Feb 27, 2008 at 09:58:28PM +0700, Pongthep Kulkrisada wrote:
> [EMAIL PROTECTED]:~#  /etc/rc.d/ntpdate start
> Setting date via ntp.
> 27 Feb 20:46:53 ntpdate[2000]: no server suitable for synchronization found
> [EMAIL PROTECTED]:~#  tcpdump -l -n -s 8192 -p "port 123"
> tcpdump: verbose output suppressed, use -v or -vv for full protocol decode
> listening on fxp0, link-type EN10MB (Ethernet), capture size 8192 bytes
> 20:51:46.149541 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 20:51:47.149369 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 20:51:48.149192 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 20:52:50.148777 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 20:52:50.148818 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 20:52:54.149147 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 20:53:53.149127 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 20:53:56.148700 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 20:53:57.149545 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 20:54:56.149586 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 20:55:02.149701 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 20:55:02.149749 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 20:56:00.148838 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 20:56:05.149070 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 20:56:07.148751 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 20:57:06.148789 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 20:57:11.148992 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 20:57:13.148718 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 20:58:10.149016 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 20:58:17.148954 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 20:58:17.148997 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 20:59:14.149296 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 20:59:22.149048 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 20:59:23.148886 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 21:00:19.149376 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 21:00:26.149309 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 21:00:29.148856 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 21:01:23.149634 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 21:01:30.149579 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 21:01:33.149117 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 21:02:29.149586 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 21:02:35.148637 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 21:02:37.149400 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 21:03:32.149004 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 21:03:40.148796 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 21:03:41.149618 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 21:04:35.149397 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 21:04:45.148898 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 21:04:46.148714 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 21:05:39.149665 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 21:05:50.148985 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 21:05:50.149032 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 21:06:44.148776 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 21:06:54.149246 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 21:06:56.148916 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 21:07:49.148879 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> 21:07:58.149478 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
> length 48
> 21:08:00.149183 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, 
> length 48
> 21:09:56.149530 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 
> 48
> ^C
> 49 packets captured
> 230 packets received by filter
> 0 packets dropped by kernel

You're not getting responses back from __any__ of those NTP servers.  If
you have a firewall *in front* of your BSD box (meaning a separate box,
not ipfw/ipfilter/pf on the same BSD box!), then this i

Re: ntpd fails to synchronize on FreeBSD 6.3-STABLE

2008-02-27 Thread Pongthep Kulkrisada
> This isn't enough time.  Please try this instead.
> 
> # /etc/rc.d/ntpd stop
> # /etc/rc.d/ntpdate start
> 
> This should set your clock, even if only by a few milliseconds.
> Assuming the ntpdate part is successful, continue on:
> 
> # tcpdump -l -n -s 8192 -p "port 123"
> 
> Now, in another window, execute:
> 
> # /etc/rc.d/ntpd start
> 
> Then let the tcpdump go for about 15 minutes.  You aren't using the
> "iburst" feature on any of the servers, so it will take some time before
> they try to sync up.
Alright, here is the output.

Script started on Wed Feb 27 20:46:19 2008
[EMAIL PROTECTED]:~#/etc/rc.c/ntpd stop
Stopping ntpd.
[EMAIL PROTECTED]:~#/etc/rc.d/ntpdate start
Setting date via ntp.
27 Feb 20:46:53 ntpdate[2000]: no server suitable for synchronization found
[EMAIL PROTECTED]:~#tcpdump -l -n -s 8192 -p "port 123"
tcpdump: verbose output suppressed, use -v or -vv for full protocol decode
listening on fxp0, link-type EN10MB (Ethernet), capture size 8192 bytes
20:51:46.149541 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
20:51:47.149369 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
20:51:48.149192 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
20:52:50.148777 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
20:52:50.148818 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
20:52:54.149147 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
20:53:53.149127 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
20:53:56.148700 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
20:53:57.149545 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
20:54:56.149586 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
20:55:02.149701 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
20:55:02.149749 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
20:56:00.148838 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
20:56:05.149070 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
20:56:07.148751 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
20:57:06.148789 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
20:57:11.148992 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
20:57:13.148718 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
20:58:10.149016 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
20:58:17.148954 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
20:58:17.148997 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
20:59:14.149296 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
20:59:22.149048 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
20:59:23.148886 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
21:00:19.149376 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
21:00:26.149309 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
21:00:29.148856 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
21:01:23.149634 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
21:01:30.149579 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
21:01:33.149117 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
21:02:29.149586 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
21:02:35.148637 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
21:02:37.149400 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
21:03:32.149004 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
21:03:40.148796 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
21:03:41.149618 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
21:04:35.149397 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
21:04:45.148898 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
21:04:46.148714 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
21:05:39.149665 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
21:05:50.148985 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
21:05:50.149032 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
21:06:44.148776 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
21:06:54.149246 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
21:06:56.148916 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
21:07:49.148879 IP 192.168.1.10.123 > 202.73.37.27.123: NTPv4, Client, length 48
21:07:58.149478 IP 192.168.1.10.123 > 133.243.238.163.123: NTPv4, Client, 
length 48
21:08:00.149183 IP 192.168.1.10.123 > 122.154.11.67.123: NTPv4, Client, length 
48
21:09:56.149530 IP

Re: em very slow, shared irq... on 6.3p8

2008-02-27 Thread Mike Tancsa

At 05:49 AM 2/27/2008, Holger Kipp wrote:


I therefore assume that the problem is between receiving the
irq from em and getting the data from the interface on the firewall
itself.


I would try upgrading to 6.3R (there are several em driver bug fixes) 
and then try the box with

% cat /boot/loader.conf
hw.pci.enable_msi=1

...if the cards support msi.

I think pciconf -lvc should tell you if the cards and slots support it or not.

Also, if you dont need IPV6, use FAST_IPSEC. It does not need 
mpsafe.  If you do need IPSEC and IPV6, 7.0R got rid of that restriction.


---Mike 


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: Tar regression from 6.2 to 6.3 with --strip-components

2008-02-27 Thread Rink Springer
Hi,

On Thu, Feb 28, 2008 at 12:09:50AM +1100, Jan Mikkelsen wrote:
> And the tar extraction returns a failure.

I can confirm this does not work on 8-CURRENT either.

> Is this known?  Should I raise a PR?

That seems a good idea to me. Thanks!

-- 
Rink P.W. Springer- http://rink.nu
"Anyway boys, this is America. Just because you get more votes doesn't
 mean you win." - Fox Mulder
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: Tar regression from 6.2 to 6.3 with --strip-components

2008-02-27 Thread Kris Kennaway

Jan Mikkelsen wrote:

Hi,

I've just noticed a regression in tar from 6.2 to 6.3:

Running this on 6.2 produces no output:

#!/bin/sh
mkdir -p a b output
touch a/file1 b/file2
tar cf test.tar a b
tar -x -C output --strip-components 1 -f test.tar

On 6.3, it produces this output:

: Invalid empty pathname
: Invalid empty pathname
tar: Error exit delayed from previous errors.

And the tar extraction returns a failure.

Is this known?  Should I raise a PR?


Let's see what Tim has to say.

Kris
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Tar regression from 6.2 to 6.3 with --strip-components

2008-02-27 Thread Jan Mikkelsen
Hi,

I've just noticed a regression in tar from 6.2 to 6.3:

Running this on 6.2 produces no output:

#!/bin/sh
mkdir -p a b output
touch a/file1 b/file2
tar cf test.tar a b
tar -x -C output --strip-components 1 -f test.tar

On 6.3, it produces this output:

: Invalid empty pathname
: Invalid empty pathname
tar: Error exit delayed from previous errors.

And the tar extraction returns a failure.

Is this known?  Should I raise a PR?

Regards,

Jan Mikkelsen

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: em very slow, shared irq... on 6.3p8

2008-02-27 Thread JoaoBR
On Wednesday 27 February 2008 07:49:42 Holger Kipp wrote:
> Hello,
>
> I updated a system with 12 dc-interfaces to a new hardware
> with 14 em-interfaces. Yes, it is a firewall.
> New System is 6.2-RELEASE-p8.
>
> What I now experience between two internal networks (100MBit/s each)
> is the following:
> 1318 packets transmitted, 1317 packets received, 0% packet loss
> round-trip min/avg/max/stddev = 0.387/246.153/2441.392/324.142 ms
>
> tcpdump on the firewall shows similar delays (on the outgoing
> interface).
>
> tcpdump on the system I ping however shows very quick responses
> for incoming packages (ie usually less than a millisecond).
>
> I therefore assume that the problem is between receiving the
> irq from em and getting the data from the interface on the firewall
> itself.
>
> My first option would be to activate polling on em-interfaces - but as
> I did not experience this sort of notieceable slowdown with the old
> dc-based firewall (without polling), maybe someone can shed some light
> on this strange behaviour or has other suggestions as well?



I had a setup with 4 (dlink 4port + 1 nic nfe onboard) which run extremely 
stable 6.3

I upgraded the hardware (S939 -> AM2) and used two em 2port cards, without 
polling it was certanly unusable but with polling I got very good performance

without polling normally in a day or less the machine hung, no msg, simply 
freezed

with polling it stands some days up to two weeks when it freeze again

I upgraded to 7.0 same result

check your setup with vmstat -i and if you see two nics on the same interrupt 
I guess you get the same result as I got

actual I am running one em 2port and two single port pci cards what seems to 
be stable

anyway, similare setup on Tyan MBs do not have this problem where I also have 
8 nics on one system

I also do not have this problems on S939 boards only on AM2, so I am not sure 
where the problem is exactly but seems in a certain way hardware related

You could try changing your PS unit because you might be short on power with 
lots of em cards and SATA disks




-- 

João







A mensagem foi scaneada pelo sistema de e-mail e pode ser considerada segura.
Service fornecido pelo Datacenter Matik  https://datacenter.matik.com.br
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: panic in ufs_lookup (6.2-STABLE)

2008-02-27 Thread Ivan Voras
Andrew N. Below wrote:

> RELENG_6 cvsuped at 2007-01-15
> 
> what should I check (source revisions) to ensure we have same bug?

This is a quite old version of FreeBSD. I don't know for sure if the bug
was fixed by that time, but you should update to newest RELENG_6 or
RELENG_6_3 to make sure.

Alternatively, something might be corrupting the file system data. Maybe
you should experiment with a few rounds of rsync followed by md5 on both
the source and destination file systems.



signature.asc
Description: OpenPGP digital signature


Re: ad0 READ_DMA TIMEOUT errors on install of 7.0-RELEASE

2008-02-27 Thread Jeremy Chadwick
On Wed, Feb 27, 2008 at 01:11:36AM -0800, Stephen Hurd wrote:
> ...  The corrupted sync message scared the heck out of me:
> Waiting (max 60 seconds) for system process `vnlru' to stop...done
> Waiti
> Synncgi n(gm adxi sk6s0,  svencoodnedss )r efmoari nsiynsgte.m. .pr1o0c ess 
> `syncer' to stop...8 7 8 3 3 3 1 0 0 0 0 done

http://lists.freebsd.org/pipermail/freebsd-current/2007-October/078145.html
http://lists.freebsd.org/pipermail/freebsd-current/2007-November/079130.html
http://lists.freebsd.org/pipermail/freebsd-current/2007-November/079131.html
http://lists.freebsd.org/pipermail/freebsd-stable/2007-December/038727.html


> And after the reboot, the READ_DMA timeouts were back.

You're not the only one seeing this behaviour.  There are too many posts
in the past reporting similar.  Here's the breakdown:

* Some reporting this problem have been told to replace their ATA or
  SATA cables (which have previously been known to be working, but cables
  going bad does happen) -- and this has fixed the problem for a couple.

* Some have checked their SMART stats and found their disks to be in
  perfect condition.

* Some have switched to alternate operating systems (usually Linux) for
  a short while and seen no sign of DMA timeouts.

* Some have replaced the storage controller to no avail, and some have
  replaced the entire motherboard to no avail.  In some cases (myself
  included), replacing the motherboard did in fact help.

However: in your case, your disk does look to have problems based on the
SMART output you provided.  It does not matter how new/old the disk is,
by the way.  I'll point out the problematic stats.  You need to replace
the disk ASAP.

BTW, any SMART stats you see labelled "Offline" means the numbers will
not be updated until you perform an offline test (smartctl -t short or
smartctl -t long).

> The only "odd" think I can think of about my system is an unusually high HZ 
> value (2386) I'm building a kernel now with 1000 to check if that makes a 
> difference.

This is not the cause, rest assured.

> SMART Attributes Data Structure revision number: 16
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME  FLAG VALUE WORST THRESH TYPE  UPDATED  
> WHEN_FAILED RAW_VALUE
>   5 Reallocated_Sector_Ct   0x0033   253   253   063Pre-fail  Always  
>  -   4

This shows you've had 4 reallocated sectors, meaning your disk does in
fact have bad blocks.  In 90% of the cases out there, bad blocks
continue to "grow" over time, due to whatever reason (I remember reading
an article explaining it, but I can't for the life of me find the URL).

> 194 Temperature_Celsius 0x0032   253   253   000Old_age   Always  
>  -   48

This is excessive, and may be attributing to problems.  A hard disk
running at 48C is not a good sign.  This should really be somewhere
between high 20s and mid 30s.

> 195 Hardware_ECC_Recovered  0x000a   253   252   000Old_age   Always  
>  -   11498

This implies a large number of ECC (error correction) activities have
occured, but all were successful.

> Error 2 occurred at disk power-on lifetime: 5171 hours (215 days + 11 hours)
>   When the command that caused the error occurred, the device was in an 
> unknown state.
> Error 1 occurred at disk power-on lifetime: 5171 hours (215 days + 11 hours)
>   When the command that caused the error occurred, the device was in an 
> unknown state.

These are automated SMART log entries confirming the DMA failures.  The
fact that SMART saw them means that the disk is also aware of said
issues.  These may have been caused by the reallocated sectors.  It's
also interesting that the LBAs are different than the ones FreeBSD
reported issues with.

My advice to you is: replace the disk ASAP.  This problem will only get
worse.  Try another hard disk brand too (I don't have anything "against"
Maxtor, but usually its recommended to avoid a brand you have problems
with until the next time you have issues, then switch brands, etc.
etc...).  I'm very fond of Western Digital's SE16, RE, and RE2 series
currently.  But avoid Fujitsu and Samsung (both have a long track record
of having buggy drive firmwares, forcing vendors to make custom
workarounds for issues); stick with Seagate, Western Digital, or Maxtor.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


em very slow, shared irq... on 6.3p8

2008-02-27 Thread Holger Kipp
Hello,

I updated a system with 12 dc-interfaces to a new hardware
with 14 em-interfaces. Yes, it is a firewall.
New System is 6.2-RELEASE-p8.

What I now experience between two internal networks (100MBit/s each) 
is the following:
1318 packets transmitted, 1317 packets received, 0% packet loss
round-trip min/avg/max/stddev = 0.387/246.153/2441.392/324.142 ms

tcpdump on the firewall shows similar delays (on the outgoing 
interface).

tcpdump on the system I ping however shows very quick responses
for incoming packages (ie usually less than a millisecond).

I therefore assume that the problem is between receiving the
irq from em and getting the data from the interface on the firewall
itself.

My first option would be to activate polling on em-interfaces - but as
I did not experience this sort of notieceable slowdown with the old
dc-based firewall (without polling), maybe someone can shed some light
on this strange behaviour or has other suggestions as well?

More details and dmesg below.

Help and suggestions welcome :-)

Best regards,
Holger Kipp

PS: Regarding the MPSAFE-issue - I need to use ipsec, so don't have a choice
here. 




On the firewall I see many irqs (sometimes more than 5000) very often
on one of the em-interfaces using sysctl/vmstat:


2 usersLoad  0.00  0.00  0.04  Feb 27 12:20

Mem:KBREALVIRTUAL VN PAGER  SWAP PAGER
Tot   Share  TotShareFree in  out in  out
Act   14804372432800 3800 1327728 count
All  7223245248 76723364 5696 pages
 Interrupts
Proc:r  p  d  s  wCsw  Trp  Sys  Int  Sof  Fltcow7606 total
 27  78621  462 4105   158840 wire1: atkb
14988 act 6: fdc0
 0.2%Sys   3.0%Intr  0.0%User  0.0%Nice 96.8%Idl   547868 inact   12: psm
||||||||||   1244 cache   14: ata
++1326484 free359 16: em4
  daefr  2761 17: em5
Namei Name-cacheDir-cache prcfr   28: twe
Calls hits% hits% react   306 52: em0
  pdwak 1 53: em1
  zfodpdpgs   177 55: em3
Disks twed0   ozfod   intrn  2001 cpu0: time
KB/t  21.33   %slo-z   114464 buf2001 cpu1: time
tps   1 1 tfree10 dirtybuf
MB/s   0.0110 desiredvnodes
% busy0 66594 numvnodes
19758 freevnodes


dmesg

Copyright (c) 1992-2007 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 6.2-RELEASE-p8 #1: Thu Oct  4 16:07:31 CEST 2007
WARNING: debug.mpsafenet forced to 0 as ipsec requires Giant
WARNING: MPSAFE network stack disabled, expect reduced performance.
MPTable: 
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel(R) Core(TM)2 CPU  6420  @ 2.13GHz (2128.01-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x6f6  Stepping = 6
  
Features=0xbfebfbff
  Features2=0xe3bd,CX16,,>
  AMD Features=0x2010
  AMD Features2=0x1
  Cores per package: 2
real memory  = 2146349056 (2046 MB)
avail memory = 2095296512 (1998 MB)
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  1
ioapic0: Assuming intbase of 0
ioapic1: Assuming intbase of 24
ioapic2: Assuming intbase of 48
ioapic0  irqs 0-23 on motherboard
ioapic1  irqs 24-47 on motherboard
ioapic2  irqs 48-71 on motherboard
kbd1 at kbdmux0
cpu0 on motherboard
cpu1 on motherboard
pcib0:  pcibus 0 on motherboard
pci0:  on pcib0
pcib0: unable to route slot 28 INTB
pcib1:  irq 16 at device 1.0 on pci0
pci1:  on pcib1
pcib2:  at device 0.0 on pci1
pci2:  on pcib2
twe0: <3ware Storage Controller. Driver version 1.50.01.002> port 0x4000-0x400f 
mem 0xe010-0xe01f,0xe080-0xe0ff irq 28 at device 2.0 on pci2
twe0: [GIANT-LOCKED]
twe0: 2 ports, Firmware FE8S 1.05.00.068, BIOS BE7X 1.08.00.048
pci1:  at device 0.1 (no driver attached)
pcib3:  at device 0.2 on pci1
pci3:  on pcib3
pcib4:  at device 3.0 on pci3
pci4:  on pcib4
em0:  port 0x5000-0x503f 
mem 0xe100-0xe101 irq 52 at device 4.0 on pci4
em0: Ethernet address: 00:0e:0c:c6:cc:54
em

ad0 READ_DMA TIMEOUT errors on install of 7.0-RELEASE

2008-02-27 Thread Stephen Hurd
My system has been working reliably with 6.3 for quite some time... when 
I rebooted into single user mode to do the installworld with the 
7.0-RELEASE kernel, the install died about halfway through with READ_DMA 
TIMEOUT errors.  Since I had a mixed system at that point, I set 
hw.ata.ata_dma="0" in /boot/loader.conf and completed the install.


After a good boot to verify everything was working, I flipped 
hw.ata.ata_dma back and rebooted.  The corrupted sync message scared the 
heck out of me:

Waiting (max 60 seconds) for system process `vnlru' to stop...done
Waiti
Synncgi n(gm adxi sk6s0,  svencoodnedss )r efmoari nsiynsgte.m. .pr1o0c 
ess `syncer' to stop...8 7 8 3 3 3 1 0 0 0 0 done


And after the reboot, the READ_DMA timeouts were back.

I installed sysutils/smartmontools (output attached in case it's usefull)

The only "odd" think I can think of about my system is an unusually high 
HZ value (2386) I'm building a kernel now with 1000 to check if that 
makes a difference.



Copyright (c) 1992-2008 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
   The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 7.0-RELEASE #1: Tue Feb 26 22:49:13 PST 2008
   [EMAIL PROTECTED]:/usr/obj/usr/src/sys/SERVER
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel Pentium III (996.85-MHz 686-class CPU)
 Origin = "GenuineIntel"  Id = 0x686  Stepping = 6
 
Features=0x387fbff

real memory  = 1207959552 (1152 MB)
avail memory = 1172832256 (1118 MB)
MPTable: 
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
cpu0 (BSP): APIC ID:  0
cpu1 (AP): APIC ID:  1
ioapic0: Assuming intbase of 0
ioapic1: Assuming intbase of 16
ioapic0  irqs 0-15 on motherboard
ioapic1  irqs 16-31 on motherboard
kbd1 at kbdmux0
cpu0 on motherboard
cpu1 on motherboard
pcib0:  pcibus 0 on motherboard
pci0:  on pcib0
vgapci0:  port 0xd800-0xd8ff mem 
0xfd00-0xfdff,0xfeaff000-0xfeaf irq 22 at device 1.0 on pci0

drm0:  on vgapci0
info: [drm] Initialized mach64 1.0.0 20020904
fxp0:  port 0xd400-0xd43f mem 
0xfeafe000-0xfeafefff,0xfe90-0xfe9f irq 20 at device 4.0 on pci0

miibus0:  on fxp0
inphy0:  PHY 1 on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:e0:81:21:49:7e
fxp0: [ITHREAD]
fxp1:  port 0xd000-0xd03f mem 
0xfeafd000-0xfeafdfff,0xfe70-0xfe7f irq 21 at device 5.0 on pci0

miibus1:  on fxp1
inphy1:  PHY 1 on miibus1
inphy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp1: Ethernet address: 00:e0:81:21:49:7f
fxp1: [ITHREAD]
isab0:  port 0x580-0x58f at device 15.0 on pci0
isa0:  on isab0
atapci0:  port 
0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf at device 15.1 on pci0

ata0:  on atapci0
ata0: [ITHREAD]
ata1:  on atapci0
ata1: [ITHREAD]
ohci0:  mem 0xfeafc000-0xfeafcfff irq 10 
at device 15.2 on pci0

ohci0: [GIANT-LOCKED]
ohci0: [ITHREAD]
usb0: OHCI version 1.0, legacy support
usb0:  on ohci0
usb0: USB revision 1.0
uhub0: <(0x1166) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1> on usb0
uhub0: 2 ports with 2 removable, self powered
pcib1:  pcibus 1 on motherboard
pci1:  on pcib1
pcm0:  port 0xef00-0xef3f irq 27 at device 2.0 on pci1
pcm0: 
pcm0: [ITHREAD]
pcm0: 
pmtimer0 on isa0
orm0:  at iomem 
0xc-0xc7fff,0xc8000-0xc8fff,0xc9000-0xc9fff pnpid ORM on isa0

sc0:  at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0:  at port 0x3c0-0x3df iomem 0xa-0xb on isa0
atkbdc0:  at port 0x60,0x64 on isa0
atkbd0:  irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
atkbd0: [ITHREAD]
psm0:  irq 12 on atkbdc0
psm0: [GIANT-LOCKED]
psm0: [ITHREAD]
psm0: model MouseMan+, device ID 0
fdc0:  at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 
on isa0

fdc0: [FILTER]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
ppc0:  at port 0x378-0x37f irq 7 on isa0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
ppbus0:  on ppc0
ppbus0: [ITHREAD]
lpt0:  on ppbus0
lpt0: Interrupt-driven port
ppc0: [GIANT-LOCKED]
ppc0: [ITHREAD]
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio0: [FILTER]
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
unknown:  can't assign resources (memory)
unknown:  can't assign resources (port)
speaker0:  at port 0x61 pnpid PNP0800 on isa0
unknown:  can't assign resources (port)
unknown:  can't assign resources (irq)
unknown:  can't assign resources (port)
unknown:  can't assign resources (port)
unknown:  can't assign resources (port)
uhid0: 2> on uhub0

uhub0: device problem (IOERROR), disabling port 2
Timecounters tick every 0.838 msec
ipfw2 (+ipv6) initialized, divert enabled, rule-based forwarding 
disabled, default to deny, logging disabled

ad0: 239372MB  at ata0-master PIO4
acd0: CDRW  at ata1-master UDMA33
SMP: AP CPU #1 Launched!
Trying to mount root from ufs:/dev/ad0s1a
fxp0: Microcode loaded