panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Daniel Braniss
Hi,
I'm getting this with FreeBSD-8-stable, it usually happens when
starting apache:

panic: vm_fault_copy_wired: page missing
cpuid = 3
KDB: enter: panic
[thread pid 1013 tid 100106 ]
Stopped at  kdb_enter+0x3d: movq$0,0x68f170(%rip)
db tr
Tracing pid 1013 tid 100106 td 0xff0007a66ae0
kdb_enter() at kdb_enter+0x3d
panic() at panic+0x17b
vm_fault_copy_entry() at vm_fault_copy_entry+0x283
vmspace_fork() at vmspace_fork+0x4d0
fork1() at fork1+0x35f
fork() at fork+0x1c
syscall() at syscall+0x1e7
Xfast_syscall() at Xfast_syscall+0xe1
--- syscall (2, FreeBSD ELF64, fork), rip = 0x8009f41ac, rsp = 0x7fffe7d8, 
rbp = 0x800c34a80 ---

any help in tracking this?

thanks,
danny


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-15 Thread Garrett Cooper
On Wed, Apr 14, 2010 at 9:21 PM, Ian Smith smi...@nimnet.asn.au wrote:
 On Wed, 14 Apr 2010, Garrett Cooper wrote:
   On Wed, Apr 14, 2010 at 7:49 PM, Garrett Cooper yanef...@gmail.com wrote:
    On Wed, Apr 14, 2010 at 5:46 PM, Maho NAKATA cha...@mac.com wrote:
    Hi Andry and Adam
   
    My test again. No desktop, etc. I just run dgemm.
    Contrary to Adam's result, Hyper Threading makes the performance worse.
    all tests are done on Core i7 920 @ 2.67GHz. (TurboBoost @2.8GHz)
   
    Turbo Boost off, Hyper threading off: 82% (35GFlops)    [1]
    Turbo Boost off, Hyper threading off: 72% (30.5GFlops)  [2]

 Er, shouldn't one of those say HTT on?  and/or Turbo boost on?  Else
 they're both the same test as [4] but with different results?

There's a problem with 8.x+ cores reported by the kernel. For some odd
reason more recent Intel processors aren't reporting themselves as
HT-enabled when they have HT-cores (see: kern/145385).

I didn't look into the issue too hard, but since it does seem to be a
major performance loss perhaps I should; besides, it would be good
experience to put under my belt :].

    Turbo Boost on,  Hyper threading on: 71% (32GFlops)    [3]
    Turbo Boost off, Hyper threading off: 84-89% (38-40GFlops) [4]

 Clarification of all four possible test configs - 8 if you add pinning
 CPUs or not - might make this a bit clearer?

    Doesn't this make sense? Hyperthreaded cores in Intel procs still
    provide an incomplete set of registers as they're logical processors,
    so I would expect for things to be slower if they're automatically run
    on the SMT cores instead of the physical ones.

 Since we're talking FP, do HTT 'cores' share an FPU, or have their own?
 If contended, you'd have to expect worse (at least FP) performance, no?

   Ah, that's another excellent point. What instructions is dgemm
using -- pure integer based arithmetic, floating point arithmetic,
specialized operations that would benefit from using SIMD, etc?

    Is there a weighting scheme to SCHED_ULE where logical processors
    (like the SMT variety) get a lower score than real processors do, and
    thus get scheduled for less intensive interrupting tasks, or maybe
    just don't get scheduled in high use scenarios like it would if it was
    a physical processor?
  
   Err... wait. Didn't see that the turbo boost results didn't scale
   linearly or align with one another until just a sec ago. Nevermind my
   previous comment.

 Waiting for the fog to lift ..

As am I. I don't know enough in this area, but I'm definitely open
to learning.

Thanks,
-Garrett
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Linux static linked ver doesn't work on FBSD (Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-15 Thread Garrett Cooper
On Wed, Apr 14, 2010 at 10:26 PM, Maho NAKATA cha...@mac.com wrote:
 From: Pieter de Goeje pie...@degoeje.nl
 Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance 
 on FreeBSD 8/amd64, Corei7 920
 Date: Wed, 14 Apr 2010 16:05:18 +0200

 I think the best test would be to run a statically compiled linux binary on
 FreeBSD. That way the compiler settings are exactly the same.

 It is not possible for Linux amd64 binary to run on FreeBSD amd64,
 ...and not i386 version neither. GotoBLAS uses special systeml call.

 % ./dgemm
 linux_sys_futex: unknown op 265
 linux: pid 1264 (dgemm): syscall mbind not implemented
 n: 3000
 ^C
 just halt.

Yes, and while this isn't directly tied into numa, mbind(2),
mempolicy(2), and a few others use the same facilities that are
available via plain numa. I know because of messes I've tried to clean
up in these areas. I'm really not sure why this is using numa though
to be honest...
Thanks,
-Garrett
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Garrett Cooper
On Thu, Apr 15, 2010 at 12:22 AM, Daniel Braniss da...@cs.huji.ac.il wrote:
 Hi,
 I'm getting this with FreeBSD-8-stable, it usually happens when
 starting apache:

 panic: vm_fault_copy_wired: page missing
 cpuid = 3
 KDB: enter: panic
 [thread pid 1013 tid 100106 ]
 Stopped at      kdb_enter+0x3d: movq    $0,0x68f170(%rip)
 db tr
 Tracing pid 1013 tid 100106 td 0xff0007a66ae0
 kdb_enter() at kdb_enter+0x3d
 panic() at panic+0x17b
 vm_fault_copy_entry() at vm_fault_copy_entry+0x283
 vmspace_fork() at vmspace_fork+0x4d0
 fork1() at fork1+0x35f
 fork() at fork+0x1c
 syscall() at syscall+0x1e7
 Xfast_syscall() at Xfast_syscall+0xe1
 --- syscall (2, FreeBSD ELF64, fork), rip = 0x8009f41ac, rsp = 0x7fffe7d8,
 rbp = 0x800c34a80 ---

 any help in tracking this?

Hi Danny,
Can you provide some details about your systems, like amd64 vs
i386, processor model, amount of RAM, swap, etc?
Thanks!
-Garrett
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Kostik Belousov
On Thu, Apr 15, 2010 at 10:22:20AM +0300, Daniel Braniss wrote:
 Hi,
 I'm getting this with FreeBSD-8-stable, it usually happens when
 starting apache:
 
 panic: vm_fault_copy_wired: page missing
 cpuid = 3
 KDB: enter: panic
 [thread pid 1013 tid 100106 ]
 Stopped at  kdb_enter+0x3d: movq$0,0x68f170(%rip)
 db tr
 Tracing pid 1013 tid 100106 td 0xff0007a66ae0
 kdb_enter() at kdb_enter+0x3d
 panic() at panic+0x17b
 vm_fault_copy_entry() at vm_fault_copy_entry+0x283
 vmspace_fork() at vmspace_fork+0x4d0
 fork1() at fork1+0x35f
 fork() at fork+0x1c
 syscall() at syscall+0x1e7
 Xfast_syscall() at Xfast_syscall+0xe1
 --- syscall (2, FreeBSD ELF64, fork), rip = 0x8009f41ac, rsp = 
 0x7fffe7d8, 
 rbp = 0x800c34a80 ---
 
 any help in tracking this?
 
 thanks,
   danny

Is it true that the process started, or at least some of loaded dso
are from NFS mount ?


pgp7IFZgxPczs.pgp
Description: PGP signature


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Kostik Belousov
On Thu, Apr 15, 2010 at 01:06:20AM -0700, Garrett Cooper wrote:
 On Thu, Apr 15, 2010 at 12:22 AM, Daniel Braniss da...@cs.huji.ac.il wrote:
  Hi,
  I'm getting this with FreeBSD-8-stable, it usually happens when
  starting apache:
 
  panic: vm_fault_copy_wired: page missing
  cpuid = 3
  KDB: enter: panic
  [thread pid 1013 tid 100106 ]
  Stopped at      kdb_enter+0x3d: movq    $0,0x68f170(%rip)
  db tr
  Tracing pid 1013 tid 100106 td 0xff0007a66ae0
  kdb_enter() at kdb_enter+0x3d
  panic() at panic+0x17b
  vm_fault_copy_entry() at vm_fault_copy_entry+0x283
  vmspace_fork() at vmspace_fork+0x4d0
  fork1() at fork1+0x35f
  fork() at fork+0x1c
  syscall() at syscall+0x1e7
  Xfast_syscall() at Xfast_syscall+0xe1
  --- syscall (2, FreeBSD ELF64, fork), rip = 0x8009f41ac, rsp = 
  0x7fffe7d8,
  rbp = 0x800c34a80 ---
 
  any help in tracking this?
 
 Hi Danny,
 Can you provide some details about your systems, like amd64 vs
 i386, processor model, amount of RAM, swap, etc?
 Thanks!

It is obviously amd64, 64-bit process.


pgp9i22KRUwda.pgp
Description: PGP signature


Re: HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-15 Thread Adrian Chadd
May I make a suggestion?

Would you mind creating a shared google spreadsheet with your testing
results and a shared google document with the test setup?

I think having the data in an easily represented, easily shared medium
would be beneficial to everyone.


Adrian

On 15 April 2010 08:46, Maho NAKATA cha...@mac.com wrote:
 Hi Andry and Adam

 My test again. No desktop, etc. I just run dgemm.
 Contrary to Adam's result, Hyper Threading makes the performance worse.
 all tests are done on Core i7 920 @ 2.67GHz. (TurboBoost @2.8GHz)

 Turbo Boost off, Hyper threading off: 82% (35GFlops)    [1]
 Turbo Boost off, Hyper threading off: 72% (30.5GFlops)  [2]

 Turbo Boost on,  Hyper threading on: 71% (32GFlops)    [3]
 Turbo Boost off, Hyper threading off: 84-89% (38-40GFlops) [4]

 ---my system---
 CPU: Intel(R) Core(TM) i7 CPU         920  @ 2.67GHz (2683.44-MHz K8-class 
 CPU)
  Origin = GenuineIntel  Id = 0x106a5  Stepping = 5
  Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  Features2=0x98e3bdSSE3,DTES64,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,SSE4.1,SSE4.2,POPCNT
  AMD Features=0x28100800SYSCALL,NX,RDTSCP,LM
  AMD Features2=0x1LAHF
  TSC: P-state invariant
 real memory  = 12884901888 (12288 MB)
 avail memory = 12387717120 (11813 MB)
 ACPI APIC Table: 110909 APIC1026
 FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 FreeBSD/SMP: 1 package(s) x 4 core(s)
 ---my system---

 ---DETAILS---
 [1]
 % ./dgemm
 n: 3000
 time : 57.666717 or 16.339074
 Mflops : 33060.624827
 n: 3100
 time : 61.502677 or 16.597376
 Mflops : 35910.025544
 n: 3200
 time : 69.075401 or 19.199833
 Mflops : 34144.297133
 n: 3300
 time : 73.699540 or 19.633594
 Mflops : 36618.756539
 n: 3400
 time : 82.256194 or 22.373651
 Mflops : 35144.518837
 n: 3500
 time : 88.975662 or 24.118761
 Mflops : 35563.394249
 n: 3600
 time : 96.436652 or 26.027588
 Mflops : 35861.148385
 n: 3700
 [2]
 % ./dgemm
 n: 3000
 time : 139.622739 or 17.693806
 Mflops : 30529.327312
 n: 3100
 time : 154.344971 or 19.566886
 Mflops : 30460.247702
 n: 3200
 time : 169.507739 or 21.467100
 Mflops : 30538.116602
 n: 3300
 time : 186.363773 or 23.615281
 Mflops : 30444.600545
 n: 3400
 time : 203.798979 or 25.817667
 Mflops : 30456.322788
 n: 3500
 ...
 [3]
 % ./dgemm
 n: 3000
 time : 134.673079 or 16.958682
 Mflops : 31852.711082
 n: 3100
 time : 148.410085 or 18.663248
 Mflops : 31935.073574
 n: 3200
 time : 162.835473 or 20.468825
 Mflops : 32027.475770
 n: 3300
 time : 179.025370 or 22.479189
 Mflops : 31983.262501
 n: 3400
 time : 195.859710 or 24.663009
 Mflops : 31882.208788
 n: 3500
 [4]
 % ./dgemm
 n: 3000
 time : 54.259647 or 14.684309
 Mflops : 36786.204907
 n: 3100
 time : 60.899147 or 17.124599
 Mflops : 34804.447141
 n: 3200
 time : 64.295342 or 17.490787
 Mflops : 37480.577569
 n: 3300
 time : 69.781247 or 18.288840
 Mflops : 39311.284796
 n: 3400
 time : 79.234397 or 21.829736
 Mflops : 36020.187858
 n: 3500
 time : 83.905419 or 22.381237
 Mflops : 38324.289174
 n: 3600
 time : 92.195022 or 25.105942
 Mflops : 37177.621122
 n: 3700
 time : 97.718841 or 25.434243
 Mflops : 39841.319494
 n: 3800
 time : 105.740463 or 27.414029
 Mflops : 40042.592613
 n: 3900
 time : 113.980157 or 29.678505
 Mflops : 39984.635420
 n: 4000
 time : 122.941569 or 31.946174
 Mflops : 40077.412531
 n: 4100
 ---DETAILS---


 From: Adam Vande More amvandem...@gmail.com
 Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance 
 on FreeBSD 8/amd64, Corei7 920
 Date: Wed, 14 Apr 2010 11:34:45 -0500

  time : 162.45 or 20.430651
  Mflops : 32087.318295
  n: 3300
  time : 178.497079 or 22.446093
  Mflops : 32030.420499
  n: 3400
  time : 195.550715 or 24.586152
  Mflops : 31981.873273
  n: 3500
  time : 213.403379 or 26.825058
  Mflops : 31975.513363
  n: 3600
  ...
  above output is on Core i7 920 (2.66GHz; TurboBoost on)

 My results:
 $ ./dgemm
 n: 3000
 time : 54.151302 or 28.189781
 Mflops : 19162.263125
 n: 3100
 time : 60.157449 or 32.214141
 Mflops : 18501.570537
 n: 3200
 time : 65.753191 or 34.114872
 Mflops : 19216.393378

 CPU:
 CPU: Intel(R) Core(TM)2 Duo CPU     E7300  @ 2.66GHz (2653.35-MHz K8-class
 CPU)
  Origin = GenuineIntel  Id = 0x10676  Stepping = 6


 Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE

  Features2=0x8e39dSSE3,DTES64,MON,DS_CPL,EST,TM2,SSSE3,CX16,xTPR,PDCM,SSE4.1
  AMD Features=0x20100800SYSCALL,NX,LM
  AMD Features2=0x1LAHF
  TSC: P-state invariant
 ⋮
 FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 FreeBSD/SMP: 1 package(s) x 2 core(s)

 FreeBSD:
 FreeBSD 8.0-STABLE r205070 amd64

 Please note that the system was not dedicated to the test, I had
 Xorg+KDE3+thunderbird+skype+kopete+konsole(s) plus a bunch of daemons
 running.
 That probably explains irregularities in the results.

 I am not sure how exactly theoretical maximum should 

Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Daniel Braniss
 On Thu, Apr 15, 2010 at 12:22 AM, Daniel Braniss da...@cs.huji.ac.il wrot=e:
  Hi,
  I'm getting this with FreeBSD-8-stable, it usually happens when
  starting apache:
 
  panic: vm_fault_copy_wired: page missing
  cpuid = 3
  KDB: enter: panic
  [thread pid 1013 tid 100106 ]
  Stopped at      kdb_enter+0x3d: movq    $0,0x68f170(%rip)
  db tr
  Tracing pid 1013 tid 100106 td 0xff0007a66ae0
  kdb_enter() at kdb_enter+0x3d
  panic() at panic+0x17b
  vm_fault_copy_entry() at vm_fault_copy_entry+0x283
  vmspace_fork() at vmspace_fork+0x4d0
  fork1() at fork1+0x35f
  fork() at fork+0x1c
  syscall() at syscall+0x1e7
  Xfast_syscall() at Xfast_syscall+0xe1
  --- syscall (2, FreeBSD ELF64, fork), rip = 0x8009f41ac, rsp = 
  0x7fff=e7d8,
  rbp = 0x800c34a80 ---
 
  any help in tracking this?
  Hi Danny,
 Can you provide some details about your systems, like amd64 vs
 i386, processor model, amount of RAM, swap, etc?
sure, straight from the lion's mouth:

Copyright (c) 1992-2010 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 8.0-STABLE #33 r2073: Wed Apr 14 15:29:07 IDT 2010
da...@sunfire:/r+d/obj/sunfire/r+d/stable/8/sys/HUJI amd64
Timecounter i8254 frequency 1193182 Hz quality 0
CPU: Dual-Core AMD Opteron(tm) Processor 2218 (2613.41-MHz K8-class CPU)
  Origin = AuthenticAMD  Id = 0x40f13  Family = f  Model = 41  Stepping = 3
  Features=0x178bfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,
CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2,HTT
  Features2=0x2001SSE3,CX16
  AMD Features=0xea500800SYSCALL,NX,MMX+,FFXSR,RDTSCP,LM,3DNow!+,3DNow!
  AMD Features2=0x1fLAHF,CMP,SVM,ExtAPIC,CR8
real memory  = 17179869184 (16384 MB)
avail memory = 16562614272 (15795 MB)

the hardware is a Sun X2200.

thanks for any help! this machine is supposed to replace our old web server
and it's not happening :-(

danny


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-15 Thread Andriy Gapon
on 14/04/2010 20:47 Adam Vande More said the following:
 I'm no expert Andriy, but it seems like if gotoblas
 implemented some of the FreeBSD optimizations then we'd be in the same
 ballpark.

This is a good point.
But on the other hand, it means that our scheduler doesn't do a perfect job
here.  BTW, I use ULE.
My observation is that when a number of CPU-intensive long running processes is
less than or equal to number of cores, then the processes tend to stay on the
same cores for a long time.
But if the number of the processes is greater, then they seem to jump from core
to core a lot.
But I am not sure what would be an optimal strategy for that case.  If we try to
keep some lucky processes on the same core, then cpu time might be shared
unfairly.  Shuffling cores provides more fairness, but can hurt total 
performance.

-- 
Andriy Gapon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Jeremy Chadwick
On Thu, Apr 15, 2010 at 11:50:41AM +0300, Daniel Braniss wrote:
  On Thu, Apr 15, 2010 at 12:22 AM, Daniel Braniss da...@cs.huji.ac.il 
  wrot=e:
   Hi,
   I'm getting this with FreeBSD-8-stable, it usually happens when
   starting apache:
  
   panic: vm_fault_copy_wired: page missing
   cpuid = 3
   KDB: enter: panic
   [thread pid 1013 tid 100106 ]
   Stopped at      kdb_enter+0x3d: movq    $0,0x68f170(%rip)
   db tr
   Tracing pid 1013 tid 100106 td 0xff0007a66ae0
   kdb_enter() at kdb_enter+0x3d
   panic() at panic+0x17b
   vm_fault_copy_entry() at vm_fault_copy_entry+0x283
   vmspace_fork() at vmspace_fork+0x4d0
   fork1() at fork1+0x35f
   fork() at fork+0x1c
   syscall() at syscall+0x1e7
   Xfast_syscall() at Xfast_syscall+0xe1
   --- syscall (2, FreeBSD ELF64, fork), rip = 0x8009f41ac, rsp = 
   0x7fff=e7d8,
   rbp = 0x800c34a80 ---
  
   any help in tracking this?
   Hi Danny,
  Can you provide some details about your systems, like amd64 vs
  i386, processor model, amount of RAM, swap, etc?
 sure, straight from the lion's mouth:
 
 Copyright (c) 1992-2010 The FreeBSD Project.
 Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
 The Regents of the University of California. All rights reserved.
 FreeBSD is a registered trademark of The FreeBSD Foundation.
 FreeBSD 8.0-STABLE #33 r2073: Wed Apr 14 15:29:07 IDT 2010
 da...@sunfire:/r+d/obj/sunfire/r+d/stable/8/sys/HUJI amd64
 Timecounter i8254 frequency 1193182 Hz quality 0
 CPU: Dual-Core AMD Opteron(tm) Processor 2218 (2613.41-MHz K8-class CPU)
   Origin = AuthenticAMD  Id = 0x40f13  Family = f  Model = 41  Stepping = 3
   
 Features=0x178bfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,
 CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2,HTT
   Features2=0x2001SSE3,CX16
   AMD Features=0xea500800SYSCALL,NX,MMX+,FFXSR,RDTSCP,LM,3DNow!+,3DNow!
   AMD Features2=0x1fLAHF,CMP,SVM,ExtAPIC,CR8
 real memory  = 17179869184 (16384 MB)
 avail memory = 16562614272 (15795 MB)
 
 the hardware is a Sun X2200.
 
 thanks for any help! this machine is supposed to replace our old web server
 and it's not happening :-(

Could you please provide the following?

1) Contents of /var/db/ports/apache-whatever/options
2) Contents of /etc/make.conf
3) Your kernel configuration file (HUJI)

Thanks.

-- 
| Jeremy Chadwick   j...@parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Vlad Galu
On Thu, Apr 15, 2010 at 9:22 AM, Daniel Braniss da...@cs.huji.ac.il wrote:
 Hi,
 I'm getting this with FreeBSD-8-stable, it usually happens when
 starting apache:

alc@ made some VM MFCs yesterday, could you try a 13th of April kernel
and see if it works out for you?


-- 
Good, fast  cheap. Pick any two.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Daniel Braniss
 On Thu, Apr 15, 2010 at 11:50:41AM +0300, Daniel Braniss wrote:
   On Thu, Apr 15, 2010 at 12:22 AM, Daniel Braniss da...@cs.huji.ac.il 
   wrot=e:
Hi,
I'm getting this with FreeBSD-8-stable, it usually happens when
starting apache:
   
panic: vm_fault_copy_wired: page missing
cpuid = 3
KDB: enter: panic
[thread pid 1013 tid 100106 ]
Stopped at      kdb_enter+0x3d: movq    $0,0x68f170(%rip)
db tr
Tracing pid 1013 tid 100106 td 0xff0007a66ae0
kdb_enter() at kdb_enter+0x3d
panic() at panic+0x17b
vm_fault_copy_entry() at vm_fault_copy_entry+0x283
vmspace_fork() at vmspace_fork+0x4d0
fork1() at fork1+0x35f
fork() at fork+0x1c
syscall() at syscall+0x1e7
Xfast_syscall() at Xfast_syscall+0xe1
--- syscall (2, FreeBSD ELF64, fork), rip = 0x8009f41ac, rsp = 
0x7fff=e7d8,
rbp = 0x800c34a80 ---
   
any help in tracking this?
Hi Danny,
   Can you provide some details about your systems, like amd64 vs
   i386, processor model, amount of RAM, swap, etc?
  sure, straight from the lion's mouth:
  
  Copyright (c) 1992-2010 The FreeBSD Project.
  Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
  The Regents of the University of California. All rights reserved.
  FreeBSD is a registered trademark of The FreeBSD Foundation.
  FreeBSD 8.0-STABLE #33 r2073: Wed Apr 14 15:29:07 IDT 2010
  da...@sunfire:/r+d/obj/sunfire/r+d/stable/8/sys/HUJI amd64
  Timecounter i8254 frequency 1193182 Hz quality 0
  CPU: Dual-Core AMD Opteron(tm) Processor 2218 (2613.41-MHz K8-class CPU)
Origin = AuthenticAMD  Id = 0x40f13  Family = f  Model = 41  Stepping = 
  3

  Features=0x178bfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,
  CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2,HTT
Features2=0x2001SSE3,CX16
AMD Features=0xea500800SYSCALL,NX,MMX+,FFXSR,RDTSCP,LM,3DNow!+,3DNow!
AMD Features2=0x1fLAHF,CMP,SVM,ExtAPIC,CR8
  real memory  = 17179869184 (16384 MB)
  avail memory = 16562614272 (15795 MB)
  
  the hardware is a Sun X2200.
  
  thanks for any help! this machine is supposed to replace our old web server
  and it's not happening :-(
 
 Could you please provide the following?
 
 1) Contents of /var/db/ports/apache-whatever/options
sunfire cat  /var/db/ports/apache-xml-security-c/options
# This file is auto-generated by 'make config'.
# No user-servicable parts inside!
# Options for apache-xml-security-c-1.4.0
_OPTIONS_READ=apache-xml-security-c-1.4.0
WITH_XERCES_DEVEL=true

 2) Contents of /etc/make.conf
sunfire cat /etc/make.conf
OVERRIDE_LINUX_BASE_PORT=f8
OVERRIDE_LINUX_NONBASE_PORTS=f8
WRKDIRPREFIX=/home/pobj
PACKAGES=/r+d/packages
FETCH_ENV=  HTTP_PROXY=http://wwwproxy.cs.huji.ac.il:8080/
# added by use.perl 2009-11-10 11:51:57
PERL_VERSION=5.10.1

 3) Your kernel configuration file (HUJI)
 i'll try and send this as an attachment
sunfire config -x /boot/kernel/kernel

 Thanks.
 
 -- 
 | Jeremy Chadwick   j...@parodius.com |
 | Parodius Networking   http://www.parodius.com/ |
 | UNIX Systems Administrator  Mountain View, CA, USA |
 | Making life hard for others since 1977.  PGP: 4BD6C0CB |
 

options CONFIG_AUTOGENERATED
ident   HUJI
machine amd64
cpu HAMMER
makeoptions DEBUG=-g
options PRINTF_BUFR_SIZE=256
options ALTQ_HFSC
options ALTQ_PRIQ
options ALTQ_CBQ
options ALTQ
options DEVICE_POLLING
options CONSPEED=115200
options ALT_BREAK_TO_DEBUGGER
options BOOTP_NFSV3
options INCLUDE_CONFIG_FILE
options AH_SUPPORT_AR5416
options IEEE80211_SUPPORT_MESH
options IEEE80211_AMPDU_AGE
options IEEE80211_DEBUG
options AHD_REG_PRETTY_PRINT
options AHC_REG_PRETTY_PRINT
options ATA_REQUEST_TIMEOUT=3
options SMP
options GDB
options DDB
options KDB
options FLOWTABLE
options MAC
options AUDIT
options HWPMC_HOOKS
options KBD_INSTALL_CDEV
options _KPOSIX_PRIORITY_SCHEDULING
options P1003_1B_SEMAPHORES
options SYSVSEM
options SYSVMSG
options SYSVSHM
options STACK
options KTRACE
options SCSI_DELAY=500
options COMPAT_FREEBSD7
options COMPAT_FREEBSD6
options COMPAT_FREEBSD5
options COMPAT_FREEBSD4
options COMPAT_FREEBSD32
options COMPAT_43TTY
options GEOM_LABEL
options GEOM_PART_GPT
options PSEUDOFS
options PROCFS
options CD9660
options MSDOSFS
options NFS_ROOT
options NFSLOCKD
options NFSCLIENT
options MD_ROOT
options UFS_GJOURNAL
options UFS_DIRHASH
options UFS_ACL
options SOFTUPDATES
options FFS
options SCTP
options INET6
options INET
options PREEMPTION
options SCHED_ULE
options GEOM_PART_MBR
options GEOM_PART_EBR_COMPAT
options GEOM_PART_EBR
options GEOM_PART_BSD
device  isa
device  mem
device  io
device  uart_ns8250
device  cpufreq
device  acpi
device  pci
device  fdc
device  ata
device  atadisk
device  ataraid
device  atapicd
device  atapifd
device  atapist
device  ahc
device  ahd
device  amd
device  hptiop
device  isp
device  mpt
device  sym
device  trm
device  adv
device  adw
device  aic
device  

Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Daniel Braniss
 On Thu, Apr 15, 2010 at 9:22 AM, Daniel Braniss da...@cs.huji.ac.il wrote:
  Hi,
  I'm getting this with FreeBSD-8-stable, it usually happens when
  starting apache:
 
 alc@ made some VM MFCs yesterday, could you try a 13th of April kernel
 and see if it works out for you?
 
asap, btw, I reduced the amount of physical memory and things seem ok.

cheers,
danny


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Daniel Braniss
 On Thu, Apr 15, 2010 at 9:22 AM, Daniel Braniss da...@cs.huji.ac.il wrote:
  Hi,
  I'm getting this with FreeBSD-8-stable, it usually happens when
  starting apache:
 
 alc@ made some VM MFCs yesterday, could you try a 13th of April kernel
 and see if it works out for you?
the kernel that panics does not include alc's MFC - I did the sync few
hours before -, so now I'm copiling with the MFC.
BTW, with less memory the server is still running!

danny


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Daniel Braniss
 
 --xFkczX7rH1pKA3aV
 Content-Type: text/plain; charset=us-ascii
 Content-Disposition: inline
 Content-Transfer-Encoding: quoted-printable
 
 On Thu, Apr 15, 2010 at 10:22:20AM +0300, Daniel Braniss wrote:
  Hi,
  I'm getting this with FreeBSD-8-stable, it usually happens when
  starting apache:
 =20
  panic: vm_fault_copy_wired: page missing
  cpuid =3D 3
  KDB: enter: panic
  [thread pid 1013 tid 100106 ]
  Stopped at  kdb_enter+0x3d: movq$0,0x68f170(%rip)
  db tr
  Tracing pid 1013 tid 100106 td 0xff0007a66ae0
  kdb_enter() at kdb_enter+0x3d
  panic() at panic+0x17b
  vm_fault_copy_entry() at vm_fault_copy_entry+0x283
  vmspace_fork() at vmspace_fork+0x4d0
  fork1() at fork1+0x35f
  fork() at fork+0x1c
  syscall() at syscall+0x1e7
  Xfast_syscall() at Xfast_syscall+0xe1
  --- syscall (2, FreeBSD ELF64, fork), rip =3D 0x8009f41ac, rsp =3D 0x7fff=
 e7d8,=20
  rbp =3D 0x800c34a80 ---
 =20
  any help in tracking this?
 =20
  thanks,
  danny
 
 Is it true that the process started, or at least some of loaded dso
 are from NFS mount ?
everything is nfs :-), the host is dataless
but redusing the amount of physical memory has solved the
problem, so I don't think NFS is the problem.

danny


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Kostik Belousov
On Thu, Apr 15, 2010 at 12:39:13PM +0300, Daniel Braniss wrote:
  
  --xFkczX7rH1pKA3aV
  Content-Type: text/plain; charset=us-ascii
  Content-Disposition: inline
  Content-Transfer-Encoding: quoted-printable
  
  On Thu, Apr 15, 2010 at 10:22:20AM +0300, Daniel Braniss wrote:
   Hi,
   I'm getting this with FreeBSD-8-stable, it usually happens when
   starting apache:
  =20
   panic: vm_fault_copy_wired: page missing
   cpuid =3D 3
   KDB: enter: panic
   [thread pid 1013 tid 100106 ]
   Stopped at  kdb_enter+0x3d: movq$0,0x68f170(%rip)
   db tr
   Tracing pid 1013 tid 100106 td 0xff0007a66ae0
   kdb_enter() at kdb_enter+0x3d
   panic() at panic+0x17b
   vm_fault_copy_entry() at vm_fault_copy_entry+0x283
   vmspace_fork() at vmspace_fork+0x4d0
   fork1() at fork1+0x35f
   fork() at fork+0x1c
   syscall() at syscall+0x1e7
   Xfast_syscall() at Xfast_syscall+0xe1
   --- syscall (2, FreeBSD ELF64, fork), rip =3D 0x8009f41ac, rsp =3D 0x7fff=
  e7d8,=20
   rbp =3D 0x800c34a80 ---
  =20
   any help in tracking this?
  =20
   thanks,
 danny
  
  Is it true that the process started, or at least some of loaded dso
  are from NFS mount ?
 everything is nfs :-), the host is dataless
 but redusing the amount of physical memory has solved the
 problem, so I don't think NFS is the problem.

I do think that NFS is problem. Another key point is that your process
is mlock'ed, right ? This is kind of known issue with NFS and mlock.


pgpkFlpUQUiOO.pgp
Description: PGP signature


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Daniel Braniss
 On Thu, Apr 15, 2010 at 9:22 AM, Daniel Braniss da...@cs.huji.ac.il wrote:
  Hi,
  I'm getting this with FreeBSD-8-stable, it usually happens when
  starting apache:
 
 alc@ made some VM MFCs yesterday, could you try a 13th of April kernel
 and see if it works out for you?

with or without the MFC it's still panicking, and the memory size does not
affect the outcome :-(

danny


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Daniel Braniss
 
 --U3Zg06C/E2vtHpAW
 Content-Type: text/plain; charset=us-ascii
 Content-Disposition: inline
 Content-Transfer-Encoding: quoted-printable
 
 On Thu, Apr 15, 2010 at 12:39:13PM +0300, Daniel Braniss wrote:
  =20
   --xFkczX7rH1pKA3aV
   Content-Type: text/plain; charset=3Dus-ascii
   Content-Disposition: inline
   Content-Transfer-Encoding: quoted-printable
  =20
   On Thu, Apr 15, 2010 at 10:22:20AM +0300, Daniel Braniss wrote:
Hi,
I'm getting this with FreeBSD-8-stable, it usually happens when
starting apache:
   =3D20
panic: vm_fault_copy_wired: page missing
cpuid =3D3D 3
KDB: enter: panic
[thread pid 1013 tid 100106 ]
Stopped at  kdb_enter+0x3d: movq$0,0x68f170(%rip)
db tr
Tracing pid 1013 tid 100106 td 0xff0007a66ae0
kdb_enter() at kdb_enter+0x3d
panic() at panic+0x17b
vm_fault_copy_entry() at vm_fault_copy_entry+0x283
vmspace_fork() at vmspace_fork+0x4d0
fork1() at fork1+0x35f
fork() at fork+0x1c
syscall() at syscall+0x1e7
Xfast_syscall() at Xfast_syscall+0xe1
--- syscall (2, FreeBSD ELF64, fork), rip =3D3D 0x8009f41ac, rsp =3D3=
 D 0x7fff=3D
   e7d8,=3D20
rbp =3D3D 0x800c34a80 ---
   =3D20
any help in tracking this?
   =3D20
thanks,
danny
  =20
   Is it true that the process started, or at least some of loaded dso
   are from NFS mount ?
  everything is nfs :-), the host is dataless
  but redusing the amount of physical memory has solved the
  problem, so I don't think NFS is the problem.
 
 I do think that NFS is problem. Another key point is that your process
 is mlock'ed, right ? This is kind of known issue with NFS and mlock.
 
well, since it's panicking again, there goes the memsize theory.
this is getting weirder and weirder, it now panics on reboot:

Stopping cron.
Stopping sshd.
=== apache22 profile: httpd
=== apache22 profile: httpdyn
Stopping inetd.
Stopping ntpd.
Stopping lockd.
Waiting for PIDS: 1201.
Stopping statd.
Stopping nfsd.
Stopping mountd.
Stopping devd.
.
Apr 15 13:27:48 sf-02 syslogd: exiting on signal 15
panic: vm_fault_copy_wired: page missing
cpuid = 1
KDB: enter: panic
[thread pid 1014 tid 100118 ]
Stopped at  kdb_enter+0x3d: movq$0,0x68f7a0(%rip)
db  tr
Tracing pid 1014 tid 100118 td 0xff000533f3a0
kdb_enter() at kdb_enter+0x3d
panic() at panic+0x17b
vm_fault_copy_entry() at vm_fault_copy_entry+0x283
vmspace_fork() at vmspace_fork+0x4d0
fork1() at fork1+0x35f
fork() at fork+0x1c
syscall() at syscall+0x1e7
Xfast_syscall() at Xfast_syscall+0xe1
--- syscall (2, FreeBSD ELF64, fork), rip = 0x8009f41ac, rsp = 0x7fffe7d8, 
rbp = 0x800c34a00 ---


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Jeremy Chadwick
On Thu, Apr 15, 2010 at 01:24:14PM +0300, Daniel Braniss wrote:
  On Thu, Apr 15, 2010 at 9:22 AM, Daniel Braniss da...@cs.huji.ac.il wrote:
   Hi,
   I'm getting this with FreeBSD-8-stable, it usually happens when
   starting apache:
  
  alc@ made some VM MFCs yesterday, could you try a 13th of April kernel
  and see if it works out for you?
 
 with or without the MFC it's still panicking, and the memory size does not
 affect the outcome :-(

Shot in the dark: either at the interactive loader prompt or by editing
/boot/loader.conf, try disabling superpage support:

vm.pmap.pg_ps_enabled=0

-- 
| Jeremy Chadwick   j...@parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Daniel Braniss
 On Thu, Apr 15, 2010 at 01:24:14PM +0300, Daniel Braniss wrote:
   On Thu, Apr 15, 2010 at 9:22 AM, Daniel Braniss da...@cs.huji.ac.il 
   wrote:
Hi,
I'm getting this with FreeBSD-8-stable, it usually happens when
starting apache:
   
   alc@ made some VM MFCs yesterday, could you try a 13th of April kernel
   and see if it works out for you?
  
  with or without the MFC it's still panicking, and the memory size does not
  affect the outcome :-(
 
 Shot in the dark: either at the interactive loader prompt or by editing
 /boot/loader.conf, try disabling superpage support:
 
 vm.pmap.pg_ps_enabled=0

that's the first thing I tried :-(
just to complicate things a bit, if I start the apache later, via forcestart, 
things
'seem' better.
but keep them comming, I need this fixed.
cheers,
danny


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Jeremy Chadwick
On Thu, Apr 15, 2010 at 02:05:26PM +0300, Daniel Braniss wrote:
  On Thu, Apr 15, 2010 at 01:24:14PM +0300, Daniel Braniss wrote:
On Thu, Apr 15, 2010 at 9:22 AM, Daniel Braniss da...@cs.huji.ac.il 
wrote:
 Hi,
 I'm getting this with FreeBSD-8-stable, it usually happens when
 starting apache:

alc@ made some VM MFCs yesterday, could you try a 13th of April kernel
and see if it works out for you?
   
   with or without the MFC it's still panicking, and the memory size does not
   affect the outcome :-(
  
  Shot in the dark: either at the interactive loader prompt or by editing
  /boot/loader.conf, try disabling superpage support:
  
  vm.pmap.pg_ps_enabled=0
 
 that's the first thing I tried :-(
 just to complicate things a bit, if I start the apache later, via forcestart, 
 things
 'seem' better.
 but keep them comming, I need this fixed.

Take NFS out of the picture if you can...

-- 
| Jeremy Chadwick   j...@parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Ivan Voras

On 04/15/10 13:11, Jeremy Chadwick wrote:

On Thu, Apr 15, 2010 at 02:05:26PM +0300, Daniel Braniss wrote:

On Thu, Apr 15, 2010 at 01:24:14PM +0300, Daniel Braniss wrote:

On Thu, Apr 15, 2010 at 9:22 AM, Daniel Branissda...@cs.huji.ac.il  wrote:

Hi,
I'm getting this with FreeBSD-8-stable, it usually happens when
starting apache:


alc@ made some VM MFCs yesterday, could you try a 13th of April kernel
and see if it works out for you?


with or without the MFC it's still panicking, and the memory size does not
affect the outcome :-(


Shot in the dark: either at the interactive loader prompt or by editing
/boot/loader.conf, try disabling superpage support:

vm.pmap.pg_ps_enabled=0


that's the first thing I tried :-(
just to complicate things a bit, if I start the apache later, via forcestart,
things
'seem' better.
but keep them comming, I need this fixed.


Take NFS out of the picture if you can...


I'm late into the discussion but just to verify - you are talking about 
not running executables over NFS, right? Not serving data?


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Daniel Braniss

 Take NFS out of the picture if you can...
 
I've been thinking along those lines, and Kostic is convinced
that the problem lies there, so I guess I'll give it a try, but
it's no realy a solution.

cheers,
danny


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: panic: vm_fault_copy_wired: page missing

2010-04-15 Thread Kostik Belousov
On Thu, Apr 15, 2010 at 02:54:19PM +0300, Daniel Braniss wrote:
 
  Take NFS out of the picture if you can...
  
 I've been thinking along those lines, and Kostic is convinced
 that the problem lies there, so I guess I'll give it a try, but
 it's no realy a solution.

Better solution is to remove mlock()/mlockall().


pgpBJKMxReJsc.pgp
Description: PGP signature


Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-15 Thread Adam Vande More
On Thu, Apr 15, 2010 at 3:54 AM, Andriy Gapon a...@freebsd.org wrote:

 This is a good point.
 But on the other hand, it means that our scheduler doesn't do a perfect job
 here.  BTW, I use ULE.
 My observation is that when a number of CPU-intensive long running
 processes is
 less than or equal to number of cores, then the processes tend to stay on
 the
 same cores for a long time.
 But if the number of the processes is greater, then they seem to jump from
 core
 to core a lot.
 But I am not sure what would be an optimal strategy for that case.  If we
 try to
 keep some lucky processes on the same core, then cpu time might be shared
 unfairly.  Shuffling cores provides more fairness, but can hurt total
 performance.


Is is possible to add a tunable to the scheduler for it's aggressiveness in
switching cores?

-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


NFS permission strangeness

2010-04-15 Thread Giulio Ferro

Here's the setup:
server : NFS server machine (fb 8 stable amd64 )
client : NFS client machine (as above)

server and client are both sharing the same permission database through 
ldap:


Both have in /etc/nsswitch.conf
...
group: files ldap
...
passwd: files ldap

This issue isn't related to ldap, however. I get the same result if I 
manually add

groups to /etc/group file (read on)

Let's suppose I have user giulio configured in my system.
giulio is also part (-G) of groups:
group1, group2, group3, ... , group10

server is exporting the directory
/path/to/root (on zfs)

the directory
/path/to/root/dir/etc/subdir1
has permission 770 and group ownership group3

I login as user giulio on server I can enter subdir1 directory, 
since I'm

member of group group3

I then login as user giulio on client, and I can do the same (as 
expected).



When groups are more than a few, however, I get this strange behavior:

let's suppose the directory:
/path/to/root/dir/etc/subdir2
has permission 770 and group ownership group10

What happens is that I can access subdir2 on the server machine when I
login as giulio, but when I try to access that same dir on the client 
machine

I get:
$ cd /path/to/root/dir/etc
(ok)
$ cd subdir2
subdir2/: Permission denied.

if I issue this command on the client:
$ id
I get :
uid=1000 (giulio), gid=1000 (giuliogroup), groups=group1(1001), 
group2(1002),

group3(1003),...,group10(1010)

So there shouldn't really be any reason for me not to be able to access 
that dir...


Any idea?


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: CPU problems after 8.0-STABLE update

2010-04-15 Thread Akephalos
On Wed, 14 Apr 2010 20:10:09 +0300
Andriy Gapon a...@icyb.net.ua wrote:

 Indeed, that almost sounds too good to be true :-)
 Couple of questions:
 1. Could you please check in dmesg if hpet attaches normally now or
 still has an error?
 2. Do you have to run ntpdate after each power-off or is everything
 OK after the first run?
 3. Have you ever set time on this machine before (in BIOS, other OS,
 etc)?
 4. Can you please double-check that  lapic_allclocks is zero in
 kernel? You can run 'kgdb /boot/kernel/kernel /dev/mem' and then
 'print lapic_allclocks'
 
 Thanks!
 -- 
 Andriy Gapon

1: %dmesg | grep -i hpet
ACPI HPET table warning: Sequence is non-zero (2)
acpi_hpet0: High Precision Event Timer iomem 0xfed0-0xfed003ff on
acpi0 acpi_hpet0: HPET never increments, disabling
device_attach: acpi_hpet0 attach returned 6

2: Yes, I have to run it every time for it to work. Not once my CPU
load is detected after reboot, unless running ntpd.

3: Yes, I had problems with mounts in the future and whatnot. I don't
remember what I used where, but on Linux I used to remove a
file, /var/lib/hwclock/adjtime and run ntpd - for the things to get
in line.

4: In the debugger it prints: $1 = 0, so I suppose yeah, it is set to
zero.

Laterz!

-- 
Mihai
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Strange Swapping Issues(?)

2010-04-15 Thread Peter Jeremy
On 2010-Apr-14 06:44:58 +0200, Gabor PALI p...@freebsd.org wrote:
Apr 14 05:26:45 xxx kernel: swap_pager_getswapspace(16): failed
Apr 14 05:26:45 xxx kernel: pid 7388 (throwto003), uid 1001, was
killed: out of swap space
Apr 14 05:26:45 xxx kernel: swap_pager_getswapspace(16): failed

The out-of-swap hander will kill the largest process so one of your
problems is probably throwto003.  I can't offer any suggestion as to
why the swap_pager_getswapspace() errors continued afterwards.

-- 
Peter Jeremy


pgp0nNZofvEHu.pgp
Description: PGP signature


Re: CPU problems after 8.0-STABLE update

2010-04-15 Thread Andriy Gapon
on 15/04/2010 20:53 Akephalos said the following:
 1: %dmesg | grep -i hpet
 ACPI HPET table warning: Sequence is non-zero (2)
 acpi_hpet0: High Precision Event Timer iomem 0xfed0-0xfed003ff on
 acpi0 acpi_hpet0: HPET never increments, disabling
 device_attach: acpi_hpet0 attach returned 6
 
 2: Yes, I have to run it every time for it to work. Not once my CPU
 load is detected after reboot, unless running ntpd.
 
 3: Yes, I had problems with mounts in the future and whatnot. I don't
 remember what I used where, but on Linux I used to remove a
 file, /var/lib/hwclock/adjtime and run ntpd - for the things to get
 in line.
 
 4: In the debugger it prints: $1 = 0, so I suppose yeah, it is set to
 zero.

I really suspect faulty hardware here.
Maybe dead CMOS/NVRAM battery or dead crystal or some chip or circuit.

-- 
Andriy Gapon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-15 Thread Andriy Gapon
on 15/04/2010 16:23 Adam Vande More said the following:
 Is is possible to add a tunable to the scheduler for it's aggressiveness
 in switching cores?

No idea; not a scheduler person.

-- 
Andriy Gapon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Strange Swapping Issues(?)

2010-04-15 Thread Gabor PALI
Hello,

On 04/15/10 20:41, Peter Jeremy wrote:
 The out-of-swap hander will kill the largest process so one of your
 problems is probably throwto003.  I can't offer any suggestion as
 to why the swap_pager_getswapspace() errors continued afterwards.

Okay, it was my fault.  After huge processes, like throwto003 are
killed, swap usage falls below 3-4% in a second.  (I will need to
consult the author of that processes.)

For your amusement, here is a log excerpt of the situation (it can by
easily reproduced on my system).

Columns are as follows:

- Time
- Swap Used (KB) [swapinfo]
- Swap Free (KB) [swapinfo]
- Number of Processes [ps]
- Active Virtual Pages (KB) [vmstat]
- Size of Free List (KB) [vmstat]


Messages are pounding the logs at 17:16:37:  kernel:
swap_pager_getswapspace(12): failed, and at 17:16:39 throwto003 is
finally killed:  kernel: pid 117 (throwto003), uid 1001, was killed:
out of swap space, no further fails.


17:15:3081720   4112584 155 2033636 71464
17:15:3181720   4112584 155 2777084 9732
17:15:41439384  3754920 155 2953212 10104
17:15:42493804  3700500 155 3121148 10152
17:15:44730560  3463744 155 3280812 10208
17:15:47817608  3376696 155 3459068 9980
17:15:49988420  3205884 155 3625008 10268
17:15:511182748 3011556 155 3799036 10036
17:15:531326872 2867432 155 3963900 8280
17:15:541445332 2748972 155 3969020 9632
17:15:551457348 2736956 155 4093928 40304
17:15:571598944 2595360 155 4139004 10568
17:15:581646460 2547844 155 4276220 8360
17:15:591689776 2504528 155 4365308 8476
17:16:001882924 2311380 155 4453372 89048
17:16:042049676 2144628 155 4567036 9860
17:16:062219004 1975300 155 4733948 10264
17:16:082389028 1805276 155 4903932 10012
17:16:102558516 1635788 155 5047396 99916
17:16:122735864 1458440 157 5254428 10532
17:16:152917096 1277208 158 5475788 100944
17:16:173103632 1090672 159 5716856 98876
17:16:203231660 962644  159 5929908 40144
17:16:243498256 696048  159 6168504 68192
17:16:283675324 518980  159 6238136 10700
17:16:303861972 332332  159 6414172 8500
17:16:324032132 162172  157 6556588 10324
17:16:334054828 139476  155 6718460 7932
17:16:344176548 17756   155 6719536 9584
17:16:384193604 700 155 6799908 71096
17:16:39100580  4093724 151 1132240 1640856
17:16:4096336   4097968 151 1123620 1637372
17:16:4196332   4097972 153 1189608 1613748
17:16:4296328   4097976 156 1269164 1529760
17:16:4396272   4098032 154 1147628 1602084


Cheers,
:g

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


ifconfig-i386 on 8-STABLE/amd64

2010-04-15 Thread Jonathan Chen
Hi,

I've got a 32-bit build on a 8-STABLE/amd64 machine, built using
instructions at:

http://wiki.freebsd.org/Wine#head-6963d527c173e57b1567e881305b544d33435b6d

And I'm noticing some odd output from ifconfig (commands are run from
the non-chrooted env):

$ /sbin/ifconfig
xl0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500
options=8VLAN_MTU
ether 00:60:97:a4:7f:82
inet 192.168.1.10 netmask 0xff00 broadcast 192.168.1.255
inet 192.168.1.11 netmask 0x broadcast 192.168.1.11
media: Ethernet autoselect (100baseTX full-duplex)
status: active
fwe0: flags=8802BROADCAST,SIMPLEX,MULTICAST metric 0 mtu 1500
options=8VLAN_MTU
ether 02:1d:7d:f0:67:8b
ch 1 dma -1
fwip0: flags=8802BROADCAST,SIMPLEX,MULTICAST metric 0 mtu 1500
lladdr 0.1d.7d.0.0.f0.67.8b.a.2.ff.fe.0.0.0.0
re0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500

options=389bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM,WOL_UCAST,WOL_MCAST,WOL_MAGIC
ether 00:1d:7d:9d:3f:1f
inet 192.168.2.1 netmask 0xff00 broadcast 192.168.2.255
inet 192.168.2.10 netmask 0x broadcast 192.168.2.10
inet 192.168.2.12 netmask 0x broadcast 192.168.2.12
media: Ethernet autoselect (none)
status: no carrier
plip0: flags=8810POINTOPOINT,SIMPLEX,MULTICAST metric 0 mtu 1500
ipfw0: flags=8801UP,SIMPLEX,MULTICAST metric 0 mtu 65536
lo0: flags=8049UP,LOOPBACK,RUNNING,MULTICAST metric 0 mtu 16384
options=3RXCSUM,TXCSUM
inet6 fe80::1%lo0 prefixlen 64 scopeid 0x7 
inet6 ::1 prefixlen 128 
inet 127.0.0.1 netmask 0xff00 
nd6 options=3PERFORMNUD,ACCEPT_RTADV

$ /compat/i386/sbin/ifconfig
: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST
inet 192.168.1.10 netmask 0xff00 broadcast 192.168.1.255
inet 192.168.1.11 netmask 0x broadcast 192.168.1.11
inet 192.168.2.1 netmask 0xff00 broadcast 192.168.2.255
inet 192.168.2.10 netmask 0x broadcast 192.168.2.10
inet 192.168.2.12 netmask 0x broadcast 192.168.2.12
inet 127.0.0.1 netmask 0xff00 
ifconfig: ioctl(SIOCGIFINFO_IN6): Device not configured

Why is the output from the 32-bit ifconfig severely mangled?

Cheers.
-- 
Jonathan Chen j...@chen.org.nz
---
I love deadlines. I like the whooshing sound they make as they fly by
- Douglas Adams
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: ifconfig-i386 on 8-STABLE/amd64

2010-04-15 Thread Xin LI
Hi,

On Thu, Apr 15, 2010 at 2:59 PM, Jonathan Chen j...@chen.org.nz wrote:
    $ /compat/i386/sbin/ifconfig
    : flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST
        inet 192.168.1.10 netmask 0xff00 broadcast 192.168.1.255
        inet 192.168.1.11 netmask 0x broadcast 192.168.1.11
        inet 192.168.2.1 netmask 0xff00 broadcast 192.168.2.255
        inet 192.168.2.10 netmask 0x broadcast 192.168.2.10
        inet 192.168.2.12 netmask 0x broadcast 192.168.2.12
        inet 127.0.0.1 netmask 0xff00
    ifconfig: ioctl(SIOCGIFINFO_IN6): Device not configured

 Why is the output from the 32-bit ifconfig severely mangled?

My understanding is that 32-bit management tools like ifconfig is not
supported on 64-bit FreeBSD.  Someone has to sit down and write a
compatibility layer for translating back and forth :(

Cheers,
--
Xin LI delp...@delphij.net http://www.delphij.net
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: ifconfig-i386 on 8-STABLE/amd64

2010-04-15 Thread Jonathan Chen
On Thu, Apr 15, 2010 at 04:12:11PM -0700, Xin LI wrote:
 Hi,
 
 On Thu, Apr 15, 2010 at 2:59 PM, Jonathan Chen j...@chen.org.nz wrote:
  ?? ??$ /compat/i386/sbin/ifconfig
  ?? ??: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST
  ?? ?? ?? ??inet 192.168.1.10 netmask 0xff00 broadcast 192.168.1.255
  ?? ?? ?? ??inet 192.168.1.11 netmask 0x broadcast 192.168.1.11
  ?? ?? ?? ??inet 192.168.2.1 netmask 0xff00 broadcast 192.168.2.255
  ?? ?? ?? ??inet 192.168.2.10 netmask 0x broadcast 192.168.2.10
  ?? ?? ?? ??inet 192.168.2.12 netmask 0x broadcast 192.168.2.12
  ?? ?? ?? ??inet 127.0.0.1 netmask 0xff00
  ?? ??ifconfig: ioctl(SIOCGIFINFO_IN6): Device not configured
 
  Why is the output from the 32-bit ifconfig severely mangled?
 
 My understanding is that 32-bit management tools like ifconfig is not
 supported on 64-bit FreeBSD.  Someone has to sit down and write a
 compatibility layer for translating back and forth :(

Ah. The reason I ask is because wine's (32-bit) network-i/f iteration code
is breaking when running under amd64. It uses code similar to
ifconfig's to determine available network interfaces. I didn't expect
the read-only interface would break when crossing the 32 to 64 barrier.

Thanks for the analysis.
-- 
Jonathan Chen j...@chen.org.nz
--
 In mathematics you don't understand things.
   You just get used to them.
 - Johann von Neumann
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: NFS permission strangeness

2010-04-15 Thread Rick Macklem



On Thu, 15 Apr 2010, Giulio Ferro wrote:


Here's the setup:
server : NFS server machine (fb 8 stable amd64 )
client : NFS client machine (as above)

server and client are both sharing the same permission database through ldap:

Both have in /etc/nsswitch.conf
...
group: files ldap
...
passwd: files ldap

This issue isn't related to ldap, however. I get the same result if I 
manually add

groups to /etc/group file (read on)

Let's suppose I have user giulio configured in my system.
giulio is also part (-G) of groups:
group1, group2, group3, ... , group10

server is exporting the directory
/path/to/root (on zfs)

the directory
/path/to/root/dir/etc/subdir1
has permission 770 and group ownership group3

I login as user giulio on server I can enter subdir1 directory, since I'm
member of group group3

I then login as user giulio on client, and I can do the same (as expected).


When groups are more than a few, however, I get this strange behavior:

let's suppose the directory:
/path/to/root/dir/etc/subdir2
has permission 770 and group ownership group10

What happens is that I can access subdir2 on the server machine when I
login as giulio, but when I try to access that same dir on the client 
machine

I get:
$ cd /path/to/root/dir/etc
(ok)
$ cd subdir2
subdir2/: Permission denied.



Yes, it should work. I just tried the same thing with a server running
UFS/FFS and it worked fine, so I think that the problem might be ZFS 
related. (You will get into trouble with more than 16 groups, since

that is all that AUTH_SYS for Sun RPC handles, but I did 10 like your
example and it worked ok for me, using FreeBSD-CURRENT client/server,
except that my server uses UFS/FFS.)

Hopefully someone with ZFS expertise can help out here?

If you can conveniently do the same test using a server that exports
a UFS/FFS file system, that would be helpful w.r.t. isolating the
problem.

rick
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org