I'm working on figuring out how to do that. I've got a crash dump now I just need to figure out how to get the backtrace.
On Tue, Jun 14, 2011 at 6:17 PM, Derrick Brashear <[email protected]> wrote: > the backtrace from a kernel dump would be far more useful, if you have > a way to collect one. > > On Tue, Jun 14, 2011 at 5:56 PM, Aaron Knister <[email protected]> wrote: > > Good afternoon! > > I'm writing to report a deadlock issue I'm seeing on Solaris 10. > > What I've observed is that when a file larger than the configured size of > > the cache is copied out of AFS the cache manager deadlocks and all access > to > > /afs on the affected system hangs until the system is rebooted. The issue > > occurs with a memory cache as well as a disk cache. > > > > The issue can be mitigated if the cache size is raised to the value of > > roughly half of the physical memory in the given system. The issue > appeared > > somewhere between Solaris 10 "u8" and "u9." > > I've reproduced the problem using OpenAFS 1.4.14.1, 1.5.78 and 1.6.0pre6 > and > > a Solaris 10 "u8" system with all of the latest patches applied. > > I've put together a tar file containing: > > - An fstrace dump starting a few seconds before I initiated the copy > > - A stack trace of the hung cp command > > - The output of cmdebug -long -server localhost run after AFS hangs > > The individual files as well as a tar file of them can be found here: > > http://userpages.umbc.edu/~aaronk/afs/solaris10-deadlock-issue. > > Any help would be greatly appreciated. > > Best, > > Aaron > > -- > > Aaron Knister > > Systems Administrator > > Division of Information Technology > > University of Maryland, Baltimore County > > [email protected] > > > > > > -- > Derrick > -- Aaron Knister Systems Administrator Division of Information Technology University of Maryland, Baltimore County [email protected]
