another thing you can do is to run it with -debug and capture all of the terminal output (both stdout and stderr). It will produce huge mounds of output, but it might be the best way.

doug

Bruce Fischl wrote:

do you know what process it was running when it failed? You could use -autorecon2-cp and -autorecon3. Something else seems like it is wrong though - you shouldn't need 3G of ram.

On Tue, 13 Nov 2007, Nicolas Cherbuin wrote:

Hi Bruce,

Thanks for your answers. I am not sure how much more ram might be needed dependant on anatomy but my last job failed due to FS using more than 3 Gig

PBS: job killed: per node vmem 34078384kb exceeded limit 2097152kb

or does it suggest there is a problem with our configuration?

Also, a related question. Since a number of jobs have failed after many hours of processing, I have had to investigate restarting jobs to finish processing the right hemisphere. For instance after the following log:

status file for recon-all
Thu Nov  8 23:13:39 EST 2007
[EMAIL PROTECTED] MotionCor Thu Nov  8 23:13:53 EST 2007
[EMAIL PROTECTED] Nu Intensity Correction Thu Nov  8 23:14:33 EST 2007
[EMAIL PROTECTED] Talairach Thu Nov  8 23:15:44 EST 2007
[EMAIL PROTECTED] Talairach Failure Detection Thu Nov  8 23:18:32 EST 2007
[EMAIL PROTECTED] Intensity Normalization Thu Nov  8 23:18:32 EST 2007
[EMAIL PROTECTED] Skull Stripping Thu Nov  8 23:25:47 EST 2007
[EMAIL PROTECTED] EM Registration Fri Nov  9 00:11:07 EST 2007
[EMAIL PROTECTED] CA Normalize Fri Nov  9 00:31:03 EST 2007
[EMAIL PROTECTED] CA Reg Fri Nov  9 00:37:06 EST 2007
[EMAIL PROTECTED] CA Reg Inv Fri Nov  9 16:56:14 EST 2007
[EMAIL PROTECTED] Remove Neck Fri Nov  9 16:59:57 EST 2007
[EMAIL PROTECTED] SkullLTA Fri Nov  9 17:04:39 EST 2007
[EMAIL PROTECTED] SubCort Seg Fri Nov  9 17:37:12 EST 2007
[EMAIL PROTECTED] CC Seg Fri Nov  9 18:57:34 EST 2007
[EMAIL PROTECTED] Intensity Normalization2 Fri Nov  9 18:59:22 EST 2007
[EMAIL PROTECTED] Mask BFS Fri Nov  9 19:08:32 EST 2007
[EMAIL PROTECTED] WM Segmentation Fri Nov  9 19:08:45 EST 2007
[EMAIL PROTECTED] Fill Fri Nov  9 19:17:29 EST 2007
[EMAIL PROTECTED] Tessellate lh Fri Nov  9 19:19:57 EST 2007
[EMAIL PROTECTED] Smooth1 lh Fri Nov  9 19:20:37 EST 2007
[EMAIL PROTECTED] Inflation1 lh Fri Nov  9 19:20:56 EST 2007
[EMAIL PROTECTED] QSphere lh Fri Nov  9 19:24:27 EST 2007
[EMAIL PROTECTED] Fix Topology lh Fri Nov  9 19:58:03 EST 2007
[EMAIL PROTECTED] Make Final Surf lh Sat Nov 10 03:17:04 EST 2007
[EMAIL PROTECTED] Smooth2 lh Sat Nov 10 07:11:02 EST 2007
[EMAIL PROTECTED] Inflation2 lh Sat Nov 10 07:11:19 EST 2007
[EMAIL PROTECTED] Sphere lh Sat Nov 10 07:19:55 EST 2007
[EMAIL PROTECTED] Surf Reg lh Sat Nov 10 10:29:15 EST 2007
[EMAIL PROTECTED] Jacobian white lh Sat Nov 10 17:43:19 EST 2007
[EMAIL PROTECTED] AvgCurv lh Sat Nov 10 17:43:29 EST 2007
[EMAIL PROTECTED] Cortical Parc lh Sat Nov 10 17:43:37 EST 2007
[EMAIL PROTECTED] Parcellation Stats lh Sat Nov 10 17:44:55 EST 2007
[EMAIL PROTECTED] Cortical Parc 2 lh Sat Nov 10 17:45:30 EST 2007
[EMAIL PROTECTED] Parcellation Stats 2 lh Sat Nov 10 17:47:12 EST 2007
[EMAIL PROTECTED] Tessellate rh Sat Nov 10 17:47:58 EST 2007
[EMAIL PROTECTED] Smooth1 rh Sat Nov 10 17:48:35 EST 2007
[EMAIL PROTECTED] Inflation1 rh Sat Nov 10 17:48:55 EST 2007
[EMAIL PROTECTED] QSphere rh Sat Nov 10 17:52:32 EST 2007
[EMAIL PROTECTED] Fix Topology rh Sat Nov 10 18:26:17 EST 2007


I initially thought I could restart with -autorecon3 but since this did not work I restarted with:

recon-all -s subject -autorecon2 -hemi right

but this (log below) restarted with a number of (very lengthy!!!) processes that had been completed before. What is the most efficient way to restart a process and is there a way of restarting autorecon2 and have it automatically followed by autorecton3?

Sun Nov 11 11:29:26 EST 2007
[EMAIL PROTECTED] EM Registration Sun Nov 11 11:29:26 EST 2007
[EMAIL PROTECTED] CA Normalize Sun Nov 11 11:48:51 EST 2007
[EMAIL PROTECTED] CA Reg Sun Nov 11 11:54:14 EST 2007
[EMAIL PROTECTED] CA Reg Inv Mon Nov 12 04:21:18 EST 2007
[EMAIL PROTECTED] Remove Neck Mon Nov 12 04:25:00 EST 2007
[EMAIL PROTECTED] SkullLTA Mon Nov 12 04:29:31 EST 2007
[EMAIL PROTECTED] SubCort Seg Mon Nov 12 05:01:49 EST 2007
[EMAIL PROTECTED] CC Seg Mon Nov 12 06:21:48 EST 2007
[EMAIL PROTECTED] Intensity Normalization2 Mon Nov 12 06:23:50 EST 2007
[EMAIL PROTECTED] Mask BFS Mon Nov 12 06:32:52 EST 2007
[EMAIL PROTECTED] WM Segmentation Mon Nov 12 06:33:05 EST 2007
[EMAIL PROTECTED] Fill Mon Nov 12 06:41:56 EST 2007
[EMAIL PROTECTED] Tessellate rh Mon Nov 12 06:44:23 EST 2007
[EMAIL PROTECTED] Smooth1 rh Mon Nov 12 06:45:00 EST 2007
[EMAIL PROTECTED] Inflation1 rh Mon Nov 12 06:45:19 EST 2007
[EMAIL PROTECTED] QSphere rh Mon Nov 12 06:48:48 EST 2007
[EMAIL PROTECTED] Fix Topology rh Mon Nov 12 07:22:02 EST 2007
[EMAIL PROTECTED] Make Final Surf rh Mon Nov 12 15:26:46 EST 2007
[EMAIL PROTECTED] Smooth2 rh Mon Nov 12 19:18:37 EST 2007
[EMAIL PROTECTED] Inflation2 rh Mon Nov 12 19:18:55 EST 2007
[EMAIL PROTECTED] ASeg Stats Mon Nov 12 19:27:09 EST 2007
[EMAIL PROTECTED] Cortical ribbon mask rh Mon Nov 12 20:03:45 EST 2007

Finally, what is the best way to restart it from the last memory exceeded failure which interupted processing towards the end of autorecon2?

Thanks

Nic



-----Original Message-----
From:    Bruce Fischl [mailto:[EMAIL PROTECTED]
Sent:    Sun 11/11/2007 4:30 PM
To:    Nicolas Cherbuin
Cc:    [email protected]
Subject: Re: [Freesurfer] reprocessing on FS4.0 takes much longer for some scans

Hi Nicolas,

1. No, sorry, the amount of RAM is sometimes dependent on the
individual anatomy, and in any case can't be predefine.

2. Not sure about the itaniums. We have no real experience. 72 hours does
sound pretty long. That's I think about what it used to take on our old
athlons. Can you extend the time limit?

As for the random stopping of recon-all, we have seen that sometimes as
well, and are trying to track it down. It seems pretty mysterious, as a
binary will exit with a nonzero exit code according to the shell even
though the last printf in the code has been executed and the next statement
is an exit(0).

Bruce

On
Mon, 12 Nov 2007, Nicolas Cherbuin wrote:

Hi,

At the beginning of the year I processed 400 scans on a linux cluster. It had a reported ram limit of 1 Gig but it could cope with the slight excess of FS3 and a processing time limit of 48 hours (which was fine for 99% of the scans). Most scans went through autorecon-all without problem.

I am now trying to reprocess the scans with the new version but I am running into a number of problems. On the same linux cluser processing the same scans, some scans (~30%) run without problem. The rest either fail because they exceed the memory limit or because they take much more than 48 hours (the jobs are being killed and the logs report only the left and sometimes part of the right hemisphere being processed).

Since the documentation makes clear that FS works best with 2 Gig of ram, I have switched to an Itinium cluster with 2 Gig ram limit and 48 hours processing time limit. When I compare the logs of the same scans processed on both systems the Itinium cluster seems to take longer and although I am still running tests it appears that for at least some scans autoreconall might take 70+ hours.

Here are my questions:

1. On the linux cluster can I tell freesurfer not to exceed a certain ram allocation? and if yes how?

2. Do the problems I have on the Itinium cluster suggest that FS is badly configured on this system? and if yes where should we look? (I don't have access to this system's configuration and I have to feedback to the system managers to fix eventual problems.)

Thank you very much for your help and for sharing these great tools with us.

Nic

_______________________________________________
Freesurfer mailing list
[email protected]
https://mail.nmr.mgh.harvard.edu/mailman/listinfo/freesurfer








_______________________________________________
Freesurfer mailing list
[email protected]
https://mail.nmr.mgh.harvard.edu/mailman/listinfo/freesurfer



--
Douglas N. Greve, Ph.D.
MGH-NMR Center
[EMAIL PROTECTED]
Phone Number: 617-724-2358 Fax: 617-726-7422

In order to help us help you, please follow the steps in:
surfer.nmr.mgh.harvard.edu/fswiki/BugReporting


_______________________________________________
Freesurfer mailing list
[email protected]
https://mail.nmr.mgh.harvard.edu/mailman/listinfo/freesurfer

Reply via email to