That seems like a long time. We managed to push all 10,000 MIMIC files through cTAKES in a little over 2.5 hours (and this was on a VM) using the default settings.
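
To put the two figures side by side, here is a rough back-of-the-envelope sketch in Python. It uses only the numbers quoted in this thread (10,000 files in about 2.5 hours, versus 28 seconds per file for a batch of 13,414). The function names are made up for illustration, and the `workers` parameter assumes ideal linear scaling across CPUs, which real runs are unlikely to achieve.

```python
def seconds_per_file(total_seconds: float, n_files: int) -> float:
    """Average wall-clock seconds spent per file over a whole batch."""
    return total_seconds / n_files


def projected_hours(n_files: int, secs_per_file: float, workers: int = 1) -> float:
    """Projected batch time in hours.

    Assumption: work divides evenly across `workers` with no overhead
    (ideal linear scaling), so this is an optimistic lower bound.
    """
    return n_files * secs_per_file / workers / 3600


# Rate implied by the 10,000-file MIMIC run (a little over 2.5 hours).
reported = seconds_per_file(2.5 * 3600, 10_000)   # about 0.9 s per file

# The 13,414-file batch at the observed 28 s per file, one file at a time.
serial = projected_hours(13_414, 28.0)            # about 104 hours

# Same rate, ideally split across the 3 CPUs of the VM.
three_way = projected_hours(13_414, 28.0, workers=3)   # about 35 hours

# Same batch at the per-file rate implied by the MIMIC run.
at_reported = projected_hours(13_414, reported)   # about 3.4 hours

print(f"{reported:.2f} s/file, {serial:.1f} h, {three_way:.1f} h, {at_reported:.1f} h")
```

Even with perfect 3-way parallelism, 28 seconds per file is roughly ten times slower than the MIMIC run, so the per-file time itself is worth looking at before adding parallelism.
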
Greg--

On Tue, Jan 29, 2019 at 11:59 AM Baas,Leah <[email protected]> wrote:

> Hi all,
>
> I would like to process a batch of 13,414 files (avg file size 6.2 KB)
> using the default clinical pipeline. I am new to cTAKES and computer
> programming, and I’m looking for guidance on how to process these files
> with maximum time/CPU efficiency. I am currently running my program on an
> Ubuntu VM with 3 CPUs. It takes me 28 seconds (real time) to process one
> 6.0 KB file. I’m reading up on parallel processing strategies, but would be
> grateful for any suggestions, tips, etc. that you might have!
>
> Thanks,
>
> Leah

--
Greg M. Silverman
Senior Systems Developer
NLP/IE <https://healthinformatics.umn.edu/research/nlpie-group>
Cardiovascular Informatics <http://www.med.umn.edu/cardiology/>
University of Minnesota
[email protected]
› evaluate-it.org ‹
