Hi all,

I am trying to use parallel for regular Linux commands, as I have to deal
with huge files on daily basis. But few times I have tried, I don't see any
improvement. Is there a threshold for the file size after which the
parallel is beneficial? Or am I doing it wrong?
Eg.,

$ time head -n 1000000 huge.vcf  | parallel --pipe "awk '{print $123}'"  |
wc -l
1000000

Wall Time       0m29.326s
User Mode       0m22.489s
Kernel Mode     17m55.061s
CPU Usage       3745.90%

$ time head -n 1000000 huge.vcf | awk '{print $123}' | wc -l
1000000

Wall Time       0m10.329s
User Mode       0m12.447s
Kernel Mode     0m4.540s
CPU Usage       164.46%


Thanks very much!

-- 
Arun

Reply via email to