I agree with Pete. And while Python doesn't have built-in statistics
functions, adding packages (numpy and scipy in this case) is very simple.
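For example, a minimal sketch with numpy (the filename and column index
here are just placeholders):

    import numpy as np

    # Load a tab-delimited table of numbers (hypothetical filename).
    data = np.loadtxt("table.tsv", delimiter="\t")

    col = data[:, 2]                # pick one column (index is arbitrary)
    print(col.mean(), col.std())    # average and sigma, one call each

    # Sort rows by that column, then reject rows more than 3 sigma out.
    data = data[data[:, 2].argsort()]
    kept = data[np.abs(data[:, 2] - col.mean()) < 3.0 * col.std()]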
Quentin
On 12/09/2012 17:11, Pete Meyer wrote:
One thing to keep in mind is that there's usually a trade-off between
setup (writing and testing) and execution time. For one-off data
processing, I'd focus on implementation speed rather than execution
speed (in other words, FORTRAN might not be ideal unless you're
already fluent in it).
That said, I'd take a look at Python, Octave or R. Python's
relatively easy to learn, and more flexible than Octave/R; but it
doesn't have the built-in statistics functions that Octave and R do.
One other tip which you've probably already thought of - depending on
your runtimes (I don't think 100s of MB of data is usually considered
an enormous amount, but it'll depend on what you're doing), it may be
worth getting things working on a small subset of the data first.
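Something like this, say (guessing at your format; itertools.islice
pulls the first N lines without reading the whole file):

    import itertools

    # Prototype on the first 10,000 rows before running on the full file.
    with open("table.tsv") as f:            # hypothetical filename
        subset = [line.rstrip("\n").split("\t")
                  for line in itertools.islice(f, 10000)]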
Pete
Jacob Keller wrote:
Dear List,
Since this probably comes up a lot in manipulation of PDB/reflection
files and so on, I was curious what people thought would be the best
language for the following: I have some huge (100s of MB) tables of
tab-delimited data on which I would like to do some math (averaging,
sigmas, simple arithmetic, etc.) as well as some sorting and rejecting.
It can be done in Excel, but this is exceedingly slow even in 64-bit,
so I am looking to do it through some scripting. Just as an example, a
"sort" which takes >10 min in Excel takes ~10 sec max with the unix
sort command (seems crazy, no?). Any suggestions?
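For concreteness, I imagine even a naive script shaped like this
(Python here purely as a strawman; the column number is invented)
would beat Excel:

    import csv

    # Sort the rows of a tab-delimited file by one numeric column.
    with open("table.tsv") as f:            # hypothetical filename
        rows = list(csv.reader(f, delimiter="\t"))
    rows.sort(key=lambda r: float(r[3]))    # column index made up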
Thanks, and sorry for being off-topic,
Jacob