A little story

bearophile Sun, 24 Jun 2012 14:57:46 -0700

Just a recent little coding story.

Python is a good language for "exploratory programming", becausethe programs are succinct, it's flexible, its data structures areeasy to use and good, there are already written modules to domost things, there are very easy to use libraries to plot resultsor import data and so on.

Doing this kind of programing I have written a smallsingle-module command line Python program, just few hundred lineslong. Once debugged it seemed to work, but only up to N=18,beyond that it uses too much memory and CPU time for me.

To solve the performance problem I've ported it to D. Thistranslation didn't took a lot of time, and the resultingsingle-module D program was just 10-15% longer than the Pythonmodule. But it didn't work, the results were different from thePython veersion. I have spent several hours to find the bug, thatwas a nasty D associative array bug already present in Bugzilla.

This was not nice, but once fixed that, it gave the same resultsas the Python version from N=1 to N=18. Now the program is tensof times faster and I'm able to solve for N=19 and even N=20,good. Being this exploratory programming I don't know the correctresults for those N=19 and N=20. I am only able to estimate thecorrect results and the estimate is compatible with the resultsgiven by the D program. This is good, but spending some hours tolook for a bug has made me suspicious of the D program, maybe itwas buggy still.

So I translate the Python program to FreePascal (a modern freeObject Pascal, similar to Delphi). This translation wasn't hardto do, I have used one library of mine that implementsassociative arrays in FreePascal, but it was a little slow andboring, and the resulting program was long.

When I run the FreePascal program it gives the same results fromN=1 to N=18 as the Python version, this is good. But with N=19 itgave a run-time integer overflow error. Up to N=18 a certain 32bit int variable was enough, but right with N=19 the programtried to store inside it a value past 2 billions. A compilationswitch of FreePascal allows to turn run-time overflows of allintegral values used by the program into run-time errors.

I have replaced that 32 bit int with a 64 bit int in theFreePascal code. Running again the FreePascal program again Ihave found another integral overflow error. If I replace thatvariable too with a 64 bit int, the program runs and it gives anumber slightly different from the number given by the D code. IfI compile and run the original FreePascal code without therun-time overflow tests it gives the same results as the Dversion for N=19 and N=20.

I can study the D code, looking for variables that causeoverflow, but this study requires time, because the program issmall, but its algorithms are intricate.

So for this program born from explorative programming that doessome integral number crunching:- Python language is not fit because it's too much slow andbecause in certain cases I prefer a little stronger static typesafety, that's useful to not waste time debugging the usage ofintricate data structures.- FreePascal is not fit because it's not flexible enough andbecause it's too much low-level for a exploration that must bequick.- And D is too much unsafe for such kind of programs, becauseintegral numbers can silently overflow.

Maybe C# running on Mono is a good enough language for thosepurposes (it's probably fast enough despite not running on thedotnet and it has structs to save memory, it detects run-timeintegral overflows, it has data structures and maybe it'sflexible enough).


Bye,
bearophile

A little story

Reply via email to