Duncan Murdoch wrote:
On 11/12/2008 6:04 PM, Terry Therneau wrote:
I'm making the move of the survival package from my own environment to
the standard R package structure, and have stumbled into a vacuum. The
R Extensions manual has really nice instructions about how to lay out
the directories, order the files, and run tests for DISTRIBUTION of a
product, but I can't find anything on how to set up a reasonable
DEVELOPMENT environment.
In my local world, I had the .c and .s files in a common directory,
with a Makefile that I had created, and the test suite in a
subdirectory. Debugging and development was quite nice:

    make
    cd test
    R
    attach("..")
    # try something, and perhaps it fails
    q()
    cd ..

Fix and repeat. The Makefile took some time to create, but paid for
itself a hundred times over.
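A sketch of such a development Makefile, assuming the C sources sit in
the current directory and the interactive tests in test/ (the file and
target names here are hypothetical, not the actual survival Makefile):

```makefile
# Hypothetical development Makefile: "make" rebuilds the shared
# library, "make test" drops into the test suite afterwards.
# (Recipe lines must be indented with tabs.)

CSRC := $(wildcard *.c)

all: survival.so

# R CMD SHLIB supplies R's include paths and compiler flags
survival.so: $(CSRC)
	R CMD SHLIB -o survival.so $(CSRC)

test: all
	cd test && R --no-save
```

The point of the `survival.so: $(CSRC)` dependency is that only a
changed .c file triggers recompilation, which keeps the edit/load
cycle short.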
So, I've now rearranged everything into standard R order. Then I did
the only thing I could find:

    R CMD INSTALL ~/myRlib survival

where "survival" is said directory. This turns out to be not useful at
all.
The survival package is large, and I rather suspected that I would
goof something up, and I did, resulting in the following message:

    Error in parse(n = -1, file = file) : unexpected end of input at
    14450: }
    14451:

It is not exactly obvious which of the 132 files in my R/ directory is
the culprit here.
That's something I would like to fix too. There are (at least) two
possible ways: stop concatenating the files (which is not really needed
nowadays, most packages install saved images), or add some markup to the
concatenated file so that the parser can report on the original filename
and line number (like the #line directives output by the C preprocessor).
I would certainly appreciate a fix for this. When this has happened to
me, I usually end up just sourcing the individual files into an R
session until I find the bad file, and get a report with the right line
number. It always seems like a lot of work for something that should be
trivial.
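That manual hunt can at least be automated. A sketch, assuming the
package sources sit in an R/ directory: parse each file separately, so
the parser reports the failing file (and its own line numbers) rather
than a position in the concatenated whole.

```r
# Parse each source file under R/ on its own; a syntax error is then
# reported against that file rather than the concatenated bundle.
for (f in list.files("R", pattern = "\\.[RrSs]$", full.names = TRUE)) {
    res <- tryCatch(parse(file = f), error = function(e) e)
    if (inherits(res, "error"))
        cat("Syntax error in", f, ":", conditionMessage(res), "\n")
}
```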
In general:
1. The library is large, and recompiling/reparsing everything is very
far from instantaneous. It is not the edit/load cycle I desire.
If you install from the directory, the compiling should only be done
once (unless you change a file, of course). (The alternative is
installing from the tarball, which is recommended for later stages of
testing before distribution, because it's possible something could go
wrong in building the tarball. But it won't include any object files,
so you'll recompile every time.)
You can also use option "--docs=none" to skip building the help system;
this will save a bit of time.
2. I take testing seriously: the test suite takes on the order of 15
minutes to run on a fast machine. I most certainly don't want to run
it mid-cycle.
I don't quite follow this. If you want to run all your tests, you would
use R CMD check. If you only want to run some of them, then you can
source things out of the tests directory while running interactively.
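Concretely, after loading the installed development version you can
run just one test file in an interactive session instead of the full
check (the test file name below is hypothetical):

```r
library(survival)        # the installed development version
source("tests/coxph.R")  # run only the test file of interest
                         # (hypothetical file name)
```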
Someone must have tackled this. I'm hoping that there is some
documentation that I have managed to overlook which discusses a good
setup for this middle ground between conceiving of a library and
packaging it for delivery; the "build, test, make sure it actually
works" part of development.
In my experience there are two different sorts of problems that I think
could benefit from some improvement. The first is that there should be a
standard way to have extra tests, that do not get run in the normal CRAN
testing cycle, or by developers when using a "quick" R CMD check, but
can be run with a standard mechanism. I do this by putting the tests in
a separate package, and I have seen reports of different mechanisms, but
I think they are all somewhat ad hoc. Currently, if you put too much
testing in your package then all the testing gets omitted on some CRAN
testing platforms. Just a common directory like extraTests/ would be a
good start.
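Under that proposal, running the extra tests could be a one-liner over
the directory (extraTests/ is the suggested convention here, not an
existing one):

```r
# Run every script in the proposed extraTests/ directory; source()
# stops at the first failure, much as tests/ does under R CMD check.
for (f in list.files("extraTests", pattern = "\\.R$", full.names = TRUE)) {
    cat("Running", f, "\n")
    source(f, echo = TRUE)
}
```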
The second problem is that a developer usually needs to run tests/ when
code in R/ has been changed, but probably not run tests/ when the
changes are only in man/, demos/, or inst/doc/. The checking that needs
to be done in those cases is reduced from the full R CMD check. A
common way to attack this is with a good Makefile. I wrote an article in
R News a few years ago about my attempts to do this, and my Makefiles
are available, but there is some customization necessary, and there is
lots of room for improvement. It does (mostly) work with make -j, so
days of testing on a single processor machine can be accomplished in a
few hours on multicore machines (your mileage may vary). I have not
addressed the idea of trying to specialize files in tests/ to specific
code files in R/. (I think others have tried to do this with a "unit
testing" approach.)
Paul Gilbert
I find the process I follow is to organize the files in the distribution
structure from the beginning. When adding new functions, I'll
generally use source() a few times to get the syntax right, and perhaps
run simple tests. (But remember, if you use a NAMESPACE, the functions
may not behave the same when they're sourced into the global
environment.) In the early stages, I'll do a lot of installs of the
package.
If I were porting a big package and wanted to find syntax errors, to
work around the not-very-helpful error message you saw I'd do
something like

    for (f in list.files("pkg/R", full = TRUE)) source(f)

which will report the error more informatively.
Duncan Murdoch
______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel