Re: [Rd] CRAN Server download statistics (Was: R Usage Statistics)

2009-11-22 Thread Detlef Steuer
Hi!

Nice work!
But keep in mind, that for example the opensuse packages are no longer
kept up to date on CRAN, but in openSUSE's Build Service. So the stats
are biased towards windows and mac.

It seems you only count binary downloads of contributed packages?
Introduces some nice bias, too.

Nevertheless, a nice starting point. Good luck!

Detlef

On Sun, 22 Nov 2009 16:18:11 -0800
"Fellows, Ian"  wrote:

> Hi All,
> 
> It seems that the question of how may people use (or download) R, and it's 
> packages is one that comes up on a fairly regular basis in a variety of 
> forums (There was also recent thread on the subject on Stack Overflow). A 
> couple of students at UCLA (including myself), wanted to address the issue, 
> so we set up a system to get and parse the cran.stat.ucla.edu APACHE logs 
> every night, and display some basic statistics. Right now, we have a working 
> sketch of a site based on one week of observations.
> 
> http://neolab.stat.ucla.edu/cranstats/
> 
> We would very much like to incorporate data from all CRAN mirrors, including 
> cran.r-project.org. We would also like to set this up in a way that is 
> minimally invasive for the site administrators. Internally, our administrator 
> has set up a protected directory with the last couple days of cran activity. 
> We then pull that down using curl.
> 
> What would be the best and easiest way for the CRAN mirrors to share their 
> data? Is the contact information for the administrators available anywhere?
> 
> 
> Thank you,
> Ian Fellows
> 
> 
> 
> 
> From: r-devel-boun...@r-project.org [r-devel-boun...@r-project.org] On Behalf 
> Of Steven McKinney [smckin...@bccrc.ca]
> Sent: Thursday, November 19, 2009 2:21 PM
> To: Kevin R. Coombes; r-devel@r-project.org
> Subject: Re: [Rd] R Usage Statistics
> 
> Hi Kevin,
> 
> What a surprising comment from a reviewer for BMC Bioinformatics.
> 
> I just did a PubMed search for "limma" and "aroma.affymetrix",
> just two methods for which I use R software regularly.
> "limma" yields 28 hits, several of which are published
> in BMC Bioinformatics.  Bengtsson's aroma.affymetrix paper
> "Estimation and assessment of raw copy numbers at the single locus level."
> is already cited by 6 others.
> 
> It almost seems too easy to work up lists of usage of R packages.
> 
> Spotfire is an application built around S-Plus that has widespread use
> in the biopharmaceutical industry at a minimum.  Vivek Ranadive's
> TIBCO company just purchased Insightful, the S-Plus company.
> (They bought Spotfire previously.)
> Mr. Ranadive does not spend money on environments that are
> not appropriate for deploying applications.
> 
> You could easily cull a list of corporation names from the
> various R email listservs as well.
> 
> Press back with the reviewer.  Reviewers can learn new things
> and will respond to arguments with good evidence behind them.
> Good luck!
> 
> 
> Steven McKinney
> 
> 
> 
> From: r-devel-boun...@r-project.org [r-devel-boun...@r-project.org] On Behalf 
> Of Kevin R. Coombes [krcoom...@mdacc.tmc.edu]
> Sent: November 19, 2009 10:47 AM
> To: r-devel@r-project.org
> Subject: [Rd] R Usage Statistics
> 
> Hi,
> 
> I got the following comment from the reviewer of a paper (describing an
> algorithm implemented in R) that I submitted to BMC Bioinformatics:
> 
> "Finally, which useful for exploratory work and some prototyping,
> neither R nor S-Plus are appropriate environments for deploying user
> applications that would receive much use."
> 
> I can certainly respond by pointing out that CRAN contains more than
> 2000 packages and Bioconductor contains more than 350. However, does
> anyone have statistics on how often R (and possibly some R packages) are
> downloaded, or on how many people actually use R?
> 
> Thanks,
> Kevin
> 
> __
> R-devel@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-devel
> 
> __
> R-devel@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-devel
> 
> __
> R-devel@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-devel

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


[Rd] CRAN Server download statistics (Was: R Usage Statistics)

2009-11-22 Thread Fellows, Ian
Hi All,

It seems that the question of how may people use (or download) R, and it's 
packages is one that comes up on a fairly regular basis in a variety of forums 
(There was also recent thread on the subject on Stack Overflow). A couple of 
students at UCLA (including myself), wanted to address the issue, so we set up 
a system to get and parse the cran.stat.ucla.edu APACHE logs every night, and 
display some basic statistics. Right now, we have a working sketch of a site 
based on one week of observations.

http://neolab.stat.ucla.edu/cranstats/

We would very much like to incorporate data from all CRAN mirrors, including 
cran.r-project.org. We would also like to set this up in a way that is 
minimally invasive for the site administrators. Internally, our administrator 
has set up a protected directory with the last couple days of cran activity. We 
then pull that down using curl.

What would be the best and easiest way for the CRAN mirrors to share their 
data? Is the contact information for the administrators available anywhere?


Thank you,
Ian Fellows




From: r-devel-boun...@r-project.org [r-devel-boun...@r-project.org] On Behalf 
Of Steven McKinney [smckin...@bccrc.ca]
Sent: Thursday, November 19, 2009 2:21 PM
To: Kevin R. Coombes; r-devel@r-project.org
Subject: Re: [Rd] R Usage Statistics

Hi Kevin,

What a surprising comment from a reviewer for BMC Bioinformatics.

I just did a PubMed search for "limma" and "aroma.affymetrix",
just two methods for which I use R software regularly.
"limma" yields 28 hits, several of which are published
in BMC Bioinformatics.  Bengtsson's aroma.affymetrix paper
"Estimation and assessment of raw copy numbers at the single locus level."
is already cited by 6 others.

It almost seems too easy to work up lists of usage of R packages.

Spotfire is an application built around S-Plus that has widespread use
in the biopharmaceutical industry at a minimum.  Vivek Ranadive's
TIBCO company just purchased Insightful, the S-Plus company.
(They bought Spotfire previously.)
Mr. Ranadive does not spend money on environments that are
not appropriate for deploying applications.

You could easily cull a list of corporation names from the
various R email listservs as well.

Press back with the reviewer.  Reviewers can learn new things
and will respond to arguments with good evidence behind them.
Good luck!


Steven McKinney



From: r-devel-boun...@r-project.org [r-devel-boun...@r-project.org] On Behalf 
Of Kevin R. Coombes [krcoom...@mdacc.tmc.edu]
Sent: November 19, 2009 10:47 AM
To: r-devel@r-project.org
Subject: [Rd] R Usage Statistics

Hi,

I got the following comment from the reviewer of a paper (describing an
algorithm implemented in R) that I submitted to BMC Bioinformatics:

"Finally, which useful for exploratory work and some prototyping,
neither R nor S-Plus are appropriate environments for deploying user
applications that would receive much use."

I can certainly respond by pointing out that CRAN contains more than
2000 packages and Bioconductor contains more than 350. However, does
anyone have statistics on how often R (and possibly some R packages) are
downloaded, or on how many people actually use R?

Thanks,
Kevin

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


[Rd] Dead link in Nile help documentation (PR#14079)

2009-11-22 Thread maarten . blaauw
When doing ?Nile, the url for the data source is dead. It says  http://www.=
ssfpack.com/dkbook/ but this has changed to=20
http://www.ssfpack.com/DKbook.html

Version:
 platform =3D i386-redhat-linux-gnu
 arch =3D i386
 os =3D linux-gnu
 system =3D i386, linux-gnu
 status =3D
 major =3D 2
 minor =3D 10.0
 year =3D 2009
 month =3D 10
 day =3D 26
 svn rev =3D 50208
 language =3D R
 version.string =3D R version 2.10.0 (2009-10-26)

Locale:
LC_CTYPE=3Den_US.UTF-8;LC_NUMERIC=3DC;LC_TIME=3Den_US.UTF-8;LC_COLLATE=3Den=
_US.UTF-8;LC_MONETARY=3DC;LC_MESSAGES=3Den_US.UTF-8;LC_PAPER=3Den_US.UTF-8;=
LC_NAME=3DC;LC_ADDRESS=3DC;LC_TELEPHONE=3DC;LC_MEASUREMENT=3Den_US.UTF-8;LC=
_IDENTIFICATION=3DC

Search Path:
 .GlobalEnv, package:stats, package:graphics, package:grDevices, package:ut=
ils, package:datasets, package:methods, Autoloads, package:base

Thanks,

Maarten

--
| Dr. Maarten Blaauw
|
| School of Geography, Archaeology & Palaeoecology
| Queen's University Belfast, UK
|
| www  http://www.chrono.qub.ac.uk/blaauw
| tel  +44 (0)28 9097 3895

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


[Rd] file_path_as_absolute duplicates "/" (PR#14078)

2009-11-22 Thread joehl
Full_Name: Jens Oehlschlägel
Version: 2.10.0
OS: Windows XP
Submission from: (NULL) (85.181.157.36)


file_path_as_absolute duplicates "/" for files in the root path, 
which goes back to the fact that file.path(dirname(x), basename(x)) 
currently is not guaranteed to restore x


> x <- "d:/x.RDAta"
> file_path_as_absolute(x)
[1] "d://x.RData"
> file.path(dirname(x), basename(x))
[1] "d://x.RData"
> dirname(x)
[1] "d:/"

> x <- "/"
> file.path(dirname(x), basename(x))
[1] "//"

a possible fix would be to use gsub in file.path before returning, as in 
gsub("/+","/","d://x.RData")

This would 
- standardize the path returned, e.g. help searching for this path in another
one
- make sure we can use that path in setwd(), because
> setwd("//")
Error in setwd("//") : cannot change working directory
> file.create("//a.txt")
[1] FALSE
Warning message:
In file.create("//a.txt") :
  cannot create file '//a.txt', reason 'Invalid argument'



Also note that the help on basename says

"
basename removes all of the path up to the last path separator (if any).
dirname returns the part of the path up to (but excluding) the last path
separator, or "." if there is no path separator. 
"

but obviously there is an undocumented exception for the root path (I guess to
have dirname always return a valid path that we can use in e.g. setwd())

> basename("/")
[1] ""
> dirname("/")
[1] "/"


It is not easy to understand that dirname/basename
- neither always split a string into a path and a file component (since in
"/a/b/" it avoids empty basename and "b" goes to basename)
- nor always split a string into the last token in basename and the rest in
dirname (since it avoids empty rest in "/")
So whatever the rules are, it would be helpful to find them in the help,
because
- not everyone knows what basename / dirname usually do under unix (IEEE Std
1003.1)
- R is not following IEEE Std 1003.1 (since basename("/")=="" while IEEE
requires "/")

Help should also explain why and what happens to "servername", is that
dirname, is that basename? Currently we have
> dirname(x)
[1] "."
> basename(x)
[1] "asdfasdf"
> file.path(dirname(x), basename(x))
[1] "./asdfasdf"


> version
   _
platform   i386-pc-mingw32  
arch   i386 
os mingw32  
system i386, mingw32
status  
major  2
minor  10.0 
year   2009 
month  10   
day26   
svn rev50208
language   R
version.string R version 2.10.0 (2009-10-26)

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] Surprising length() of POSIXlt vector (PR#14073)

2009-11-22 Thread Tony Plate

maech...@stat.math.ethz.ch wrote:

"PD" == Peter Dalgaard 
on Fri, 20 Nov 2009 09:54:34 +0100 writes:



PD> m...@celos.net wrote:
>> Arrays of POSIXlt dates always return a length of 9.  This
>> is correct (they're really lists of vectors of seconds,
>> hours, and so forth), but other methods disguise them as
>> flat vectors, giving superficially surprising behaviour:
>> 
>> strings <- paste('2009-1-', 1:31, sep='')

>> dates <- strptime(strings, format="%Y-%m-%d")
>> 
>> print(dates)

>> #  [1] "2009-01-01" "2009-01-02" "2009-01-03" "2009-01-04" "2009-01-05"
>> #  [6] "2009-01-06" "2009-01-07" "2009-01-08" "2009-01-09" "2009-01-10"
>> # [11] "2009-01-11" "2009-01-12" "2009-01-13" "2009-01-14" "2009-01-15"
>> # [16] "2009-01-16" "2009-01-17" "2009-01-18" "2009-01-19" "2009-01-20"
>> # [21] "2009-01-21" "2009-01-22" "2009-01-23" "2009-01-24" "2009-01-25"
>> # [26] "2009-01-26" "2009-01-27" "2009-01-28" "2009-01-29" "2009-01-30"
>> # [31] "2009-01-31"
>> 
>> print(length(dates))

>> # [1] 9
>> 
>> str(dates)

>> # POSIXlt[1:9], format: "2009-01-01" "2009-01-02" "2009-01-03" 
"2009-01-04" ...
>> 
>> print(dates[20])

>> # [1] "2009-01-20"
>> 
>> print(length(dates[20]))

>> # [1] 9
>> 
>> I've since realised that POSIXct makes date vectors easier,

>> but could we also have something like:
>> 
>> length.POSIXlt <- function(x) { length(x$sec) }
>> 
>> in datetime.R, to avoid breaking functions (like the

>> str.POSIXt method) which use length() in this way?


PD> [You need "wishlist" in the title for this sort of stuff.]

PD> I'd be wary of this. Just the other day we found that identical() broke 
PD> on some objects because a package had length() redefined as a class 
PD> method. I.e. the danger is that something wants to use length() with its 
PD> original low-level interpretation.


Yes, of course.
and Romain mentioned  str().  Note that we have needed to define
a "POSIXt" method for str(), partly just *because* of the
current anomaly:
As Tony Plate, e.g., has argued, entirely correctly in my view,
the anomaly is thatlength() and "["   are not compatible;
and while I think no R language definition says that they should
be, I still believe that you need very good reasons for them to
be incompatible, as they are for POSIXlt.

In the current case, for me the only good reason is backwards
compatibility.
My personal taste would be to change it and see what happens.
I would be willing to clean up after that change within R 'base'
and all packages I am coauthoring (quite a few), but of course
there are still a thousand more R packages..
My strong bet would be that less than 1% would be affected,
and my point guess for the percentage affected would be
rather in the order of  1/1000.

The question is if we (you too!), the R community, are willing to
bear the load of cleanup, after such a change which would really
*improve* consistency of that small corner of R.
For me, as I indicated above, I am willing to bear my share
(and actually have got it ready for R-devel)
  
Would be great to see this change!  Surely the right way to do things is 
that functions that wish to examine the low level structure of S3 
objects should use unclass() before looking at length and elements, so 
there's no reason for a class such as POSIXlt to not provide a 
logical-level length method.


At a broader level, when I've designed vector/array classes, I've 
wondered what methods I should define, but have been unable to find any 
specification of a set of methods.  When one thinks about it, there are 
actually quite a set of strongly-connected methods with quite a lot a 
behaviors to implement, e.g., length, '[' (with logical, numeric & 
character indicies, including 0 and NA possibilities), '[[', 'c', and 
then optionally 'names', and then for multi-dim objects, 'dim', 
'dimnames', etc.  Consequently, last time this discussion on length and 
'[' methods POSIXlt came up, I wrote a function that automatically 
tested behavior of all these methods on a specified class and summarizes 
the behavior.  If anyone is interested in such a thing, I'd be happy to 
dig it up and distribute it (I'd attach it to this message, but I'm on 
vacation and don't have access to the compute that I think it's on.)


-- Tony Plate


Martin Maechler, ETH Zurich (and R Core Team)

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel




__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] error message when running news()

2009-11-22 Thread Kurt Hornik
> Duncan Murdoch writes:

> On 22/11/2009 7:34 AM, Gabor Grothendieck wrote:
>> When running news() in I get this error message from print.news_db:
>> 
>>> news()
>> Error: invalid version specification 2.0.12.0.1 patched2.1.02.1.12.1.1
>> patched2.10.02.10.0 patched2.2.02.2.12.2.1 patched2.3.02.3.12.3.1
>> patched2.4.02.4.12.4.1 patched2.5.02.5.12.5.1
>> patched2.6.02.6.12.6.22.6.2 patched2.7.02.7.12.7.22.7.2
>> patched2.8.02.8.12.8.1 patched2.9.02.9.12.9.22.9.2 patched
>>> R.version.string
>> [1] "R version 2.10.0 Patched (2009-11-21 r50532)"


> This is also present in R-devel.  It's an error in utils:::print.news_db.

> The problem is that versions like "2.0.1 patched" can't be converted to 
> numeric versions, but print.news_db tries to do it to sort the entries. 
>   The repetitive message is just a bad error message from 
> .make_numeric_version.

> Presumably the intention is that "2.0.1 patched" should compare bigger 
> than "2.0.1" and less than "2.0.2", but I don't know the best fix for this.

> Duncan Murdoch

I'll have a look.

-k

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] Including local dynamic libraries

2009-11-22 Thread Elana Fertig
Thank you all so much!  It seems that using dyn.load() fixes the  
problem.


On Nov 20, 2009, at 8:16 PM, Simon Urbanek wrote:



On Nov 20, 2009, at 3:07 PM, Romain Francois wrote:


On 11/20/2009 08:43 PM, Elana Fertig wrote:

Hi All,

I am trying to install the package rjags into a local library on a
machine for which I do not have root permissions.

Using the command: R CMD INSTALL
--configure-args="--with-jags-include=${JAGSBIN}/include/JAGS
--with-jags-lib=${JAGSBIN}/lib/
--with-jags-modules=${JAGSBIN}/lib/JAGS/modules" CMD INSTALL -l
${JAGSBIN}/lib/ ../rjags_1.0.3-12.tar.gz

Where ${JAGSBIN} is defined as a local directory, the installation
seems
to be working fine. Then, I go into R and type the following
commands:

.libPaths("${JAGSBIN}/lib/")


Maybe this :

.libPaths( file.path( Sys.getenv("JAGSBIN", "lib" ) ) )



Unlikely - it's probably just a matter of setting LD_LIBRARY_PATH
correctly .. not really an R issue at all ...

Cheers,
S



library("rjags")


I get the following error

Loading required package: coda
Loading required package: lattice
Error in dyn.load(file, DLLpath = DLLpath, ...) :
unable to load shared library '${JAGSBIN}/lib/rjags/libs/rjags.so':
libjags.so.1: cannot open shared object file: No such file or
directory
Error : .onLoad failed in 'loadNamespace' for 'rjags'
Error: package/namespace load failed for 'rjags'

Ideally, I would fix this problem by adding the appropriate
libraries to
/user/local/lib64 . However, since I do not have root permissions I
am
not able to make this addition. How can I have R recognize c++
libraries
from my local path?

Thanks so much in advance for your help.



--
Romain Francois
Professional R Enthusiast
+33(0) 6 28 91 30 30
http://romainfrancois.blog.free.fr
|- http://tr.im/EAD5 : LondonR slides
|- http://tr.im/BcPw : celebrating R commit #5
`- http://tr.im/ztCu : RGG #158:161: examples of package IDPmisc

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel






__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] error message when running news()

2009-11-22 Thread Duncan Murdoch

On 22/11/2009 7:34 AM, Gabor Grothendieck wrote:

When running news() in I get this error message from print.news_db:


news()

Error: invalid version specification 2.0.12.0.1 patched2.1.02.1.12.1.1
patched2.10.02.10.0 patched2.2.02.2.12.2.1 patched2.3.02.3.12.3.1
patched2.4.02.4.12.4.1 patched2.5.02.5.12.5.1
patched2.6.02.6.12.6.22.6.2 patched2.7.02.7.12.7.22.7.2
patched2.8.02.8.12.8.1 patched2.9.02.9.12.9.22.9.2 patched

R.version.string

[1] "R version 2.10.0 Patched (2009-11-21 r50532)"



This is also present in R-devel.  It's an error in utils:::print.news_db.

The problem is that versions like "2.0.1 patched" can't be converted to 
numeric versions, but print.news_db tries to do it to sort the entries. 
 The repetitive message is just a bad error message from 
.make_numeric_version.


Presumably the intention is that "2.0.1 patched" should compare bigger 
than "2.0.1" and less than "2.0.2", but I don't know the best fix for this.


Duncan Murdoch

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


[Rd] error message when running news()

2009-11-22 Thread Gabor Grothendieck
When running news() in I get this error message from print.news_db:

> news()
Error: invalid version specification 2.0.12.0.1 patched2.1.02.1.12.1.1
patched2.10.02.10.0 patched2.2.02.2.12.2.1 patched2.3.02.3.12.3.1
patched2.4.02.4.12.4.1 patched2.5.02.5.12.5.1
patched2.6.02.6.12.6.22.6.2 patched2.7.02.7.12.7.22.7.2
patched2.8.02.8.12.8.1 patched2.9.02.9.12.9.22.9.2 patched
> R.version.string
[1] "R version 2.10.0 Patched (2009-11-21 r50532)"

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel