[R] different results of fisher.test function in R2.8.1 and R2.6.0
Hi, I used the function fisher.test in both R 2.8.1 and R 2.6.0, and the results are not identical: the last number is different. Why? Thank you!

Merry

__ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Combining multiple dataframes in a list
I have a list containing multiple data frames. Depending on whether a data frame has one column or more than one, the resulting column names differ. How can I force single-column data frames to have prefixed column names?

> m <- list(fc=data.frame(A=1:3))
> do.call(cbind, m)
  A
1 1
2 2
3 3
> m <- list(fc=data.frame(A=1:3, B=2:4))
> do.call(cbind, m)
  fc.A fc.B
1    1    2
2    2    3
3    3    4
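One possible workaround, sketched under the assumption that every list element is a named data frame: rename the columns explicitly before binding, so that the prefix no longer depends on the number of columns. The helper name prefix_cbind is hypothetical and not part of base R.

```r
# Hypothetical helper: prefix each data frame's columns with its list name,
# then bind the pieces column-wise.
prefix_cbind <- function(lst) {
  pieces <- lapply(names(lst), function(nm) {
    df <- lst[[nm]]
    names(df) <- paste(nm, names(df), sep = ".")
    df
  })
  # 'pieces' is an unnamed list, so cbind keeps the names set above
  do.call(cbind, pieces)
}

m1 <- list(fc = data.frame(A = 1:3))
m2 <- list(fc = data.frame(A = 1:3, B = 2:4))
names(prefix_cbind(m1))
names(prefix_cbind(m2))
```

Because the prefix is applied by hand, single-column and multi-column data frames are treated the same way.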
Re: [R] Physical or Statistical Explanation for the "Funnel" Plot?
On Thu, 26 Mar 2009, Jason Rupert wrote:

> The R code below produces (after running for a few minutes on a decent computer) the plot shown at the following location: http://n2.nabble.com/Is-there-a-physical-and-quantitative-explanation-for-this-plot--td2542321.html
> I'm just taking the mean of a given set of random variables, where the set size is increased. There appears to be a quick convergence and then a pretty steady variance out to a set size of 10,.

I don't have time to study your code, but it sounds like you are drawing random normal variables with mean 0 and variance 1 and then taking the mean of sets of them. We know the exact distribution of the mean of the "set" (a.k.a. "sample"): it is normal with mean 0 and variance 1/N, where N is the size of the sample. When you allow N to vary, you produce a mixture of normal random variables, all having mean 0 but with different variances. The plot you show looks correct -- the distributions in the mixture that have small variance pile up in the middle, while those with greater variance form the long tails. You could get a lot of different shapes depending on the distribution of N.

But save yourself some time. Instead of generating N normal variables and taking their mean, just generate one and divide it by sqrt(N) -- that gives a draw from *exactly* the same distribution.

Your graph looks a little weird. First, why turn it sideways? We normally plot density on the ordinate, not on the abscissa. Second, there is a thick black bar on the left, but that seems to be an artifact because at least half of it is below zero -- how can that happen?

Mike
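The 1/N argument above can be checked with a short simulation. This is only a sketch: the seed, the number of replicates and the three values of N are arbitrary choices, not taken from the original code.

```r
set.seed(1)                      # arbitrary seed, for reproducibility only
reps <- 20000                    # simulated samples per value of N
for (N in c(4, 25, 100)) {
  # mean of N standard normal draws, repeated 'reps' times
  means  <- rowMeans(matrix(rnorm(reps * N), nrow = reps))
  # shortcut: a single standard normal draw divided by sqrt(N)
  scaled <- rnorm(reps) / sqrt(N)
  # both empirical standard deviations should be close to 1/sqrt(N)
  cat(N, sd(means), sd(scaled), 1 / sqrt(N), "\n")
}
```

Both columns of empirical standard deviations should track 1/sqrt(N), which is why the spread in the plot narrows as the set size grows.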
[R] Installing Rmpi on Centos 5.2
I am trying to install the R package "Rmpi", which needs libmpi. I've installed openmpi and lam on CentOS 5.2:

[r...@rab45-1 /]# rpm -qv openmpi
openmpi-1.2.5-5.el5
openmpi-1.2.5-5.el5
[r...@rab45-1 /]# rpm -qv lam
lam-7.1.2-14.el5
lam-7.1.2-14.el5

I'm using this to install Rmpi:

R CMD INSTALL /home/rick/Rmpi_0.5-7.tar.gz --with-mpi=/usr/lib64/lam/include

But I get the following error message:

/usr/bin/ld: skipping incompatible /usr/lib/lam/lib/libmpi.so when searching for -lmpi

I'm not sure what else to install/uninstall to fix this.

Rick B.
[R] Need some help at interfacing R with Fortran-90
I browsed through ch. 5 of the R manual "Writing R Extensions". It seems to be written for experienced programmers rather than for people who only occasionally face this problem. A table summarising the main key points for each supported platform would be very useful. Something like:

Supported languages: C, Fortran-77, ..

Windows guidelines:
1. compile Fortran code with (commands) from (terminal / command line ... ?)
2. load compiled code with (commands) from ()
3. call Fortran subroutine from R passing arguments

Linux guidelines: 1. 2. 3.

Unix guidelines: 1. 2. 3.

Coming to my problem: I have 4 Fortran-90 subroutines. There is a "main" routine calling the other three. Therefore I only need to call, and exchange data, between R and the "main" Fortran subroutine.

Question-1: Can Fortran-90 be interfaced to R? The manual mentions Fortran-77. I found no mention of Fortran-90 (which supports dynamic memory allocation, and whose set of commands is bigger than that of Fortran-77).

Question-2: Shall I generate executable code (compile the Fortran subroutines) through a Fortran compiler (which one?) outside the R environment?

Question-3: Shall I create a library ("so") object for each one of the 4 Fortran routines or just for the "main" one?

Question-4: What is the whole interfacing procedure on a Windows/XP system (steps sequence)?

Question-5: What is the whole interfacing procedure on a Mac/OS system (steps sequence)?

Thank you in advance for clarifying my confusion.

Maura
Re: [R] how to quit mailing list
Dear Jiang,

Go to https://stat.ethz.ch/mailman/listinfo/r-help and follow the instructions at the end.

HTH,
Jorge

On Thu, Mar 26, 2009 at 11:27 PM, Jiang Peng wrote:
> Dear list,
> I will not use this account later, so i have to quit this list.
>
> I don't know how to quit this list, I search the official website
> and find no answer.
>
> thanks very much !!
Re: [R] how to quit mailing list
On 27/03/2009, at 4:27 PM, Jiang Peng wrote:

> Dear list, I will not use this account later, so i have to quit this list. I don't know how to quit this list, I search the official website and find no answer.

Well you didn't search very hard. Go to

Mailing Lists ---> R.help web interface ---> Unsubscribe or edit options

Doesn't seem too difficult to me.

cheers,
Rolf Turner
Re: [R] Bug? FORTTRAN help
G'day Kevin,

On Thu, 26 Mar 2009 13:42:20 -0700 wrote:

> I was feeling masochistic the other day [...]

Welcome to the club. :)

> and we have been having some weird memory problems so I started
> digging into the source for L-BFGS-B. In the lbfgsb.c file I see the
> following code:
>
> /* Cholesky factorization of (2,2) block of wn. */
> F77_CALL(dpofa)(&wn[*col + 1 + (*col + 1) * wn_dim1], &m2, col,
> info);
> if (*info != 0) {
> *info = -2;
> return;
> }
>
> If I am not mistaken this says that there is a m2 * col matrix that
> starts at 'col + 1 + (col + 1) * wn_dim1', where wn_dim1 is 2 * m.

I think your interpretation is not quite correct. Note that it only makes sense to calculate a Cholesky factorization of a square matrix. The interface of dpofa (in the LINPACK library) is available at: http://www.netlib.org/linpack/dpofa.f

Thus, the call above says: calculate the Cholesky factorization of a col * col matrix whose (1,1) element is stored at &wn[*col + 1 + (*col + 1) * wn_dim1], where that matrix is stored within a matrix which was allocated such that it has m2 rows. Or, in other words, calculate the Cholesky factorization of a col * col matrix whose (1,1) element is stored at that address, and to move from the (1,1) element to the (1,2) element you have to move to the memory location m2*sizeof(double) ahead/behind of (1,1).

Fortran uses a column-major layout to store arrays, i.e. element (1,1) is followed by element (2,1), (3,1) and so forth. To know where to find element (1,2) of the matrix, you have to tell Fortran with how many rows the "big matrix" that holds "your matrix" was allocated.

> I am worried that the optimizer will silently write into memory that
> it shouldn't [...]

If you are worried about such issues, you should read chapter 4 of "Writing R Extensions", in particular Section 4.3 on gctorture and valgrind. Then run R on a platform that supports valgrind.
It is very useful for catching problems such as accessing or writing to memory that you should not access or write to.

HTH.

Cheers,
Berwin

Berwin A Turlach
Dept of Statistics and Applied Probability
Faculty of Science
National University of Singapore
6 Science Drive 2, Blk S16, Level 7
Singapore 117546
Tel.: +65 6516 4416 (secr), +65 6516 6650 (self)
FAX : +65 6872 3919
e-mail: sta...@nus.edu.sg
http://www.stat.nus.edu.sg/~statba
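The column-major addressing described in this reply can be illustrated in R itself, since R stores matrices the same way. This is a sketch with made-up dimensions; the names lda, big and offset are illustrative and do not come from the L-BFGS-B source.

```r
# A 5 x 4 "big" matrix; R, like Fortran, stores it column by column.
lda  <- 5                                   # rows allocated for the big matrix
big  <- matrix(seq_len(lda * 4), nrow = lda)
flat <- as.vector(big)                      # the underlying storage order

# 0-based offset of sub-matrix element (i, j) when the sub-matrix's
# (1,1) element sits at big[r0, c0].
offset <- function(i, j, r0, c0, lda) {
  (r0 - 1 + i - 1) + (c0 - 1 + j - 1) * lda
}

r0 <- 2; c0 <- 2
flat[offset(1, 1, r0, c0, lda) + 1] == big[2, 2]       # (1,1) of the sub-matrix
flat[offset(1, 2, r0, c0, lda) + 1] == big[2, 3]       # (1,2) of the sub-matrix
offset(1, 2, r0, c0, lda) - offset(1, 1, r0, c0, lda)  # stride between columns
```

The stride between neighbouring columns is the allocated row count of the enclosing matrix, not the size of the sub-matrix, which is the role m2 plays in the dpofa call.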
[R] how to quit mailing list
Dear list,

I will not use this account any longer, so I have to leave this list. I don't know how to unsubscribe; I searched the official website and found no answer.

Thanks very much!
Re: [R] Concerned about Interfacing R with Fortran
G'day Maura,

On Thu, 26 Mar 2009 18:21:01 +0100 wrote:

> I am reading the manual sections illustrating how to call a Fortran
> subroutine from R. I feel uneasy at the explicit statement about
> ".Fortran" interface working with Fortran 77. I would like to call a
> Fortran-90 subroutine from my R script. Is that supported at all ?

Read the complete manual. :) It is pretty easy to use acroread, or other PDF readers, to search for "Fortran" in the PDF file; the HTML version should also be searchable from your browser.

In chapter 1 (page 7 of the PDF 2.8.1 version) of the Writing R Extensions manual, you will find:

[...] providing support for C, C++, FORTRAN 77, Fortran 9...@footnote{note that Ratfor is not supported. If you have Ratfor source code, you need to convert it to FORTRAN. Only FORTRAN-77 (which we write in upper case) is supported on all platforms, but most also support Fortran-95 (for which we use title case). If you want to ship Ratfor source files, please do so in a subdirectory of @file{src} and not in the main subdirectory.}, Objective C [...]

and later in chapter 1, there is a complete section (namely 1.2.3) on F95 code:

@subsection Using F95 code

@R{} currently does not distinguish between FORTRAN 77 and Fortran 90/95 code, and assumes all FORTRAN comes in source files with extension @file{.f}. Commercial Unix systems typically use a F95 compiler, but only since the release of @code{gcc 4.0.0} in April 2005 have Linux and other non-commercial OSes had much support for F95. Only with @R{} 2.6.0 did the Windows port adopt a Fortran 90 compiler. This means that portable packages need to be written in correct FORTRAN 77, which will also be valid Fortran 95. See @uref{http://developer.r-project.org/Portability.html} for reference resources. In particular, @emph{free source form} F95 code is not portable.

On some systems an alternative F95 compiler is available: from the @code{gcc} family this might be @command{gfortran} or @command{g95}.
Configuring @R{} will try to find a compiler which (from its name) appears to be a Fortran 90/95 compiler, and set it in macro @samp{FC}. Note that it does not check that such a compiler is fully (or even partially) compliant with Fortran 90/95. Packages making use of Fortran 90/95 features should use file extension @file{.f90} or @file{.f95} for the source files: the variable @code{PKG_FCFLAGS} specifies any special flags to be used. There is no guarantee that compiled Fortran 90/95 code can be mixed with any other type of code, nor that a build of @R{} will have support for such packages. Section 5.5 (Creating shared objects) also mentions Fortran 9x: Shared objects for loading into @R{} can be created using @command{R CMD SHLIB}. This accepts as arguments a list of files which must be object files (with extension @file{.o}) or sources for C, C++, FORTRAN 77, Fortran 9x, Objective C or Objective C++ (with extensions @file{.c}, @file{.cc} or @file{.cpp} or @file{.C}, @file{.f}, @file{.f90} or @file{.f95}, @file{.m}, and @file{.mm} or @file{.M}, respectively), or commands to be passed to the linker. See @kbd{R CMD SHLIB --help} (or the @R{} help for @code{SHLIB}) for usage information. Thus, it seems that calling Fortran 90 code from R is possible on some platforms and, presumably, on those where it is possible this is done via the .Fortran interface; although the Writing R Extensions manual does not seem to say so explicitly. OTOH, the help file for .Fortran states: Use '.Fortran' with care for compiled Fortran 9x code: it may not work if the Fortran 9x compiler used differs from the Fortran compiler used when configuring R, especially if the subroutine name is not lower-case or includes an underscore. HTH. 
Cheers,
Berwin

Berwin A Turlach
Dept of Statistics and Applied Probability
Faculty of Science
National University of Singapore
6 Science Drive 2, Blk S16, Level 7
Singapore 117546
Tel.: +65 6516 4416 (secr), +65 6516 6650 (self)
FAX : +65 6872 3919
e-mail: sta...@nus.edu.sg
http://www.stat.nus.edu.sg/~statba
Re: [R] Some install package fixes for Ubuntu Hardy
On 26 March 2009 at 09:45, John C Nash wrote:
| I encountered some failures in using install.packages() to install rgl
| and rJava in some of my (multiple) Ubuntu Hardy systems. A quick search
| of the 'Net did not show any debian packages for these. The
| install.packages messages said header or other files were missing,
| suggesting path and related woes. Email with Duncan Murdoch (thanks!)
| pointed the way with rgl and led to a fix for rJava in similar fashion.
| It may save others some frustration to know my resolution. See below.
|
| However, I do have a question which a brief rummage of r-project did not
| answer. Where should information like this be put? My opinion is that it

Maybe on the r-sig-debian list that is dedicated to Debian / Ubuntu and R?

| should go on the wiki, but possibly there is a better solution if we can
| get the right messages into the package installers, though I recognize
| the load that puts on maintainers.
|
| Cheers, JN
|
| Ubuntu Hardy rgl install fix:
|
| The headers gl.h and glu.h are installed with the "dev" packages
| libgl1-mesa-dev and libglu1-mesa-dev. So the fix is to run (in a
| terminal as root)
|
| apt-get install libgl1-mesa-dev
| apt-get install libglu1-mesa-dev

Yes, which is why the r-cran-rgl package (available in Debian for over five years now, and hence in Ubuntu for probably 4 1/2) has the following Build-Depends (with my manual indentation here):

Build-Depends: debhelper (>= 5.0.0), r-base-dev (>= 2.8.1), cdbs, \
   libgl1-mesa-dev | libgl-dev, libglu1-mesa-dev | libglu-dev, \
   libpng12-dev, libx11-dev, libxt-dev, x11proto-core-dev

| then
| R
| .
| install.packages("rgl")
|
| etc.

Let's not forget the 'sudo apt-get install r-cran-rgl' alternative.

| Ubuntu Hardy rJava install fix:
|
| Needed to get Sun JDK (not JRE)
|
| Then add new
| ln -s /usr/java/jdkx/bin/java java
| and
| ln -s /usr/java/jdkx/bin/javac javac
|
| where xx is the version information on the jdk directory name -- in
| my case 1.6.0_13 (see below)
|
| Then
|
| R CMD javareconf
|
| still fails to find the java compiler.
|
| Seems $JAVA_HOME may not be defined.
|
| Try
| export JAVA_HOME="/usr/java/jdk1.6.0_13/"
|
| Then (as root)
|
| R CMD javareconf
|
| seems to work.
| Then rJava installed OK. I was then able to install RWeka (my original
| objective) and it seems to run OK.

Likewise, the r-cran-rjava package has

Build-Depends: debhelper (>= 7.0.0), r-base-dev (>= 2.8.1), cdbs, \
   openjdk-6-jdk, automake

and R is now configured for this Java version at build time. Again, questions on the r-sig-debian list may have been of help.

Hope this helps, Dirk

--
Three out of two people have difficulties with fractions.
[R] Physical or Statistical Explanation for the "Funnel" Plot?
The R code below produces (after running for a few minutes on a decent computer) the plot shown at the following location: http://n2.nabble.com/Is-there-a-physical-and-quantitative-explanation-for-this-plot--td2542321.html

I'm just taking the mean of a given set of random variables, where the set size is increased. There appears to be a quick convergence and then a pretty steady variance out to a set size of 10,. I'm just wondering if there is a statistical explanation out there for this convergence and whether it has been explored further.

Thanks again.

# First case
N<-10
X<-rnorm(N)
step_size<-1
# Groups
g<-rep(1:(N/step_size),each=step_size)
# The result
tmp_output<-tapply(X[1:length(g)],g,mean)
length_tmp_output<-length(tmp_output)
tmp_x_vals<-rep(step_size,length_tmp_output)
plot(tmp_x_vals, tmp_output, xlim=c(0,1))
#points(tmp_x_vals, tmp_output)
for(ii in 1:1)
{
  step_size<-ii
  # Groups
  g<-rep(1:(N/step_size),each=step_size)
  # The result
  #tmp_output<-tapply(X,g,mean)
  tmp_output<-tapply(X[1:length(g)],g,mean)
  length_tmp_output<-length(tmp_output)
  tmp_x_vals<-rep(step_size,length_tmp_output)
  points(tmp_x_vals, tmp_output)
}
[R] Overlying a map on top of an image [SEC=UNCLASSIFIED]
Good day everyone,

I have a *.jpeg map and I want to display it on top of an existing image (created using 'image'). Is there an easy way to do it? "map" does the job, but only with pre-existing maps (from its database), not with my map. I would greatly appreciate any help in this matter.

Thanks,
Augusto

Augusto Sanabria. MSc, PhD.
Mathematical Modeller
Risk & Impact Analysis Group
Geospatial & Earth Monitoring Division
Geoscience Australia (www.ga.gov.au)
Cnr. Jerrabomberra Av. & Hindmarsh Dr.
Symonston ACT 2601
Ph. (02) 6249-9155
Re: [R] use of "@" character in variable name
On 27/03/2009, at 2:52 PM, Marc Schwartz wrote:

> On Mar 26, 2009, at 8:40 PM, Rolf Turner wrote:
>> On 27/03/2009, at 2:04 PM, Mike Miller wrote:
>>> Importing data with a header row using read.delim, one variable should be named @5HTT but it is automatically renamed to X.5HTT, presumably because the "@" is either unacceptable or misunderstood. I've tried to find out what the rules are on variable names but have been unsuccessful. I'll bet someone here can tell me where to look. Maybe it's hidden away in here somewhere: http://cran.r-project.org/doc/manuals/R-data.pdf
>>
>> I don't know if there is a comprehensive list of the rules governing variable names but the ``@'' sign is used to access ``slots'' under S4 classes and methods. See ?"@". So it is (like?) an operation/operator and hence is ruled out just like ``+5HTT'' would be. Reserved words like ``break'' and ``while'' are also excluded. See fortune(18). Another rule is that a variable name can't begin with a digit. And it can't have white space in it. There are probably other rules, but essentially anything *sensible* as a variable name can be used as a variable name.
>
> See ?make.names for more information, which is noted in the description of the 'check.names' argument in the read.table() family of functions.

It might also be useful to note that a list of reserved words can be found via ?Reserved (as I found out from ??"reserved words").

cheers,
Rolf Turner
Re: [R] use of "@" character in variable name
Dear Mike,

As a slight simplification, a legal R name can start with a period (.), upper- or lower-case letter (A-Z, a-z), and can contain periods, underscores (_), letters, and numerals; depending upon the locale, some other characters may also be allowed. This information *is* in the R manuals, though it might not be that easy to locate: See section 10.3.2 of the R Language Definition manual or 1.8 of the Introduction to R manual. @ is used to access slots in an S4 object.

I hope this helps,
John

> -Original Message-
> From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On
> Behalf Of Mike Miller
> Sent: March-26-09 9:04 PM
> To: R-Help List
> Subject: [R] use of "@" character in variable name
>
> Importing data with a header row using read.delim, one variable should be
> named @5HTT but it is automatically renamed to X.5HTT, presumably because
> the "@" is either unacceptable or misunderstood. I've tried to find out
> what the rules are on variable names but have been unsuccessful. I'll bet
> someone here can tell me where to look. Maybe it's hidden away in here
> somewhere:
>
> http://cran.r-project.org/doc/manuals/R-data.pdf
>
> Thanks in advance.
>
> Mike
Re: [R] use of "@" character in variable name
On Mar 26, 2009, at 8:40 PM, Rolf Turner wrote:

> On 27/03/2009, at 2:04 PM, Mike Miller wrote:
>> Importing data with a header row using read.delim, one variable should be named @5HTT but it is automatically renamed to X.5HTT, presumably because the "@" is either unacceptable or misunderstood. I've tried to find out what the rules are on variable names but have been unsuccessful. I'll bet someone here can tell me where to look. Maybe it's hidden away in here somewhere: http://cran.r-project.org/doc/manuals/R-data.pdf
>
> I don't know if there is a comprehensive list of the rules governing variable names but the ``@'' sign is used to access ``slots'' under S4 classes and methods. See ?"@". So it is (like?) an operation/operator and hence is ruled out just like ``+5HTT'' would be. Reserved words like ``break'' and ``while'' are also excluded. See fortune(18). Another rule is that a variable name can't begin with a digit. And it can't have white space in it. There are probably other rules, but essentially anything *sensible* as a variable name can be used as a variable name.

See ?make.names for more information, which is noted in the description of the 'check.names' argument in the read.table() family of functions.

HTH,
Marc Schwartz
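The renaming discussed in this thread can be reproduced directly with make.names, and switched off with check.names = FALSE. A small sketch; the inline tab-separated text and the column name "age" are made up for illustration.

```r
make.names("@5HTT")                       # "X.5HTT", the name Mike observed
make.names(c("a b", "2nd", "ok_name"))    # other names that get altered or kept

# Keep the header exactly as written (here read from an inline string).
txt <- "@5HTT\tage\n1\t30\n2\t41\n"
d <- read.delim(text = txt, check.names = FALSE)
names(d)
d[["@5HTT"]]                              # non-syntactic names need [[ ]] or backticks
```

Keeping a non-syntactic name is possible, but every later reference to the column then has to quote it, so accepting the converted name or renaming the column is often less trouble.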
Re: [R] pgmm (blundell-bond) help needed
I have been playing with more examples, and I now know that with larger NF's my example code actually produces a result, instead of a singular matrix error. interestingly, stata's xtabond2 command seems ok with these sorts of data sets. either R has more stringent requirements, or stata is too casual. in any case, I find it strange that Blundell-Bond would not work on data sets in which N=20 and T=10, and there is only one parameter to estimate. there should be more than enough degrees of freedom. I will experiment more with it. regards, /iaw
Re: [R] use of "@" character in variable name
On 27/03/2009, at 2:04 PM, Mike Miller wrote:

> Importing data with a header row using read.delim, one variable should be named @5HTT but it is automatically renamed to X.5HTT, presumably because the "@" is either unacceptable or misunderstood. I've tried to find out what the rules are on variable names but have been unsuccessful. I'll bet someone here can tell me where to look. Maybe it's hidden away in here somewhere: http://cran.r-project.org/doc/manuals/R-data.pdf

I don't know if there is a comprehensive list of the rules governing variable names but the ``@'' sign is used to access ``slots'' under S4 classes and methods. See ?"@". So it is (like?) an operation/operator and hence is ruled out just like ``+5HTT'' would be. Reserved words like ``break'' and ``while'' are also excluded. See fortune(18). Another rule is that a variable name can't begin with a digit. And it can't have white space in it. There are probably other rules, but essentially anything *sensible* as a variable name can be used as a variable name.

cheers,
Rolf Turner
[R] ROCR package finding maximum accuracy and optimal cutoff point
If we use the ROCR package to find the accuracy of a classifier

pred <- prediction(svm.pred, testset[,2])
perf.acc <- performance(pred,"acc")

do we find the maximum accuracy as follows (is there a simpler way?):

> max(perf@x.values[[1]])

Then, to find the cutoff point that maximizes the accuracy, do we do the following (is there a simpler way?):

> cutoff.list <- unlist(perf@x.values[[1]])
> cutoff.list[which.max(perf@y.values[[1]])]

If the above is correct, how is it possible to find the average false positive and false negative rates from the following?

perf.fpr <- performance(pred, "fpr")
perf.fnr <- performance(pred, "fnr")

The dataset consists of two columns, a score and a binary response, similar to this:

2.5, 0
-1, 0
2, 1
6.3, 1
4.1, 0
3.3, 1

Thanks,
Saeed

---
R 2.8.1
Win XP Pro SP2
ROCR package v1.0-2
e1071 v1.5-19
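For orientation, the quantities asked about can be computed by hand in base R on the six example rows. This is a sketch and not ROCR itself: it assumes a case is predicted positive when its score is greater than or equal to the cutoff, and it only tries the observed scores as cutoffs. In a ROCR "acc" performance object the cutoffs are held in x.values and the accuracies in y.values, so the maximum accuracy would be read from y.values of perf.acc rather than from x.values.

```r
# The six example rows from the post: score and binary response.
score <- c(2.5, -1, 2, 6.3, 4.1, 3.3)
label <- c(0, 0, 1, 1, 0, 1)

# Candidate cutoffs: every distinct score. Assumed rule: predict 1 when
# score >= cutoff.
cutoffs <- sort(unique(score))
acc <- sapply(cutoffs, function(k) mean((score >= k) == (label == 1)))
fpr <- sapply(cutoffs, function(k) mean(score[label == 0] >= k))  # false positive rate
fnr <- sapply(cutoffs, function(k) mean(score[label == 1] <  k))  # false negative rate

best        <- which.max(acc)   # first cutoff that reaches the maximum accuracy
max_acc     <- acc[best]
best_cutoff <- cutoffs[best]
data.frame(cutoff = cutoffs, acc = acc, fpr = fpr, fnr = fnr)
```

Several cutoffs can tie for the best accuracy; which.max returns only the first, so inspecting the whole table is safer than relying on a single index. Averaging fpr and fnr over cutoffs is possible with mean(), though whether that average is meaningful depends on the application.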
Re: [R] use of "@" character in variable name
The '@' character is an operator used for accessing slots in S4 classes. Similarly the '$' character is the operator for accessing elements of lists, etc... Although R allows periods, SQL databases will choke on them. LaTeX will (sometimes?) choke on underscores, and of course any native R operator will cause problems. My take is that camelBackCapitalization is the best practice in naming columns, list elements, variables, and functions for R.

Best,
Krzysztof.

Mike Miller wrote:

> Importing data with a header row using read.delim, one variable should be named @5HTT but it is automatically renamed to X.5HTT, presumably because the "@" is either unacceptable or misunderstood. I've tried to find out what the rules are on variable names but have been unsuccessful. I'll bet someone here can tell me where to look. Maybe it's hidden away in here somewhere: http://cran.r-project.org/doc/manuals/R-data.pdf
>
> Thanks in advance.
>
> Mike

--
Krzysztof Sakrejda-Leavitt
Organismic and Evolutionary Biology
University of Massachusetts, Amherst
319 Morrill Science Center South
611 N. Pleasant Street
Amherst, MA 01003
work #: 413-325-6555
email: sakre...@nsm.umass.edu
[R] use of "@" character in variable name
Importing data with a header row using read.delim, one variable should be named @5HTT but it is automatically renamed to X.5HTT, presumably because the "@" is either unacceptable or misunderstood. I've tried to find out what the rules are on variable names but have been unsuccessful. I'll bet someone here can tell me where to look. Maybe it's hidden away in here somewhere: http://cran.r-project.org/doc/manuals/R-data.pdf

Thanks in advance.

Mike
[R] Trellis barchart help
Hi,

I apologise if this sounds like a really simple problem. I have a time-series data set with two columns of data: Catch and TACC. I want Catch plotted as a barchart and TACC plotted on the same graph as a series of points with an overlaid line. I have created the code for a barchart using the Catch data but I can't seem to figure out how to add the TACC data as a line. Can someone please enlighten me?

Regards,
James

Barchart code:

Schart<-barchart(Catch~Year,data=SNA,
  scales=list(col = "black", tck = c(1, 0),x=list(rot=45)),
  par.settings = list(axis.line = list(col = "transparent")),
  axis = function(side, ...) {
    if (side == "left")
      grid.lines(x = c(0, 0), y = c(0, 1), default.units = "npc")
    else if (side == "bottom")
      grid.lines(x = c(0, 1), y = c(0, 0), default.units = "npc")
    axis.default(side = side, ...)
  }
)
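One way this might be approached in lattice is a custom panel function that draws the bars first and then adds the second series. The following is a sketch only: the SNA data frame is a hypothetical stand-in with made-up numbers, the custom axis code from the post is left out for brevity, and Year is assumed to be a factor.

```r
library(lattice)

# Hypothetical stand-in for the poster's SNA data.
SNA <- data.frame(Year  = factor(2000:2005),
                  Catch = c(120, 135, 150, 110, 95, 105),
                  TACC  = c(140, 140, 130, 130, 100, 100))

Schart <- barchart(Catch ~ Year, data = SNA,
                   # make room for TACC values above the tallest bar
                   ylim = c(0, 1.05 * max(SNA$Catch, SNA$TACC)),
                   scales = list(x = list(rot = 45)),
                   panel = function(x, y, subscripts, ...) {
                     panel.barchart(x, y, ...)           # the Catch bars
                     tacc <- SNA$TACC[subscripts]        # TACC rows for this panel
                     xpos <- as.numeric(x)               # bar positions 1, 2, ...
                     panel.lines(xpos, tacc, col = "black")
                     panel.points(xpos, tacc, pch = 16, col = "black")
                   })
print(Schart)
```

The y-axis limits are set explicitly because barchart would otherwise scale the axis to Catch alone and could clip the TACC line.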
[R] Snow Parallel R: makeCluster with more nodes than available
Hi all,

I would like to know what happens if, using snow, I create a cluster of size 50, for example with makeCluster(50,type='SOCK'), on a machine with 2 cores and then run a function. Does snow run 25 function calls on each of my 2 "real" processors, or does it run all 50 on one processor?

Thanks.
[R] [R-pkgs] New CRAN Package: WriteXLS
A new package called WriteXLS (Version 1.5.0), containing a single function of the same name, is now available via CRAN. The package is made available under the GPL, version >=2.

The function is based upon a Perl script that I have previously posted on r-help, now wrapped in R for convenience. It provides a cross-platform function to create Excel 2003 (XLS) files from one or more data frames. Each data frame will be written to a separate named worksheet in the Excel spreadsheet. The worksheet name will be the name of the data frame it contains.

The package requires that Perl and several specific Perl modules are installed in order to function. These modules, which may already be installed on systems with Perl installed, include:

OLE::Storage_Lite
Parse::RecDescent
Getopt::Long
File::Basename
Spreadsheet::WriteExcel
Encode
File::Glob
Text::CSV_XS

With the exception of the last three modules listed, for convenience all are included in the CRAN package and will be made available by adding the installed package path for these modules to the Perl @INC array. The last three modules (Encode, File::Glob and Text::CSV_XS) are 'XS' based Perl modules, meaning that they call external C routines. Thus, they are not included as they require local compilation using the particular version of Perl installed on the user's system.

As a result, only source versions of the WriteXLS package will be made available. This is because the examples on the function help page will not run without Perl and all of these modules being available on the user's system. Hence the package will presently fail 'R CMD check' on CRAN.

For more information on installing source packages from CRAN, please see: http://cran.r-project.org/doc/manuals/R-admin.html#Installing-packages

WriteXLS is being maintained using R-Forge at: http://r-forge.r-project.org/projects/writexls

Please feel free to contact me directly with questions and comments. Many thanks to Prof. Brian Ripley for his assistance with the testing of this package prior to release.

Thanks, Marc Schwartz
___ R-packages mailing list r-packa...@r-project.org https://stat.ethz.ch/mailman/listinfo/r-packages
[R] R 2.8.1 and 2.9 alpha crash when running survest of Design package
Dear Prof Harrell and everyone, My PCs: Windows XP Service Pack 3 and Service Pack 2; R versions 2.8.1 and 2.9 alpha. For the last 3 days, after updating R, my two computers have had problems running existing, previously working R commands that involve the Design package. When I attempt to use 'survest', R (both 2.8.1 and 2.9 alpha) shuts down immediately every time with the following error report: AppName: rgui.exe AppVer: 2.90.48212.0 ModName: survival.dll ModVer: 0.0.0.0 Offset: 7749 However, if I run these commands on other computers that have not been updated for 2 weeks, they run OK. Could you please look into the matter and give me advice? I am looking forward to hearing from you soon. Regards, Nguyen D Nguyen, Garvan Institute of Medical Research, Sydney, Australia
Re: [R] ApEn (Approximate Entropy), Total Corr, Information Interaction
Hi Vishal, re 1]: Ben Bolker very kindly shared an R reimplementation of Kaplan's Matlab code a little while ago: http://www.nabble.com/Approximate-Entropy--to21144062.html#a21149402 Best wishes, Stephan

Vishal Belsare wrote:
> Is there any existing implementation in R/S of: 1] Pincus & Kalman's approximate entropy (ApEn) measure 2] Total Correlation / Multiinformation 3] Information Interaction
> A search doesn't quite reveal anything, but I'd be keen to not reinvent in case someone has worked on it. Many thanks in anticipation. Best, Vishal Belsare
[R] for Interaction of continuous var and categorical var, any way to approach the categorical var as continuous?
Dear list, This is NOT a technical question regarding the use of R. I have a linear model where the response variable is neighborhood safety. The expectation is that poverty worsens safety and that the number of officers per thousand residents improves safety. The focal hypothesis is that poverty poses less of a threat to safety when the number of officers is high. To check the focal hypothesis, the continuous variable "officers" is recoded as categorical with two levels (high and low). The results are below and support the hypothesis.

model <- lm(neighborhood_safety ~ poverty * officers)

The coefficients (all significant):
poverty             -0.05
officers             0.058
poverty:officers     0.014

My question is how to check the weakened "poverty" effect for a minuscule increase in "officers". The coefficient for the interaction term of continuous "poverty" and continuous "officers" is hard to interpret and does not seem suitable for checking the focal hypothesis since, say, (poverty=3 & officers=8) will give the same product as (poverty=8 & officers=3). Thanks a lot in advance for any suggestions! Sincerely, Will
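One way to read a continuous-by-continuous interaction is through the conditional ("simple") slope of poverty at a chosen level of officers. The sketch below takes the three reported coefficients at face value and treats officers as continuous, which is an assumption; the function names are hypothetical and the intercept is left out because it does not affect the comparison.

```r
# Coefficients as reported in the post.
b_poverty     <- -0.05
b_officers    <-  0.058
b_interaction <-  0.014

# Slope of safety with respect to poverty at a given number of officers:
#   d(safety) / d(poverty) = b_poverty + b_interaction * officers
poverty_slope <- function(officers) b_poverty + b_interaction * officers

slopes <- poverty_slope(c(0, 1, 2, 3))   # slope weakens as officers increase

# The product poverty * officers is symmetric, but the full linear predictor
# is not, because the two main effects differ.
linear_part <- function(poverty, officers) {
  b_poverty * poverty + b_officers * officers +
    b_interaction * poverty * officers
}
p1 <- linear_part(poverty = 3, officers = 8)
p2 <- linear_part(poverty = 8, officers = 3)
```

Under these numbers each additional officer per thousand residents moves the poverty slope by 0.014 toward zero, which is the "weakened poverty effect" stated per unit of officers rather than per category.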
Re: [R] Column name assignment problem
Why are you taking the fourth element of the paste() result? paste("arunoff_", table_year, sep="")[4] Either way, why not use something like: colnames(x)[column to change] <- "text, or pasted text" Cheers, Tal

On Thu, Mar 26, 2009 at 2:33 PM, Steve Murray wrote:
> Dear all,
> I'm trying to assign a name to the fourth column whilst using 'assign', but keep encountering errors. What have I done wrong?!
> assign(colnames(c(paste("arunoff_",table_year, sep="")[4]), "COUNT"))
> Error in if (do.NULL) NULL else if (nc> 0) paste(prefix, seq_len(nc), : argument is not interpretable as logical
> Hope someone is able to help.
> Thanks for any pointers,
> Steve

-- My contact information: Tal Galili Phone number: 972-50-3373767 FaceBook: Tal Galili My Blogs: http://www.r-statistics.com/ http://www.talgalili.com http://www.biostatistics.co.il
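If the data frame really has to be reached through a pasted name, a get/modify/assign round trip is one way to rename its fourth column. This is only a sketch: the value of table_year and the four-column stand-in data frame are invented for illustration.

```r
table_year <- 1986                                     # hypothetical value
obj_name   <- paste("arunoff_", table_year, sep = "")  # "arunoff_1986"

# Stand-in data frame with four columns, created under the pasted name.
assign(obj_name, data.frame(a = 1:2, b = 3:4, c = 5:6, d = 7:8))

tmp <- get(obj_name)          # fetch the object by its name
colnames(tmp)[4] <- "COUNT"   # rename the fourth column
assign(obj_name, tmp)         # store the modified copy under the same name

result_names <- colnames(get(obj_name))
```

Keeping the yearly tables in a named list and modifying list elements directly usually avoids the need for assign() and get() altogether.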
[R] pgmm (Blundell-Bond) sample needed
Dear R Experts--- Sorry for all the questions yesterday and today. I am trying to use Yves Croissant's pgmm function in the plm package with Blundell-Bond moments. I have read the Blundell-Bond paper, and want to run the simplest model first, d[i,t] = a*d[i,t-1] + fixed[i] + u[i,t] . no third conditioning variables yet. the full set of moment conditions recommended for system-GMM, which is (T-1)*(T-2)/2+(T-3), in which the u's interact with all possible lagged y's and delta y's. I believe that pgmm operates by demanding that "firm" (i) and "year" (t) be the first two columns in the data set. library(plm) NF=20; NT=10 d= data.frame( firm= rep(1:NF, each=NT), year= rep( 1:NT, NF), x=rnorm(NF*NT) ); # the following fails, because dynformula magic is required; learned this the hard way # v=pgmm( x ~ lag(x), data=d, gmm.inst=~x, lag.gmm=c(2,99), transformation="ld" ) formula= dynformula( x ~ 1, list(1)); # this creates x ~ lag(x) v=pgmm( formula, data=d, gmm.inst=~x, lag.gmm=c(2,99), transformation="ld" ) Error in solve.default(suml(Vi)) : system is computationally singular: reciprocal condition number = 8.20734e-20 obviously, I am confused. when I execute the same command on the included UKEmpl data set, it works. however, my inputs would seem perfectly reasonable. I would hope that the procedure could produce a lag(x) coefficient estimate of around 0, and then call it a day. could someone please tell me how to instruct pgmm to just estimate this simplest of all BB models? [My ultimate goal is to replicate what another author has run via "xtabond2 d ld, gmm(L.(d), lag(1 3)) robust" in Stata; if you know the magic of moving this statement into pgmm syntax, I would be even more grateful. Right now, I am so stuck on square 1 that I do not know how to move towards figuring out where I ultimately need to go.] 
regards, /iaw [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] ApEn (Approximate Entropy), Total Corr, Information Interaction
Is there any existing implementation in R/S of : 1] Pincus & Kalman's approximate entropy (ApEn) measure 2] Total Correlation / Multiinformation 3] Information Interaction A search doesn't quite reveal anything, but I'd be keen to not reinvent in case someone has worked on it. Many thanks in anticipation. Best, Vishal Belsare __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] VGAM and constraints
Dear R-Users, I am learning how to use the package VGAM to do multinomial regressions. I have worked through the example provided by UCLA stats group: http://www.ats.ucla.edu/stat/r/dae/mlogit.htm However - what I would like to learn is how to use the constraint option. So in the example given in the link i would like to introduce a constraint that does not allow individuals of age less than say 25 to buy brand 1 (for whatever reason) how do I enter this constraint? Thanks. Raff. _ [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Bug? FORTRAN help
I was feeling masochistic the other day, and we have been having some weird memory problems, so I started digging into the source for L-BFGS-B. In the lbfgsb.c file I see the following code:

/* Cholesky factorization of (2,2) block of wn. */
F77_CALL(dpofa)(&wn[*col + 1 + (*col + 1) * wn_dim1], &m2, col, info);
if (*info != 0) { *info = -2; return; }

If I am not mistaken, this says that there is an m2 * col matrix that starts at col + 1 + (col + 1) * wn_dim1, where wn_dim1 is 2 * m. My first question is to verify that statement. Say I am trying to optimize the "banana function" as given in the documentation. In that case n = 2 and the default m = 5. So m2 is 10 and wn_dim1 is 20 and the dimension of wn is 100 (this is all by deduction). So if col is 5 then the offset into the array is 55 and there is not room in the vector for a 10 x 5 array. I am worried that the optimizer will silently write into memory that it shouldn't, but more than likely it is something that I don't understand. So please verify my first statement. Thank you. Kevin
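The arithmetic in the question can be checked with a small sketch. It rests on two assumptions that are not confirmed in the post: that wn_dim1 = 2 * m = 10 (as the post first states), and that lbfgsb.c follows the usual f2c convention of shifting the pointer by 1 + wn_dim1 so that indices are 1-based. The LINPACK routine dpofa(a, lda, n, info) factors an n x n matrix stored with leading dimension lda, so in the call above m2 is the leading dimension and col is the order of the block.

```r
m   <- 5        # number of corrections, as in the post
col <- 5        # worst case considered in the post
lda <- 2 * m    # leading dimension of wn (assumed wn_dim1 = 2 * m = 10)

wn <- matrix(seq_len(lda * lda), nrow = lda)   # 10 x 10, column-major, 100 cells

# Zero-based offset of element (col + 1, col + 1), assuming the f2c-style
# pointer shift wn -= 1 + wn_dim1 was applied before the call.
first_offset <- (col + 1) + (col + 1) * lda - (1 + lda)

# A col x col block with leading dimension lda ends at element
# (2 * col, 2 * col) in 1-based indexing.
last_offset <- (2 * col) + (2 * col) * lda - (1 + lda)

# The same (2,2) block picked out with ordinary R indexing.
block <- wn[(col + 1):(2 * col), (col + 1):(2 * col)]
```

Under these assumptions the block starts at offset 55, matching the figure in the post, and its last element is at offset 99, which is still inside the 100-element array.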
[R] ICC question: Interrater and intrarater variability (intraclass correlation coefficients)
Hello dear R help group. I came across this old thread (http://tinyurl.com/dklgsk), which contains a question similar to mine but was left without an answer. I am hoping one of you might help. A simplified situation: I have a factorial design (with 2^3 experimental combinations) for 167 subjects, each of whom has answered the same question twice (out of a bunch of "types" of questions). Each answer is an integer between 0 and 3. I wish to combine the two answers, but first, to be sure I can, I would like to run an ICC (intraclass correlation) check on the two answers. Naturally, I would use the irr, concord, or psy packages (as John Fox suggested back then), but I can't because of the repetitions of different design questions for each patient. The packages mentioned (irr, concord, and psy) accept only an n*m matrix of subjects by raters. There is no place for the repetitions in the data, and therefore it is impossible to get results for the INTRArater reliability. Thanks, Tal -- My contact information: Tal Galili Phone number: 972-50-3373767 FaceBook: Tal Galili My Blogs: http://www.r-statistics.com/ http://www.talgalili.com http://www.biostatistics.co.il
[R] Hazard rate estimation by smoothing baseline cumulative hazard from Cox model - Was: RE: Competing risks Kalbfleisch & Prentice method
Hi Eleni, I will take a look at this. I have some preliminary comments. You estimate the hazard function from the Cox model baseline cumulative hazard by differencing successive jumps. It seems that a better approach might be to estimate this using kernel smoothing, i.e. as the derivative of kernel-smoothed cumulative hazard function. This method is available in the "muhaz" package. However, the muhaz() function does not work with the Cox model baseline cumulative hazard. It requires you to input the original data on times and censoring indicators. It would be nice if this were possible. I am wondering why Terry Therneau's "survival" package doesn't have this option. Best, Ravi. --- Ravi Varadhan, Ph.D. Assistant Professor, The Center on Aging and Health Division of Geriatric Medicine and Gerontology Johns Hopkins University Ph: (410) 502-2619 Fax: (410) 614-9625 Email: rvarad...@jhmi.edu Webpage: http://www.jhsph.edu/agingandhealth/People/Faculty/Varadhan.html -Original Message- From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Eleni Rapsomaniki Sent: Thursday, March 26, 2009 1:17 PM To: Ravi Varadhan; Heinz Tuechler Cc: r-help@r-project.org Subject: Re: [R] Competing risks Kalbfleisch & Prentice method Ravi, I agree with you, that the Fine & Gray method does much more than calculating the cumulative incidence. The Kalbfleisch & Prentice formula relies on the strong assumption that the coefficients for the cause specific hazard models are valid in the presence of competing risks. You will find my code below, but it may be wrong (I'm not really a statistician!). And no, it doesn't even calculate confidence intervals... Thank you for pointing out these papers. Please let me know (nicely) if you find bugs!!! 
# Method to calculate Cumulative Incidence adjusting for competing risks,
# based on the Kalbfleisch & Prentice formula, 1980, p.169

# create data
ftime <- rexp(200, 0.3)
fstatus <- sample(0:2, 200, replace=TRUE)
cov <- matrix(runif(600), nrow=200)
dimnames(cov)[[2]] <- c('x1','x2','x3')
dat = data.frame(ftime, fstatus, cov)

diffrHaz = function(x){
  hazard = x$hazard
  time = x$time
  hz.v = NULL
  for(e in 1:(length(hazard)-1)){
    hz = hazard[e+1] - hazard[e]
    hz.v = c(hz.v, hz)
  }
  dhz = data.frame(hazard=hz.v, time=x$time[-length(x$time)])
  return(dhz)
}

# build cause specific cox models
library(Design)
a.f = cph(Surv(ftime, fstatus==1) ~ x1+x2+x3, data=dat, surv=T, x=T, y=T)
b.f = cph(Surv(ftime, fstatus==2) ~ x1+x2+x3, data=dat, surv=T, x=T, y=T)

# get unique event times - up till the time of interest, eg. time=10
uts = unique(dat$ftime[dat$fstatus>0])
uts = uts[uts<=10]     # the times up till 10 years
uts = uts[order(uts)]  # order them

# get baseline hazard rate (cumulative)
a.bz = basehaz(a.f)
b.bz = basehaz(b.f)

# get instantaneous baseline hazard rate, apply above function
a.dhz1 = diffrHaz(a.bz)
b.dhz1 = diffrHaz(b.bz)

P.t = NULL  # initialize where to store all results
for(i in 1:nrow(dat)){  # i is the individual, iterate through the number of rows in dat
  P.i.t = NULL
  all.lambda.i.t = NULL
  for(u.i in 1:length(uts)){
    u = uts[u.i]
    # get the instantaneous baseline hazard from each model corresponding to
    # this timepoint (if there is no event at that time return 0)
    a.dhz.u = ifelse((u %in% a.dhz1$time)==F, 0, a.dhz1$hazard[a.dhz1$time==u])
    b.dhz.u = ifelse((u %in% b.dhz1$time)==F, 0, b.dhz1$hazard[b.dhz1$time==u])
    # multiply by the linear predictors to get the actual hazard at that point
    a.lambda.i.u = a.dhz.u*exp(a.f$linear.predictors[i])
    b.lambda.i.u = b.dhz.u*exp(b.f$linear.predictors[i])
    all.lambda.i.u = a.lambda.i.u + b.lambda.i.u
    # store in a vector
    all.lambda.i.t = c(all.lambda.i.t, all.lambda.i.u)  # build a vector with all previous all.lambda.i.u's
    S.i.u = exp(-sum(all.lambda.i.t))  # this is the probability of surviving till time u
    p.i.u = S.i.u*a.lambda.i.u
    P.i.t = c(P.i.t, p.i.u)
  }
  P.t[i] = sum(P.i.t)
}

# to compare with unadjusted:
a.risk = 1 - survest(a.f, linear.predictors=a.f$linear.predictors, times=10)$surv
plot(a.risk ~ P.t)  # P.t is the CR adjusted, should always be lower
mean(a.risk)
mean(P.t)

Eleni Rapsomaniki Research Associate Tel: +44 (0) 1223 740273 Strangeways Research Laboratory Department of Public Health and Primary Care University of Cambridge

-Original Message- From: Ravi Varadhan [mailto:rvarad...@jhmi.edu] Sent: 26 March 2009 14:36 To: Eleni Rapsomaniki; 'Arthur Allignol' Cc: r-help@r-project.org Subject: RE: [R] Competing risks Kalbfleisch & Prentice method

Hi Eleni, I would like to take a look at your R function for obtaining the cumulative incidence function (CIF) from individual Cox mod
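The kernel-smoothing idea raised in the reply at the top of this thread can be sketched in base R by smoothing the jumps of a cumulative hazard with a Gaussian kernel. This is not the method implemented in the muhaz package, only an illustration of the principle; the function name smooth_hazard, the bandwidth, and the synthetic data are all assumptions made here.

```r
# Gaussian-kernel estimate of the hazard from the jumps of a cumulative hazard.
# `times` and `cumhaz` play the role of the time and hazard columns returned
# by basehaz().
smooth_hazard <- function(times, cumhaz, grid, bw) {
  jumps <- diff(c(0, cumhaz))   # increments of the cumulative hazard
  # At each grid point, weight every jump by a kernel centred on its time.
  sapply(grid, function(t) sum(dnorm((t - times) / bw) / bw * jumps))
}

# Synthetic check: a constant hazard of 0.3 has cumulative hazard 0.3 * t.
times  <- seq(0.1, 10, by = 0.1)
cumhaz <- 0.3 * times
h_hat  <- smooth_hazard(times, cumhaz, grid = c(3, 5, 7), bw = 0.5)
```

Away from the ends of the time range the estimate recovers the constant hazard closely; near the boundaries a plain kernel like this one is biased downward, and the result depends on the choice of bandwidth.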
Re: [R] Analogy for %in% for the whole columns (rather than individual values)
Thank you very much! I tried %in% and it did not work. However == works! Thank you! On Thu, Mar 26, 2009 at 3:05 PM, Fredrik Karlsson wrote: > Hi Dimitri, > > What about this: > > >> with(a, x == y) -> a$equal >> a > x y equal > 1 john mary FALSE > 2 mary mary TRUE > 3 mary john FALSE > 4 john john TRUE > > /Fredrik > > On Thu, Mar 26, 2009 at 7:48 PM, Dimitri Liakhovitski > wrote: >> >> Hello! >> >> I have a matrix a with 2 variables (see below) that contain character >> strings. >> I need to create a 3rd variable that contains True if the value in >> column x is equal to the value in column y. The code below does it. >> >> a<-data.frame(x=c("john", "mary", "mary", >> "john"),y=c("mary","mary","john","john")) >> a$x<-as.character(a$x) >> a$y<-as.character(a$y) >> a$equal<-NA >> for(i in 1:nrow(a)) {a[i,3]<-a[i,1] %in% a[i,2]} >> >> I was wondering if one can do it faster, without doing it row-by-row >> as I did it. Maybe there is a faster way to compare 2 columns? >> >> Thank you very much! >> >> -- >> Dimitri Liakhovitski >> MarketTools, Inc. >> dimitri.liakhovit...@markettools.com >> >> __ >> R-help@r-project.org mailing list >> https://stat.ethz.ch/mailman/listinfo/r-help >> PLEASE do read the posting guide >> http://www.R-project.org/posting-guide.html >> and provide commented, minimal, self-contained, reproducible code. > > > > -- > "Life is like a trumpet - if you don't put anything into it, you don't get > anything out of it." > -- Dimitri Liakhovitski MarketTools, Inc. dimitri.liakhovit...@markettools.com __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Reading in files with variable parts to names
Dear Steve, Another option would be to use a list to store your files in R:

# Year/Month
year <- 1986:1995
month <- sprintf("%02d", 1:12)
# File names: repeat each year 12 times so that every year is paired with every month
Files <- paste("comp", rep(year, each = length(month)), month, ".asc", sep = "")
# Data: lapply() returns a list with one data frame per file
ListFiles <- lapply(Files, read.table, header = TRUE, sep = "")
# To access the first file
ListFiles[[1]]

HTH, Jorge

On Thu, Mar 26, 2009 at 2:40 PM, Steve Murray wrote:
> Dear all,
> I'm trying to read in a whole directory of files which have two variable parts to the file name: year and month. E.g. comp198604.asc represents April of 1986 - 'comp' is fixed in each case. Years range between 1986 to 1995 and months are between 1 and 12.
> Just to be clear, there are 12 files associated with each year: e.g. comp198601, comp198602, ... comp198612 through to comp199501, comp199502 ... comp199512.
> I am trying to automate the reading in of these files, but am struggling to find an adequate way of achieving this. The closest I've got is by doing:
> year <- 1986:1995
> month <- sprintf("%02d", 1:12) # formats numbers to 2 digits (for maintaining leading zeros in file names)
> filelist <- paste("C:\\Documents and Settings\\Data\\comp",year,month,".asc", sep="")
> filelist
> [1] "C:\\Documents and Settings\\Data\\comp198601.asc"
> [2] "C:\\Documents and Settings\\Data\\comp198702.asc"
> [3] "C:\\Documents and Settings\\Data\\comp198803.asc"
> [4] "C:\\Documents and Settings\\Data\\comp198904.asc"
> [5] "C:\\Documents and Settings\\Data\\comp199005.asc"
> [6] "C:\\Documents and Settings\\Data\\comp199106.asc"
> [7] "C:\\Documents and Settings\\Data\\comp199207.asc"
> [8] "C:\\Documents and Settings\\Data\\comp199308.asc"
> [9] "C:\\Documents and Settings\\Data\\comp199409.asc"
> [10] "C:\\Documents and Settings\\Data\\comp199510.asc"
> [11] "C:\\Documents and Settings\\Data\\comp198611.asc"
> [12] "C:\\Documents and Settings\\Data\\comp198712.asc"
> I need 1986 to remain fixed whilst it cycles through 01 to 12, before it moves onto 1987 and cycles again. There should be 120 outputs in total (10 years each with 12 months), but at present it's only reaching 12 outputs.
> I'd be grateful to learn what I'm doing wrong here so that I can solve this.
> Many thanks as ever,
> Steve
Re: [R] Reading in files with variable parts to names
Steve Murray writes:
> I'm trying to read in a whole directory of files which have two variable parts to the file name: year and month. E.g. comp198604.asc represents April of 1986 - 'comp' is fixed in each case. Years range between 1986 to 1995 and months are between 1 and 12.
> Just to be clear, there are 12 files associated with each year: e.g. comp198601, comp198602, ... comp198612 through to comp199501, comp199502 ... comp199512.

gr = expand.grid(as.character(1986:1995), sprintf("%02d", 1:12), stringsAsFactors = FALSE)
filelist = paste(gr[,1], gr[,2], ".asc", sep="")
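The 12 names in the question come from paste() pairing its arguments element by element and recycling the shorter vector, rather than forming all combinations. A small sketch of the expand.grid() route that yields the 120 names in year-then-month order follows; the column names month and year are chosen here for readability, and the directory prefix from the question is omitted.

```r
# expand.grid() varies its first argument fastest, so month goes first to get
# 198601, 198602, ..., 198612, 198701, ...
gr <- expand.grid(month = sprintf("%02d", 1:12),
                  year  = 1986:1995,
                  stringsAsFactors = FALSE)

# One file name per row of the grid: 10 years x 12 months = 120 names.
filelist <- paste("comp", gr$year, gr$month, ".asc", sep = "")
```

The full path can be added afterwards with file.path(), and the files read in one pass with lapply(filelist, read.table, header = TRUE).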
Re: [R] Analogy for %in% for the whole columns (rather than individual values)
Hi Dimitri, What about this:

> with(a, x == y) -> a$equal
> a
     x    y equal
1 john mary FALSE
2 mary mary  TRUE
3 mary john FALSE
4 john john  TRUE

/Fredrik

On Thu, Mar 26, 2009 at 7:48 PM, Dimitri Liakhovitski wrote:
> Hello!
> I have a matrix a with 2 variables (see below) that contain character strings.
> I need to create a 3rd variable that contains True if the value in column x is equal to the value in column y. The code below does it.
> a<-data.frame(x=c("john", "mary", "mary", "john"),y=c("mary","mary","john","john"))
> a$x<-as.character(a$x)
> a$y<-as.character(a$y)
> a$equal<-NA
> for(i in 1:nrow(a)) {a[i,3]<-a[i,1] %in% a[i,2]}
> I was wondering if one can do it faster, without doing it row-by-row as I did it. Maybe there is a faster way to compare 2 columns?
> Thank you very much!
> --
> Dimitri Liakhovitski
> MarketTools, Inc.
> dimitri.liakhovit...@markettools.com

-- "Life is like a trumpet - if you don't put anything into it, you don't get anything out of it."
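A short sketch of why == is the right tool here and why %in% applied to whole columns is not: == compares the two columns row by row, while %in% asks whether each value of x occurs anywhere in y. The column name x_in_y is made up for the illustration, and stringsAsFactors = FALSE stands in for the as.character() calls in the original post.

```r
a <- data.frame(x = c("john", "mary", "mary", "john"),
                y = c("mary", "mary", "john", "john"),
                stringsAsFactors = FALSE)

# Element-wise comparison: one TRUE/FALSE per row, no loop needed.
a$equal <- a$x == a$y

# %in% is a membership test against the whole of y, so every row is TRUE here.
a$x_in_y <- a$x %in% a$y
```

The row-by-row loop in the question gives the right answer only because each call compares a single value with a single value, where %in% and == happen to agree.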
Re: [R] Reading in files with variable parts to names
Try this to generate your year/month combinations: > expand.grid(year=1986:1995, month=1:12) Obviously you'll have to format the months. -Original Message- From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Steve Murray Sent: Thursday, March 26, 2009 2:40 PM To: r-help@r-project.org Subject: [R] Reading in files with variable parts to names Dear all, I'm trying to read in a whole directory of files which have two variable parts to the file name: year and month. E.g. comp198604.asc represents April of 1986 - 'comp' is fixed in each case. Years range between 1986 to 1995 and months are between 1 and 12. Just to be clear, there are 12 files associated with each year: e.g. comp198601, comp198602, ... comp198612 through to comp199501, comp199502 ... comp199512. I am trying to automate the reading in of these files, but am struggling to find an adequate way of achieving this. The closest I've got is by doing: year <- 1986:1995 month <- sprintf("%02d", 1:12) # formats numbers to 2 digits (for maintaining leading zeros in file names) filelist <- paste("C:\\Documents and Settings\\Data\\comp",year,month,".asc", sep="") filelist [1] "C:\\Documents and Settings\\Data\\comp198601.asc" [2] "C:\\Documents and Settings\\Data\\comp198702.asc" [3] "C:\\Documents and Settings\\Data\\comp198803.asc" [4] "C:\\Documents and Settings\\Data\\comp198904.asc" [5] "C:\\Documents and Settings\\Data\\comp199005.asc" [6] "C:\\Documents and Settings\\Data\\comp199106.asc" [7] "C:\\Documents and Settings\\Data\\comp199207.asc" [8] "C:\\Documents and Settings\\Data\\comp199308.asc" [9] "C:\\Documents and Settings\\Data\\comp199409.asc" [10] "C:\\Documents and Settings\\Data\\comp199510.asc" [11] "C:\\Documents and Settings\\Data\\comp198611.asc" [12] "C:\\Documents and Settings\\Data\\comp198712.asc" I need 1986 to remain fixed whilst it cycles through 01 to 12, before it moves onto 1987 and cycles again. 
There should be 120 outputs in total (10 years each with 12 months), but at present it's only reaching 12 outputs. I'd be grateful to learn what I'm doing wrong here so that I can solve this. Many thanks as ever, Steve
Re: [R] Reading in files with variable parts to names
Hi,

If your directory contains only files you want to load anyway, then list.files() is your friend:

list.files(pattern = "comp") # or pattern = ".asc", for example

If you do need to create the names manually, then you could create the combinations with expand.grid, as in:

do.call(paste, as.list(expand.grid(x = seq(1950,1960), y = 1:10))) # you'll want to tweak paste to suit your needs

HTH,

baptiste

On 26 Mar 2009, at 18:40, Steve Murray wrote:

> Dear all,
>
> I'm trying to read in a whole directory of files which have two variable parts to the file name: year and month. E.g. comp198604.asc represents April of 1986 - 'comp' is fixed in each case. Years range between 1986 to 1995 and months are between 1 and 12. Just to be clear, there are 12 files associated with each year: e.g. comp198601, comp198602, ... comp198612 through to comp199501, comp199502 ... comp199512.
>
> I am trying to automate the reading in of these files, but am struggling to find an adequate way of achieving this. The closest I've got is by doing:
>
> year <- 1986:1995
> month <- sprintf("%02d", 1:12) # formats numbers to 2 digits (for maintaining leading zeros in file names)
> filelist <- paste("C:\\Documents and Settings\\Data\\comp",year,month,".asc", sep="")
> filelist
>  [1] "C:\\Documents and Settings\\Data\\comp198601.asc"
>  [2] "C:\\Documents and Settings\\Data\\comp198702.asc"
>  [3] "C:\\Documents and Settings\\Data\\comp198803.asc"
>  [4] "C:\\Documents and Settings\\Data\\comp198904.asc"
>  [5] "C:\\Documents and Settings\\Data\\comp199005.asc"
>  [6] "C:\\Documents and Settings\\Data\\comp199106.asc"
>  [7] "C:\\Documents and Settings\\Data\\comp199207.asc"
>  [8] "C:\\Documents and Settings\\Data\\comp199308.asc"
>  [9] "C:\\Documents and Settings\\Data\\comp199409.asc"
> [10] "C:\\Documents and Settings\\Data\\comp199510.asc"
> [11] "C:\\Documents and Settings\\Data\\comp198611.asc"
> [12] "C:\\Documents and Settings\\Data\\comp198712.asc"
>
> I need 1986 to remain fixed whilst it cycles through 01 to 12, before it moves onto 1987 and cycles again. There should be 120 outputs in total (10 years each with 12 months), but at present it's only reaching 12 outputs.
>
> I'd be grateful to learn what I'm doing wrong here so that I can solve this.
>
> Many thanks as ever,
>
> Steve

_
Baptiste Auguié
School of Physics
University of Exeter
Stocker Road, Exeter, Devon, EX4 4QL, UK
Phone: +44 1392 264187
http://newton.ex.ac.uk/research/emag
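In the quoted attempt, paste() pairs the 10 years with the 12 months element by element (recycling the shorter vector), which is why only 12 names come out. The expand.grid() suggestion can be sketched as follows. This is a minimal illustration, assuming the files sit in the working directory; the object names grid and filelist are chosen here for illustration only.

```r
years  <- 1986:1995
months <- sprintf("%02d", 1:12)   # "01" ... "12", keeps the leading zero

# expand.grid() varies its first argument fastest, so listing month first
# gives all 12 months of 1986, then all 12 months of 1987, and so on.
grid <- expand.grid(month = months, year = years, stringsAsFactors = FALSE)

# Build one file name per (year, month) combination.
filelist <- paste("comp", grid$year, grid$month, ".asc", sep = "")

length(filelist)     # number of names generated
head(filelist, 3)    # first three names, all from 1986
```

A directory prefix can be added with file.path() if the files live elsewhere. How to read each file depends on the layout of the .asc files, which is not described in the thread.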
Re: [R] Analogy for %in% for the whole columns (rather than individual values)
Dear Dimitri:

Try this:

a<-data.frame(x=c("john", "mary", "mary","john"),y=c("mary","mary","john","john"))
a$x<-as.character(a$x)
a$y<-as.character(a$y)
a$TEST<-apply(a,1,function(x) x[1]==x[2])
a

See ?apply for more details.

HTH,

Jorge

On Thu, Mar 26, 2009 at 2:48 PM, Dimitri Liakhovitski wrote:

> Hello!
>
> I have a matrix a with 2 variables (see below) that contain character
> strings.
> I need to create a 3rd variable that contains True if the value in
> column x is equal to the value in column y. The code below does it.
>
> a<-data.frame(x=c("john", "mary", "mary",
> "john"),y=c("mary","mary","john","john"))
> a$x<-as.character(a$x)
> a$y<-as.character(a$y)
> a$equal<-NA
> for(i in 1:nrow(a)) {a[i,3]<-a[i,1] %in% a[i,2]}
>
> I was wondering if one can do it faster, without doing it row-by-row
> as I did it. Maybe there is a faster way to compare 2 columns?
>
> Thank you very much!
>
> --
> Dimitri Liakhovitski
> MarketTools, Inc.
> dimitri.liakhovit...@markettools.com
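Since == is already vectorised in R, the two columns can also be compared directly, without apply() or a loop. A minimal sketch on the same toy data; the stringsAsFactors = FALSE argument is added here so that the columns are plain character vectors from the start:

```r
a <- data.frame(x = c("john", "mary", "mary", "john"),
                y = c("mary", "mary", "john", "john"),
                stringsAsFactors = FALSE)

# Element-wise comparison of the two columns: one logical value per row.
a$equal <- a$x == a$y
a
```

Unlike %in%, which asks whether a value occurs anywhere in the other vector, == compares position by position, which is what the row-wise loop in the question was doing.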
[R] Analogy for %in% for the whole columns (rather than individual values)
Hello!

I have a matrix a with 2 variables (see below) that contain character strings. I need to create a 3rd variable that contains True if the value in column x is equal to the value in column y. The code below does it.

a<-data.frame(x=c("john", "mary", "mary", "john"),y=c("mary","mary","john","john"))
a$x<-as.character(a$x)
a$y<-as.character(a$y)
a$equal<-NA
for(i in 1:nrow(a)) {a[i,3]<-a[i,1] %in% a[i,2]}

I was wondering if one can do it faster, without doing it row-by-row as I did it. Maybe there is a faster way to compare 2 columns?

Thank you very much!

--
Dimitri Liakhovitski
MarketTools, Inc.
dimitri.liakhovit...@markettools.com
[R] Reading in files with variable parts to names
Dear all,

I'm trying to read in a whole directory of files which have two variable parts to the file name: year and month. E.g. comp198604.asc represents April of 1986 - 'comp' is fixed in each case. Years range between 1986 to 1995 and months are between 1 and 12. Just to be clear, there are 12 files associated with each year: e.g. comp198601, comp198602, ... comp198612 through to comp199501, comp199502 ... comp199512.

I am trying to automate the reading in of these files, but am struggling to find an adequate way of achieving this. The closest I've got is by doing:

year <- 1986:1995
month <- sprintf("%02d", 1:12) # formats numbers to 2 digits (for maintaining leading zeros in file names)
filelist <- paste("C:\\Documents and Settings\\Data\\comp",year,month,".asc", sep="")
filelist
 [1] "C:\\Documents and Settings\\Data\\comp198601.asc"
 [2] "C:\\Documents and Settings\\Data\\comp198702.asc"
 [3] "C:\\Documents and Settings\\Data\\comp198803.asc"
 [4] "C:\\Documents and Settings\\Data\\comp198904.asc"
 [5] "C:\\Documents and Settings\\Data\\comp199005.asc"
 [6] "C:\\Documents and Settings\\Data\\comp199106.asc"
 [7] "C:\\Documents and Settings\\Data\\comp199207.asc"
 [8] "C:\\Documents and Settings\\Data\\comp199308.asc"
 [9] "C:\\Documents and Settings\\Data\\comp199409.asc"
[10] "C:\\Documents and Settings\\Data\\comp199510.asc"
[11] "C:\\Documents and Settings\\Data\\comp198611.asc"
[12] "C:\\Documents and Settings\\Data\\comp198712.asc"

I need 1986 to remain fixed whilst it cycles through 01 to 12, before it moves onto 1987 and cycles again. There should be 120 outputs in total (10 years each with 12 months), but at present it's only reaching 12 outputs.

I'd be grateful to learn what I'm doing wrong here so that I can solve this.

Many thanks as ever,

Steve
Re: [R] boxplot in subgroups
Thank you very much Annette,

With your help I can do exactly what I wanted to.

Best wishes,
-Pooja

On 26 Mar 2009, at 06:59, Annette Heisswolf wrote:

> Hei,
>
> the error message below just tells you that you have specified nine places along the axis where you want to have labels (i.e. at=1:9) but obviously you have only specified three labels (i.e. labels = c("C","M","S")). Thus, R is complaining that there are not enough labels for all positions. However, I guess from your description that you want to have only three labels below your plot, one for each group of three boxes that share the same ASA1 value. Thus, you could for example do like this:
>
> axis(side=1,at=c(2,5,8),labels=c("C","S","M"))
>
> Then you would get the label always below the middle one of the three boxes of each group. Now the boxes are, however, still distributed evenly across the plot, but in case you want to have those that belong to the same subgroup a bit closer together, you can specify their position via the "at" option on the boxplot function (see ?boxplot). An example with random data might look like this:
>
> ASA1=factor(rep(c("C","S","M"),each=100))
> ASA2=factor(rep(c("C","S","M"),100))
> C1_C2=runif(300)
> boxplot(C1_C2~ASA2*ASA1,xlim=c(0,12),at=c(1:3,5:7,9:11),xaxt="n")
> axis(side=1,at=c(2,6,10),labels=c("C","S","M"))
>
> See ?axis for further options, e.g. tcl=0 in case you don't want to have a tick at the axis at the position of the label.
>
> Hope that helps,
> Annette
>
> Pooja Jain wrote:
>
> > Hi,
> >
> > I have data that looks like this:
> >
> > ASA1  ASA2  C1_C2
> > C     M     9.0225
> > S     S     2.4315
> > M     C     3.4894
> > M     S     4.5282
> > C     M     1.3183
> > C     S     1.3735
> > S     C     1.0488
> > S     M     7.948
> > M     C     4.5827
> >
> > I need to plot Boxplots for a given ASA1 (either C, S, or M) with respect to C1_C2. However, instead of one boxplot I want to plot three boxplots for a given ASA1 value such that each sub-boxplot represent the distribution for C1-C2 for each of the three possible values of ASA2 (C, S or M). I want to show this subgroup with a single xtics (either C, S or M) corresponding to the ASA1 value selected.
> >
> > I tried to do it, but ending with the following error. I am not sure how exactly I should build the object to plot as subgroup.
> >
> > Error in axis(side = 1, at = 1:9, labels = c("C", "M", "S" :
> >   'at' and 'labels' lengths differ, 9 != 3
> > In addition: Warning message:
> > In bxp(list(stats = c(1e-04, 2.82745, 6.0193, 10.5957, 22.2476, :
> >   some notches went outside hinges ('box'): maybe set notch=FALSE
> >
> > I would greatly appreciate any help in this matter.
> >
> > Thank you.
> > -Kanu
>
> --
> Annette Heisswolf
> Section of Ecology
> Department of Biology
> University of Turku
> 20014 Turku, Finland
> phone + 358 2 333 6006
> fax + 358 2 333 6550
> mail annette.heissw...@utu.fi
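One detail in the random-data example is worth checking before reusing it: factor() sorts its levels alphabetically by default, so the three groups of boxes appear in the order C, M, S rather than in the order the values were typed. A hedged sketch of the same plot that takes the axis labels from levels(ASA1) instead of hard-coding them (the seed is arbitrary and only makes the random data repeatable):

```r
set.seed(42)  # arbitrary seed, for repeatability only

ASA1  <- factor(rep(c("C", "S", "M"), each = 100))
ASA2  <- factor(rep(c("C", "S", "M"), 100))
C1_C2 <- runif(300)

levels(ASA1)  # alphabetical order, not the order of first appearance

# ASA2 varies fastest within each ASA1 level, so each block of three
# boxes shares one ASA1 value; 'at' leaves a gap between the blocks.
boxplot(C1_C2 ~ ASA2 * ASA1, xlim = c(0, 12),
        at = c(1:3, 5:7, 9:11), xaxt = "n")

# One label per block, placed under the middle box and taken from the
# factor levels so that labels and boxes cannot get out of step.
axis(side = 1, at = c(2, 6, 10), labels = levels(ASA1))
```

If a different group order is wanted, it can be set when the factor is built, for example with factor(..., levels = c("C", "S", "M")).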
[R] Concerned about Interfacing R with Fortran
I am reading the manual sections illustrating how to call a Fortran subroutine from R. I feel uneasy at the explicit statement about the ".Fortran" interface working with Fortran 77. I would like to call a Fortran-90 subroutine from my R script. Is that supported at all?

Thank you,

Maura
Re: [R] problem with "choose.files" command & interactive functions
Perfect. That's fixed the problem. Thanks very much for the advice.

Shay

> If you don't specify where to open the dialog, it will open in the last location where it was opened. This saves time if you are running it more than once, because often you will open files from the same directory. If you don't want that, you can specify which directory to open, and get it to always open in the current directory using choose.files("./*").
>
> Duncan Murdoch
Re: [R] Competing risks Kalbfleisch & Prentice method
Ravi,

I agree with you, that the Fine & Gray method does much more than calculating the cumulative incidence. The Kalbfleisch & Prentice formula relies on the strong assumption that the coefficients for the cause specific hazard models are valid in the presence of competing risks. You will find my code below, but it may be wrong (I'm not really a statistician!). And no, it doesn't even calculate confidence intervals... Thank you for pointing out these papers.

Please let me know (nicely) if you find bugs!!!

#Method to calculate Cumulative Incidence adjusting for competing risks,
#based on the Kalbfleisch & Prentice formula, 1980, p.169

#create data
ftime <- rexp(200, 0.3)
fstatus <- sample(0:2,200,replace=TRUE)
cov <- matrix(runif(600),nrow=200)
dimnames(cov)[[2]] <- c('x1','x2','x3')
dat=data.frame(ftime, fstatus, cov)

diffrHaz=function(x){
  hazard=x$hazard
  time=x$time
  hz.v=NULL
  for(e in 1:(length(hazard)-1)){
    hz=hazard[e+1]-hazard[e]
    hz.v=c(hz.v,hz)
  }
  dhz=data.frame(hazard=hz.v, time=x$time[-length(x$time)])
  return(dhz)
}

#build cause specific cox models
library(Design)
a.f=cph(Surv(ftime, fstatus==1) ~ x1+x2+x3, data=dat, surv=T, x=T, y=T)
b.f=cph(Surv(ftime, fstatus==2) ~ x1+x2+x3, data=dat, surv=T, x=T, y=T)

#get unique event times - up till the time of interest, eg. time=10
uts=unique(dat$ftime[dat$fstatus>0] )
uts=uts[uts<=10] # the times up till 10 years
uts=uts[order(uts)] #order them

#get baseline hazard rate (cumulative)
a.bz=basehaz(a.f)
b.bz=basehaz(b.f)

#get instantaneous baseline hazard rate, apply above function
a.dhz1=diffrHaz(a.bz)
b.dhz1=diffrHaz(b.bz)

P.t=NULL #initialize where to store all results
for(i in 1:nrow(dat)){ # i is the individual, iterate through the number of rows in dat
  P.i.t=NULL
  all.lambda.i.t=NULL
  for(u.i in 1:length(uts)){
    u=uts[u.i]
    #get the instantaneous baseline hazard from each model corresponding to this
    #timepoint (if there is no event at that time return 0)
    a.dhz.u=ifelse((u %in% a.dhz1$time)==F,0,a.dhz1$hazard[a.dhz1$time==u])
    b.dhz.u=ifelse((u %in% b.dhz1$time)==F,0,b.dhz1$hazard[b.dhz1$time==u])
    #multiply by the linear predictors to get the actual hazard at that point
    a.lambda.i.u=a.dhz.u*exp(a.f$linear.predictors[i])
    b.lambda.i.u=b.dhz.u*exp(b.f$linear.predictors[i])
    all.lambda.i.u=a.lambda.i.u+b.lambda.i.u
    #store in a vector
    all.lambda.i.t=c(all.lambda.i.t, all.lambda.i.u) #build a vector with all previous all.lambda.i.u's
    S.i.u=exp(-sum(all.lambda.i.t)) #this is the probability of surviving till time u
    p.i.u=S.i.u*a.lambda.i.u
    P.i.t=c(P.i.t, p.i.u)
  }
  P.t[i]=sum(P.i.t)
}

# to compare with unadjusted:
a.risk = 1-survest(a.f, linear.predictors=a.f$linear.predictors, times=10)$surv
plot(a.risk ~ P.t) #P.t is the CR adjusted, should always be lower
mean(a.risk)
mean(P.t)

Eleni Rapsomaniki
Research Associate
Tel: +44 (0) 1223 740273
Strangeways Research Laboratory
Department of Public Health and Primary Care
University of Cambridge

-Original Message-
From: Ravi Varadhan [mailto:rvarad...@jhmi.edu]
Sent: 26 March 2009 14:36
To: Eleni Rapsomaniki; 'Arthur Allignol'
Cc: r-help@r-project.org
Subject: RE: [R] Competing risks Kalbfleisch & Prentice method

Hi Eleni,

I would like to take a look at your R function for obtaining the cumulative incidence function (CIF) from individual Cox models for cause-specific hazards (CSH). Does your code predict the CIF (with pointwise confidence intervals and global confidence bands) for different sub-groups? Have you seen the paper by Cheng, Fine, and Wei (Biometrics 1998) that does this?

A major advantage of the F&G model is that you can get a direct, numerical measure of the effect of a covariate on the CIF. This cannot be obtained by modeling all the CSHs and then combining them. The idiosyncratic assumption concerning the risk set in the F&G model is made mainly for mathematical purposes so that a proportional hazards form may be obtained for the CIF. You can test this assumption by plotting Schoenfeld-type residuals (this is available in cmprsk). Fine (Biostatistics 2006) provides a different approach that relaxes this assumption (it also uses a different estimation approach), but I don't know if there is an R implementation for that.

Thanks,
Ravi.

---
Ravi Varadhan, Ph.D.
Assistant Professor, The Center on Aging and Health
Division of Geriatric Medicine and Gerontology
Johns Hopkins University
Ph: (410) 502-2619
Fax: (410) 614-9625
Email: rvarad...@jhmi.edu
Webpage: http://www.jhsph.edu/agingandhealth/People/Faculty/Varadhan.html

-Original Message-
From: r-help-boun...@r-project.org [mailt
Re: [R] Density estimation: scale back for calendar time
Pradeep Raje gmail.com> writes:

>
> Thanks David for your response. I had done that. But not shared it.
> Problem is not with the computations, but in the interpretation.
> Assume that x ordinates are 'time' [1:3472], and y are associated parameter
> values.
> Now density gives me 512/1024/2048 x-ordinates, of which some (7 to be
> precise) are negative. What do I make of the non-zero probability at
> **negative** times?

Obviously you are having problems with interpretation, which is why I suggested that you provide a worked example. Since you appear to be resisting that suggestion, let's talk about hypotheticals.

Two results of a density estimation will be density(dat)$x and density(dat)$y. The first of these is not a probability and could be negative in any instance where negative numbers were in the domain of the dat object. And it might be negative in situations with positive values where the estimation extended slightly beyond the original dataset. It is the y values that are necessarily non-negative.

The idea that dates associated with a numerical object could be used to label a density plot seems strange. The process of density estimation would scramble the order of the original data so that the densities could be calculated. In a sense the y-axis of the original object will become the new x-axis of the density object. There would be no way to restore that "date" order. It had been my hope that a request for an example would make that more clear.

Back to you.

--
David Winsemius
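The point about density(dat)$x reaching past the data can be checked on made-up numbers. The sketch below uses strictly positive random values as a stand-in for the poster's data, which were never posted; the from and to arguments of density() are one way to keep the evaluation grid inside the observed range.

```r
set.seed(1)                    # arbitrary seed, for repeatability only
dat <- rexp(200, rate = 0.3)   # made-up, strictly positive values

d <- density(dat)              # defaults: 512 grid points, cut = 3 bandwidths

range(dat)                     # observed range of the data
range(d$x)                     # evaluation grid: wider than the data on both sides
all(d$y >= 0)                  # the estimated density itself is never negative

# Restrict the grid to the observed range if values outside it make no sense.
d2 <- density(dat, from = min(dat), to = max(dat))
range(d2$x)
```

The x values are only the points at which the density is evaluated, so a negative x here says nothing about a negative probability.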
Re: [R] programming creating different functions in a loop
Good point, Greg. So to handle i as a vector, you'd probably want something like

f <- function(x,i)outer(x,i,"+")

-- Bert

Bert Gunter
Genentech Nonclinical Biostatistics
650-467-7374

-Original Message-
From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Greg Snow
Sent: Thursday, March 26, 2009 9:54 AM
To: Bert Gunter; 'Florin Maican'; r-help@r-project.org
Subject: Re: [R] programming creating different functions in a loop

But wrong questions requiring complicated answers are sometimes more fun :-).

One difference though is in my last example, your function will give a different answer for f(1:3, 1:10), but for the simpler cases, yours is probably the better.

--
Gregory (Greg) L. Snow Ph.D.
Statistical Data Center
Intermountain Healthcare
greg.s...@imail.org
801.408.8111

> -Original Message-
> From: Bert Gunter [mailto:gunter.ber...@gene.com]
> Sent: Thursday, March 26, 2009 10:33 AM
> To: Greg Snow; 'Florin Maican'; r-help@r-project.org
> Subject: RE: [R] programming creating different functions in a loop
>
> ... but perhaps even more straightforward is:
>
> f <- function(i,x)x+i
>
> Of course, we don't know exactly what the poster is after with his rather
> arcane construction, so this may not suit. But you know the rule: if the
> question requires a complex, tricky answer, it's probably the wrong
> question.
>
> Cheers,
> Bert
>
> Bert Gunter
> Genentech Nonclinical Biostatistics
>
> -Original Message-
> From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Greg Snow
> Sent: Thursday, March 26, 2009 9:25 AM
> To: Florin Maican; r-help@r-project.org
> Subject: Re: [R] programming creating different functions in a loop
>
> Anytime that you are tempted to use assign and a loop, you should consider
> using lapply (or sapply) and a list instead.
>
> Consider this alternative:
>
> > f <- lapply( 1:3, function(i){ force(i); function(x) x+i} )
> >
> > f[[1]](3)
> [1] 4
> > f[[2]](10)
> [1] 12
> > f[[3]](0)
> [1] 3
> >
> > sapply( f, function(f) f(1:10) )
>       [,1] [,2] [,3]
>  [1,]    2    3    4
>  [2,]    3    4    5
>  [3,]    4    5    6
>  [4,]    5    6    7
>  [5,]    6    7    8
>  [6,]    7    8    9
>  [7,]    8    9   10
>  [8,]    9   10   11
>  [9,]   10   11   12
> [10,]   11   12   13
>
> Hope this helps,
>
> --
> Gregory (Greg) L. Snow Ph.D.
> Statistical Data Center
> Intermountain Healthcare
> greg.s...@imail.org
> 801.408.8111
>
> > -Original Message-
> > From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Florin Maican
> > Sent: Thursday, March 26, 2009 9:18 AM
> > To: r-help@r-project.org
> > Subject: [R] programming creating different functions in a loop
> >
> > Hi
> >
> > I want to create the following functions in a loop
> >
> > f1<-function(x){x+1}
> > f2<-function(x){x+2}
> > f3<-function(x){x+3}
> >
> > Output f1(2)=3
> >        f2(2)=4
> >        f3(2)=5
> >
> > I tried to create the in a loop as bellow but I get wrong on answers
> > because the value of i change
> >
> > for(i in 1:3){
> >   assign(paste("f",i,sep="")
> >     ,function(x)
> >       x+i
> >   )
> > } # end for
> >
> > Output f1(2)=5
> >        f2(2)=5
> >        f3(2)=5
> >
> > But it is not what I want. The question is how I can
> > fix in R the value of "i" in my functions? I tried to use assign() and
> > get(),but I did not manage.
> >
> > Thanks in advance,
> > Florin
> >
> > --
> > Florin Maican
> > ==
> >
> > Department of Economics,
> > School of Business, Economics and Law,
> > Gothenburg University, Sweden
> > ---
> > P.O. Box 640 SE-405 30,
> > Gothenburg, Sweden
> >
> > Mobil: +46 76 235 3039
> > Phone: +46 31 786 4866
> > Fax: +46 31 786 4154
> > Home Page: http://maicanfg.googlepages.com/index.html
> > E-mail: florin.mai...@handels.gu.se
> >
> > "Not everything that counts can be
> > counted, and not everything that can be
> > counted counts."
> > --- Einstein ---
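The difference between the two definitions discussed in this message only shows up when i has more than one element. A small sketch; the names f_plus and f_outer are chosen here to keep the two versions apart and do not appear in the thread:

```r
f_plus  <- function(x, i) x + i              # element-wise addition
f_outer <- function(x, i) outer(x, i, "+")   # every x combined with every i

# With a single i the plain version is all that is needed.
f_plus(1:3, 1)

# With a vector of i, outer() returns one row per x and one column per i.
m <- f_outer(1:3, 1:10)
dim(m)
m[2, 5]    # x = 2 combined with i = 5
```

Calling f_plus(1:3, 1:10) instead would recycle the shorter vector, with a warning because 10 is not a multiple of 3, which is rarely what is intended.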
Re: [R] programming creating different functions in a loop
But wrong questions requiring complicated answers are sometimes more fun :-).

One difference though is in my last example, your function will give a different answer for f(1:3, 1:10), but for the simpler cases, yours is probably the better.

--
Gregory (Greg) L. Snow Ph.D.
Statistical Data Center
Intermountain Healthcare
greg.s...@imail.org
801.408.8111

> -Original Message-
> From: Bert Gunter [mailto:gunter.ber...@gene.com]
> Sent: Thursday, March 26, 2009 10:33 AM
> To: Greg Snow; 'Florin Maican'; r-help@r-project.org
> Subject: RE: [R] programming creating different functions in a loop
>
> ... but perhaps even more straightforward is:
>
> f <- function(i,x)x+i
>
> Of course, we don't know exactly what the poster is after with his rather
> arcane construction, so this may not suit. But you know the rule: if the
> question requires a complex, tricky answer, it's probably the wrong
> question.
>
> Cheers,
> Bert
>
> Bert Gunter
> Genentech Nonclinical Biostatistics
>
> -Original Message-
> From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Greg Snow
> Sent: Thursday, March 26, 2009 9:25 AM
> To: Florin Maican; r-help@r-project.org
> Subject: Re: [R] programming creating different functions in a loop
>
> Anytime that you are tempted to use assign and a loop, you should consider
> using lapply (or sapply) and a list instead.
>
> Consider this alternative:
>
> > f <- lapply( 1:3, function(i){ force(i); function(x) x+i} )
> >
> > f[[1]](3)
> [1] 4
> > f[[2]](10)
> [1] 12
> > f[[3]](0)
> [1] 3
> >
> > sapply( f, function(f) f(1:10) )
>       [,1] [,2] [,3]
>  [1,]    2    3    4
>  [2,]    3    4    5
>  [3,]    4    5    6
>  [4,]    5    6    7
>  [5,]    6    7    8
>  [6,]    7    8    9
>  [7,]    8    9   10
>  [8,]    9   10   11
>  [9,]   10   11   12
> [10,]   11   12   13
>
> Hope this helps,
>
> --
> Gregory (Greg) L. Snow Ph.D.
> Statistical Data Center
> Intermountain Healthcare
> greg.s...@imail.org
> 801.408.8111
>
> > -Original Message-
> > From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Florin Maican
> > Sent: Thursday, March 26, 2009 9:18 AM
> > To: r-help@r-project.org
> > Subject: [R] programming creating different functions in a loop
> >
> > Hi
> >
> > I want to create the following functions in a loop
> >
> > f1<-function(x){x+1}
> > f2<-function(x){x+2}
> > f3<-function(x){x+3}
> >
> > Output f1(2)=3
> >        f2(2)=4
> >        f3(2)=5
> >
> > I tried to create the in a loop as bellow but I get wrong on answers
> > because the value of i change
> >
> > for(i in 1:3){
> >   assign(paste("f",i,sep="")
> >     ,function(x)
> >       x+i
> >   )
> > } # end for
> >
> > Output f1(2)=5
> >        f2(2)=5
> >        f3(2)=5
> >
> > But it is not what I want. The question is how I can
> > fix in R the value of "i" in my functions? I tried to use assign() and
> > get(),but I did not manage.
> >
> > Thanks in advance,
> > Florin
> >
> > --
> > Florin Maican
> > ==
> >
> > Department of Economics,
> > School of Business, Economics and Law,
> > Gothenburg University, Sweden
> > ---
> > P.O. Box 640 SE-405 30,
> > Gothenburg, Sweden
> >
> > Mobil: +46 76 235 3039
> > Phone: +46 31 786 4866
> > Fax: +46 31 786 4154
> > Home Page: http://maicanfg.googlepages.com/index.html
> > E-mail: florin.mai...@handels.gu.se
> >
> > "Not everything that counts can be
> > counted, and not everything that can be
> > counted counts."
> > --- Einstein ---
[R] Locale problem between WinXP and Ubuntu
Hi R list,

I have a peculiar problem in trying to Sweave a document in Ubuntu. The error message I get is the following:

Error in gsub("\\n", "_", dd_name_level_nominal[k, 1], perl = TRUE) :
  input string 1 is invalid in this locale

dd_name_level_nominal[k, 1] is a character object. I initially read these characters from a csv file which I create in Excel 2007.

I have created the tex documents on my local WinXP machine, but when I try to create them on my Ubuntu server, it doesn't seem to work (and throws the above error). I need to create them on the Ubuntu server, since I am using Rserv and using R as a service. I wish I could do everything on one platform (WinXP preferably) but there isn't an R server for Windows which is as good or better than Rserv for Linux.

This may be a very trivial problem, but I have tried to change the locale on R Ubuntu to en_us.utf8. On the Windows machine the locale is

"LC_COLLATE=English_United States.1252;LC_CTYPE=English_United States.1252;LC_MONETARY=English_United States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252"

and

> l10n_info()
$MBCS
[1] FALSE

$`UTF-8`
[1] FALSE

$`Latin-1`
[1] TRUE

$codepage
[1] 1252

The locales I test in R on Ubuntu do not provide Latin-1. Is there a way to circumvent this problem? I have tried to clean my input csv file by removing spaces and other special characters. I only have alphabetic characters and '_'. There could be a newline character in there, but I am not able to remove it in a csv reader such as Excel.

Thanks for your help.

Harsh Singhal
Decision Systems, Mu Sigma Inc.
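One possible cause, offered only as an assumption since the CSV file itself was not posted, is that the file written by Excel on Windows contains Windows-1252 / Latin-1 bytes that are not valid UTF-8 on the Ubuntu side. If so, converting the strings with iconv() before calling gsub() may help. The sketch below uses a made-up string in place of one cell of dd_name_level_nominal:

```r
# Made-up stand-in for one value read from the CSV: "cafe" with an e-acute
# stored as the single Latin-1 byte 0xE9, followed by a newline and "bar".
x <- "caf\xe9\nbar"
Encoding(x) <- "latin1"          # declare how the bytes should be interpreted

# Convert to UTF-8 so the string is valid in a UTF-8 locale.
y <- iconv(x, from = "latin1", to = "UTF-8")
validUTF8(y)

# The substitution from the error message now has valid input to work on.
res <- gsub("\\n", "_", y, perl = TRUE)
res
```

Alternatively, the encoding can be declared when the file is read, for example read.csv(file, fileEncoding = "latin1"), assuming the file really is in that encoding.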
Re: [R] programming creating different functions in a loop
... but perhaps even more straightforward is:

f <- function(i,x)x+i

Of course, we don't know exactly what the poster is after with his rather arcane construction, so this may not suit. But you know the rule: if the question requires a complex, tricky answer, it's probably the wrong question.

Cheers,
Bert

Bert Gunter
Genentech Nonclinical Biostatistics

-Original Message-
From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Greg Snow
Sent: Thursday, March 26, 2009 9:25 AM
To: Florin Maican; r-help@r-project.org
Subject: Re: [R] programming creating different functions in a loop

Anytime that you are tempted to use assign and a loop, you should consider using lapply (or sapply) and a list instead.

Consider this alternative:

> f <- lapply( 1:3, function(i){ force(i); function(x) x+i} )
>
> f[[1]](3)
[1] 4
> f[[2]](10)
[1] 12
> f[[3]](0)
[1] 3
>
> sapply( f, function(f) f(1:10) )
      [,1] [,2] [,3]
 [1,]    2    3    4
 [2,]    3    4    5
 [3,]    4    5    6
 [4,]    5    6    7
 [5,]    6    7    8
 [6,]    7    8    9
 [7,]    8    9   10
 [8,]    9   10   11
 [9,]   10   11   12
[10,]   11   12   13

Hope this helps,

--
Gregory (Greg) L. Snow Ph.D.
Statistical Data Center
Intermountain Healthcare
greg.s...@imail.org
801.408.8111

> -Original Message-
> From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Florin Maican
> Sent: Thursday, March 26, 2009 9:18 AM
> To: r-help@r-project.org
> Subject: [R] programming creating different functions in a loop
>
> Hi
>
> I want to create the following functions in a loop
>
> f1<-function(x){x+1}
> f2<-function(x){x+2}
> f3<-function(x){x+3}
>
> Output f1(2)=3
>        f2(2)=4
>        f3(2)=5
>
> I tried to create the in a loop as bellow but I get wrong on answers
> because the value of i change
>
> for(i in 1:3){
>   assign(paste("f",i,sep="")
>     ,function(x)
>       x+i
>   )
> } # end for
>
> Output f1(2)=5
>        f2(2)=5
>        f3(2)=5
>
> But it is not what I want. The question is how I can
> fix in R the value of "i" in my functions? I tried to use assign() and
> get(),but I did not manage.
>
> Thanks in advance,
> Florin
>
> --
> Florin Maican
> ==
>
> Department of Economics,
> School of Business, Economics and Law,
> Gothenburg University, Sweden
> ---
> P.O. Box 640 SE-405 30,
> Gothenburg, Sweden
>
> Mobil: +46 76 235 3039
> Phone: +46 31 786 4866
> Fax: +46 31 786 4154
> Home Page: http://maicanfg.googlepages.com/index.html
> E-mail: florin.mai...@handels.gu.se
>
> "Not everything that counts can be
> counted, and not everything that can be
> counted counts."
> --- Einstein ---
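The reason the original loop gives 5 every time is that each function looks up i when it is called, and all three share the one i left behind by the loop, which is 3 by then. If separately named functions f1, f2, f3 are really wanted, a small function factory gives each function its own copy of i. This is a sketch only; make_adder is a name invented here for illustration:

```r
# Factory: each call creates a new environment holding its own i.
make_adder <- function(i) {
  force(i)              # evaluate i now, not when the adder is first called
  function(x) x + i
}

# Same naming scheme as in the question: f1, f2, f3.
for (i in 1:3) {
  assign(paste("f", i, sep = ""), make_adder(i))
}

f1(2)
f2(2)
f3(2)
```

The list-based lapply() version earlier in the thread relies on the same mechanism and is usually easier to work with than a set of numbered names.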
Re: [R] Way to rotate a histogram?
In case anyone is still interested, a slight improvement is to plot both density and normal distributions on top of the empirical histogram (previous version plotted only density):

library(ggplot2)
test_data<-rnorm(100,mean=10,sd=4)
a = data.frame(obs = test_data,condition = 'None')

p1 = ggplot(
    data = a
    ,aes( x = obs )
)+geom_histogram(
    aes( y = ..density.. )
)+stat_density(
    mapping=aes(ymax=max(..density..))
    ,geom='path'
    ,colour='red'
)+stat_function(
    fun = dnorm
    ,args = list( m=mean(a$obs) ,sd=sd(a$obs) )
    ,colour = 'green'
)+scale_x_continuous(
    limits = range(a$obs)
)+opts(
    panel.grid.minor = theme_blank()
    ,panel.grid.major = theme_blank()
    ,panel.background = theme_rect()
)+coord_flip()

p2 = ggplot(
    data = a
    ,aes( x = condition ,y = obs )
)+geom_boxplot(
)+scale_y_continuous(
    limits = range(a$obs)
)+scale_x_discrete(
    name = ''
    ,labels = ''
)+opts(
    panel.grid.minor = theme_blank()
    ,panel.grid.major = theme_blank()
    ,panel.background = theme_rect()
    ,axis.ticks = theme_blank()
    ,axis.text.y = theme_blank()
    ,axis.title.y = theme_blank()
)

p3 = ggplot(
    data = a
    ,aes( sample = (obs-mean(obs))/sd(obs) )
)+stat_qq(
    distribution=qnorm
)+geom_abline(
    intercept=0
    ,slope=1
)+opts(
    panel.grid.minor = theme_blank()
    ,panel.grid.major = theme_blank()
    ,panel.background = theme_rect()
    ,axis.ticks = theme_blank()
    ,axis.text.y = theme_blank()
    ,axis.title.y = theme_blank()
)

print(p1,vp = viewport(width = 1/3,height = 1,x = 1/3*.5,y = .5))
print(p2,vp = viewport(width = 1/3,height = 1,x = 1/3+1/3*.5,y = .5))
print(p3,vp = viewport(width = 1/3,height = 1,x = 2/3+1/3*.5,y = .5))

--
Mike Lawrence
Graduate Student
Department of Psychology
Dalhousie University

Looking to arrange a meeting? Check my public calendar:
http://tinyurl.com/mikes-public-calendar

~ Certainty is folly... I think. ~
Re: [R] How to rotate axis labels? 2009
Usuario R gmail.com> writes:
>
> Hi,
>
> What you want is done with argument srt of par function. See ?par and then
> look for srt argument.
>
> Regards
>

To clarify: if you just want labels rotated parallel/perpendicular to axes, then par("las") will do everything you need. If you need arbitrary rotation (e.g. 45 degrees), then as far as I know the hack in the FAQ is still the appropriate solution.

Ben Bolker
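For illustration only (this was not part of the original message), here is a minimal base-graphics sketch of both cases discussed in this thread. The first plot uses las; the second follows the general idea of the FAQ workaround, where the axis labels are suppressed and drawn with text() instead. The label strings and the 0.5 vertical offset are made-up values that would need adjusting for real data.

```r
# Case 1: las = 1 draws all axis labels horizontally
plot(1:10, las = 1)

# Case 2: arbitrary rotation, in the spirit of the FAQ workaround
labs <- paste("Label", 1:10)            # hypothetical labels
plot(1:10, xaxt = "n", xlab = "")       # suppress the default x axis
axis(1, at = 1:10, labels = FALSE)      # draw tick marks only
text(x = 1:10,
     y = par("usr")[3] - 0.5,           # a little below the plot region (assumed offset)
     labels = labs,
     srt = 45,                          # rotation angle in degrees
     adj = 1,                           # right-align so labels end at the tick
     xpd = TRUE)                        # allow drawing outside the plot region
```

Note that srt is honoured by text(), which is why the labels have to be drawn by hand in the second case.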
Re: [R] programming creating different functions in a loop
Anytime that you are tempted to use assign and a loop, you should consider using lapply (or sapply) and a list instead. Consider this alternative:

> f <- lapply( 1:3, function(i){ force(i); function(x) x+i} )
>
> f[[1]](3)
[1] 4
> f[[2]](10)
[1] 12
> f[[3]](0)
[1] 3
>
> sapply( f, function(f) f(1:10) )
      [,1] [,2] [,3]
 [1,]    2    3    4
 [2,]    3    4    5
 [3,]    4    5    6
 [4,]    5    6    7
 [5,]    6    7    8
 [6,]    7    8    9
 [7,]    8    9   10
 [8,]    9   10   11
 [9,]   10   11   12
[10,]   11   12   13

Hope this helps,

--
Gregory (Greg) L. Snow Ph.D.
Statistical Data Center
Intermountain Healthcare
greg.s...@imail.org
801.408.8111

> -Original Message-
> From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Florin Maican
> Sent: Thursday, March 26, 2009 9:18 AM
> To: r-help@r-project.org
> Subject: [R] programming creating different functions in a loop
>
> Hi
>
> I want to create the following functions in a loop
>
> f1<-function(x){x+1}
> f2<-function(x){x+2}
> f3<-function(x){x+3}
>
> Output f1(2)=3
>        f2(2)=4
>        f3(2)=5
>
> I tried to create the in a loop as bellow but I get wrong on answers
> because the value of i change
>
> for(i in 1:3){
>   assign(paste("f",i,sep="")
>          ,function(x)
>              x+i
>         )
> } # end for
>
> Output f1(2)=5
>        f2(2)=5
>        f3(2)=5
>
> But it is not what I want. The question is how I can
> fix in R the value of "i" in my functions? I tried to use assign() and
> get(),but I did not manage.
>
> Thanks in advance,
> Florin
>
> --
> Florin Maican
> ==
> Department of Economics,
> School of Business, Economics and Law,
> Gothenburg University, Sweden
> ---
> P.O. Box 640 SE-405 30,
> Gothenburg, Sweden
>
> Mobil: +46 76 235 3039
> Phone: +46 31 786 4866
> Fax:   +46 31 786 4154
> Home Page: http://maicanfg.googlepages.com/index.html
> E-mail: florin.mai...@handels.gu.se
>
> "Not everything that counts can be
> counted, and not everything that can be
> counted counts."
> --- Einstein ---
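As a hedged follow-up sketch (not from the original post), the same idea can be written as a named "function factory", which may make the role of force() easier to see. The names make_adder and adders are made up for this illustration. force(i) evaluates the lazily passed argument at the moment the factory is called, so each returned function keeps its own value of i.

```r
# Hypothetical helper: returns a function that adds a fixed amount i
make_adder <- function(i) {
  force(i)              # evaluate the argument now, not when the closure is first called
  function(x) x + i
}

# Build the three functions as list elements instead of f1, f2, f3 via assign()
adders <- lapply(1:3, make_adder)
names(adders) <- c("f1", "f2", "f3")

adders$f1(2)   # 3
adders$f2(2)   # 4
adders$f3(2)   # 5
```

Keeping the functions in a named list means they can still be reached by name (adders$f2) while remaining easy to loop over with lapply or sapply.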
Re: [R] installing contributed programs
On 3/26/2009 10:11 AM, David Covell wrote:
> Dear R-help,
>
> I'm sure this is contained within the documentation, but I have not yet located it (with good effort nonetheless). I am attempting to install the binary for randomForests. After unpacking the zip, I extracted the contents to my R-2.8.1 folder. However, when I try to execute the command library(randomForest) I get an unable to locate error. Is there another library request that needs to be made? I see that the zip extraction created the RandomForest folder under the R-2.8.1 folder. Is there a way to specify this location within the library command?

You should let R do the installation. The easiest way on Windows (which is implied by the .zip extension on your binary, but you should have said...) is within the Rgui: menu item "Packages|Install package(s) from local zip files" (or even better, install from CRAN).

Duncan Murdoch
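For readers who prefer the console to the Rgui menu, a rough sketch of the equivalent call follows. It assumes a Windows binary .zip; the helper name install_local_zip and the example path are hypothetical. Passing repos = NULL tells install.packages() to treat its first argument as a local file rather than a package name to look up in a repository.

```r
# Hypothetical wrapper: install a package from a local binary zip file
install_local_zip <- function(zip_path, lib = .libPaths()[1]) {
  if (!file.exists(zip_path)) {
    stop("no such file: ", zip_path)
  }
  # repos = NULL means "install from this file", not from CRAN
  install.packages(zip_path, repos = NULL, lib = lib)
}

# Usage (path is made up, so the calls are left commented out):
# install_local_zip("C:/Downloads/randomForest.zip")
# library(randomForest)

# The library trees that library() searches by default:
.libPaths()
```

If a package really has been unpacked somewhere else, library() also accepts a lib.loc argument naming the library directory that contains the package folder, although letting R do the installation is the simpler route.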
Re: [R] programming creating different functions in a loop
Thanks Luke! It works! My mistake was that I used "local binding" only for "i" and not for the whole function.

Best regards,
Florin

On Thu, 26 Mar 2009 10:57:21 -0500 (CDT) l...@stat.uiowa.edu wrote:

> for() does not create separate bindings for the index each iteration,
> so the function bodies see the global binding of i, which in this case
> will be the final value. One possible solution is to use local(),
> e.g.
>
> for(i in 1:3){
>     assign(paste("f",i,sep=""),
>            local({
>                k <- i # create local binding with current loop index value
>                function(x) x + k
>            }))
> }
>
> luke
>
> On Thu, 26 Mar 2009, Florin Maican wrote:
>
> > Hi
> >
> > I want to create the following functions in a loop
> >
> > f1<-function(x){x+1}
> > f2<-function(x){x+2}
> > f3<-function(x){x+3}
> >
> > Output f1(2)=3
> >        f2(2)=4
> >        f3(2)=5
> >
> > I tried to create the in a loop as bellow but I get wrong on answers
> > because the value of i change
> >
> > for(i in 1:3){
> >   assign(paste("f",i,sep="")
> >          ,function(x)
> >              x+i
> >         )
> > } # end for
> >
> > Output f1(2)=5
> >        f2(2)=5
> >        f3(2)=5
> >
> > But it is not what I want. The question is how I can
> > fix in R the value of "i" in my functions? I tried to use assign()
> > and get(),but I did not manage.
> >
> > Thanks in advance,
> > Florin

--
Florin G. Maican
==
Ph.D. candidate,
Department of Economics,
School of Business, Economics and Law,
Gothenburg University, Sweden
---
P.O. Box 640 SE-405 30,
Gothenburg, Sweden

Mobil: +46 76 235 3039
Phone: +46 31 786 4866
Fax:   +46 31 786 4154
Home Page: http://maicanfg.googlepages.com/index.html
E-mail: florin.mai...@handels.gu.se

"Not everything that counts can be counted, and not everything that can be counted counts."
--- Einstein ---
Re: [R] How to rotate axis labels? 2009
Hi,

What you want is done with argument srt of par function. See ?par and then look for srt argument.

Regards

2009/3/26 Jonas Stein

> Hi,
>
> while searching for a solution i found many solutions in the internet.
> But the postings seemed to be many years old and the workaround was a
> dirty hack like this:
>
> http://cran.r-project.org/doc/FAQ/R-FAQ.html#How-can-I-create-rotated-axis-labels_003f
>
> Now it's the year 2009 and there might have been some improvement...
> Does anyone know a nice and easy way to turn labels on the y axis in the
> same direction as the x labels?
>
> Thank you very much for reading and hints,
>
> --
> Jonas Stein
Re: [R] standard plot font
Martin Batholdy wrote:
> hi,
>
> I was wondering what font-type R uses in plots (title etc.).
> Is that depending on the system you are working on?
> (working with mac OS)

In fact, it is dependent on the *Device* you are plotting to. See ?Devices and the help page for the device you are going to plot to.

Uwe Ligges

> thanks for help!
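To make the point about devices concrete, here is a small sketch (an illustration, not part of the original reply) using the pdf() device, which takes a family argument. The file names are temporary files created for the example. The generic family names "serif", "sans" and "mono" can also be requested through par(family = ...), and each device maps them to one of its own fonts.

```r
# Font chosen when the device is opened
out1 <- tempfile(fileext = ".pdf")
pdf(out1, family = "Times")
plot(1:10, main = "Title drawn in the device's Times family")
dev.off()

# Font family requested through par() on an already open device
out2 <- tempfile(fileext = ".pdf")
pdf(out2)
par(family = "serif")
plot(1:10, main = "Title drawn in the device's serif family")
dev.off()
```

Screen devices (such as quartz() on Mac OS) have their own font settings, documented on their help pages.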
Re: [R] How to rotate axis labels? 2009
Jonas Stein wrote:
> Hi,
>
> while searching for a solution i found many solutions in the internet.
> But the postings seemed to be many years old and the workaround was a
> dirty hack like this:
>
> http://cran.r-project.org/doc/FAQ/R-FAQ.html#How-can-I-create-rotated-axis-labels_003f
>
> Now it's the year 2009 and there might have been some improvement...
> Does anyone know a nice and easy way to turn labels on the y axis in the
> same direction as the x labels?

This has always been documented in ?par:

    plot(1:10, las=1)

Uwe Ligges

> Thank you very much for reading and hints,
[R] Odp: installing contributed programs
Hi

r-help-boun...@r-project.org wrote on 26.03.2009 15:11:32:

> Dear R-help,
>
> I'm sure this is contained within the documentation, but I have not yet located
> it (with good effort nonetheless). I am attempting to install the binary for
> randomForests. After unpacking the zip, I extracted the contents to
> my R-2.8.1 folder. However, when I try to execute the command
> library(randomForest) I get an unable to locate error. Is there another
> library request that needs to be made? I see that the zip extraction created
> the RandomForest folder under the R-2.8.1 folder. Is there a way to
> specify this location within the library command?

It should be in the /library subfolder of your R-2.8.1 main directory. If it is not, just copy it there.

Regards
Petr

> Thanks for your help,
> David
[R] How to rotate axis labels? 2009
Hi,

while searching for a solution I found many solutions on the internet. But the postings seemed to be many years old and the workaround was a dirty hack like this:

http://cran.r-project.org/doc/FAQ/R-FAQ.html#How-can-I-create-rotated-axis-labels_003f

Now it's the year 2009 and there might have been some improvement... Does anyone know a nice and easy way to turn labels on the y axis in the same direction as the x labels?

Thank you very much for reading and hints,

--
Jonas Stein
[R] standard plot font
hi, I was wondering what font-type R uses in plots (title etc.). Is that depending on the system you are working on? (working with mac OS) thanks for help! __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] programming creating different functions in a loop
for() does not create separate bindings for the index each iteration, so the function bodies see the global binding of i, which in this case will be the final value. One possible solution is to use local(), e.g.

    for(i in 1:3){
        assign(paste("f",i,sep=""),
               local({
                   k <- i # create local binding with current loop index value
                   function(x) x + k
               }))
    }

luke

On Thu, 26 Mar 2009, Florin Maican wrote:

> Hi
>
> I want to create the following functions in a loop
>
> f1<-function(x){x+1}
> f2<-function(x){x+2}
> f3<-function(x){x+3}
>
> Output f1(2)=3
>        f2(2)=4
>        f3(2)=5
>
> I tried to create the in a loop as bellow but I get wrong on answers
> because the value of i change
>
> for(i in 1:3){
>   assign(paste("f",i,sep="")
>          ,function(x)
>              x+i
>         )
> } # end for
>
> Output f1(2)=5
>        f2(2)=5
>        f3(2)=5
>
> But it is not what I want. The question is how I can
> fix in R the value of "i" in my functions? I tried to use assign() and
> get(),but I did not manage.
>
> Thanks in advance,
> Florin

--
Luke Tierney
Chair, Statistics and Actuarial Science
Ralph E. Wareham Professor of Mathematical Sciences
University of Iowa                  Phone:             319-335-3386
Department of Statistics and        Fax:               319-335-3017
   Actuarial Science
241 Schaeffer Hall                  email:      l...@stat.uiowa.edu
Iowa City, IA 52242                 WWW:  http://www.stat.uiowa.edu
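A small side-by-side sketch of the behaviour described above may help; it is only an illustration, and the list names fs_shared and fs_local are made up. In the first loop every function looks up the same variable i when it is called, after the loop has finished; in the second, local() gives each function a private copy k.

```r
# All three closures share the single loop variable i
fs_shared <- list()
for (i in 1:3) {
  fs_shared[[i]] <- function(x) x + i
}
sapply(fs_shared, function(f) f(2))   # 5 5 5, because i is 3 once the loop ends

# local() creates a fresh environment holding k for each iteration
fs_local <- list()
for (i in 1:3) {
  fs_local[[i]] <- local({
    k <- i                # evaluated immediately, inside the new environment
    function(x) x + k
  })
}
sapply(fs_local, function(f) f(2))    # 3 4 5
```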
[R] Odp: Tables
Hi

r-help-boun...@r-project.org wrote on 26.03.2009 16:39:18:

> Hi,
>
> I am just starting using R. Hence, sorry for asking probably rather easy
> questions.
>
> I used "read.table" to bring an .txt Table to R. Unfortunately the
> columns do not have the same length. I tried "fill=TRUE", to fill the
> blank space with "na". In a certain kind of way it worked, but befor
> filling the spaces it moved data from later columns to the first:
>
> 1 a b c      1 a b c
> 2 a b c      2 a b c
> 3 a b c      3 a b c
> 4   b c      4 b c NA
> 5     c      5 c NA NA

How do you know that c is in the third column? If you can expect fixed-width fields, try read.fwf, but if there is nothing that separates the fields and they have variable length, it would be quite difficult. Maybe read each line and then use some regexpr function to properly divide each line into appropriate columns. But maybe somebody will come up with a better idea.

Regards
Petr

> What is my mistake?
>
> Thank you
> Sebastian
>
> --
> ***
>
> Dipl. Biol. Sebastian Krug
> PhD - student
> IFM - GEOMAR
> Leibniz Institute of Marine Sciences
> Research Division 2 - Marine Biogeochemistry
> Düsternbrooker Weg 20
> D - 24105 Kiel
> Germany
>
> Tel.: +49 431 600-4282
> Fax.: +49 431 600-4446
> email: sk...@ifm-geomar.de
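To illustrate the read.fwf suggestion, here is a hedged sketch on a made-up file that mimics the table in the question. It assumes every field really does sit at a fixed position (row label in characters 1-2, then three two-character or one-character fields); the widths and column names are assumptions for this example, not something known about the poster's file.

```r
# Made-up fixed-width input resembling the example in the question
txt <- c("1 a b c",
         "2 a b c",
         "3 a b c",
         "4   b c",
         "5     c")
tmp <- tempfile(fileext = ".txt")
writeLines(txt, tmp)

# widths: 2 characters for the id, then 2, 2 and 1 for the three columns
tab <- read.fwf(tmp,
                widths = c(2, 2, 2, 1),
                col.names = c("id", "a", "b", "c"),
                strip.white = TRUE,            # trim the padding inside each field
                na.strings = c("NA", ""),      # treat empty fields as missing
                stringsAsFactors = FALSE)
tab
```

Because the columns are cut by position rather than by whitespace, the "c" in the last row stays in the third column and the empty positions become NA.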
[R] the by function
Hey, I have a dataframe with subjects who have watched films. The variables of interest are Pixel number and Temperature on the face. For each subject there are 8 films, for each film for each subject I need to measure the mean number of pixels then merge this vector with the data frame. I have mean.pixels<-as.data.frame(by(Final[,5],Final[,1:2],mean)[1:13,]) where column 5 is the pixels variable and 1 and 2 are subject and film number respectively. Now I can do this, but the average value of the pixel is usually around 7000. For some reason the machine has only calculated a certain second of the film as having say 500 pixels. I need to use the by function to calculate the mean pixels but I only want it to consider those values of the pixel variable that are above 1000. Does anyone know how I can modify the command I already have to make sure this happens? Sorry it's a bit confusing but I find it hard to explain. -- View this message in context: http://www.nabble.com/the-by-function-tp22723918p22723918.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
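One possible way to approach this question, offered only as a sketch: drop the rows at or below the threshold first, then compute the group means and merge them back. The data frame below is synthetic, and the column names subject, film and pixels are assumptions (the post only says that columns 1, 2 and 5 of Final hold these variables).

```r
# Synthetic stand-in for the poster's data frame
Final <- data.frame(
  subject = rep(1:2, each = 6),
  film    = rep(rep(1:2, each = 3), times = 2),
  pixels  = c(7000, 7100, 500, 6900, 7050, 7200,
               400, 7000, 7300, 7100, 300, 6800)
)

# Keep only the rows where the pixel count is above 1000
keep <- subset(Final, pixels > 1000)

# Mean pixels per subject and film, computed on the filtered rows only
mean.pixels <- aggregate(pixels ~ subject + film, data = keep, FUN = mean)
names(mean.pixels)[3] <- "mean.pixels"

# Attach the group means to every row of the original data
Final2 <- merge(Final, mean.pixels, by = c("subject", "film"))
head(Final2)
```

The same filtering idea works with by() itself, for example by(keep$pixels, keep[, c("subject", "film")], mean); aggregate() is used here only because its result is already a data frame that merge() can take directly.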
[R] installing contributed programs
Dear R-help, I'm sure this is contained within the documentation, but I have not yet located it (with good effort nonetheless). I am attempting to install the binary for randomForests. After unpacking the zip, I extracted the contents to my R-2.8.1 folder. However, when I try to execute the command library(randomForest) I get an unable to locate error. Is there another library request that needs to be made? I see that the zip extraction created the RandomForest folder under the R-2.8.1 folder. Is there a way to specify this location within the library command? Thanks for your help, David __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Tables
How are you sure it moved the data? What is the column separator that you have? Is it just 'white space' as opposed to a tab or comma? If you have a CSV file with a separator, the system knows where the columns are. If it sees 'a b c' and then 'a c' as the next row, 'c' will be in column 2 even though you thought 'b' was missing.

On Thu, Mar 26, 2009 at 11:39 AM, skrug wrote:
> Hi,
>
> I am just starting using R. Hence, sorry for asking probably rather easy
> questions.
>
> I used "read.table" to bring an .txt Table to R. Unfortunately the columns
> do not have the same length. I tried "fill=TRUE", to fill the blank space
> with "na". In a certain kind of way it worked, but befor filling the spaces
> it moved data from later columns to the first:
>
> 1 a b c      1 a b c
> 2 a b c      2 a b c
> 3 a b c      3 a b c
> 4   b c      4 b c NA
> 5     c      5 c NA NA
>
> What is my mistake?
>
> Thank you
> Sebastian
>
> --
> ***
>
> Dipl. Biol. Sebastian Krug
> PhD - student
> IFM - GEOMAR
> Leibniz Institute of Marine Sciences
> Research Division 2 - Marine Biogeochemistry
> Düsternbrooker Weg 20
> D - 24105 Kiel
> Germany
>
> Tel.: +49 431 600-4282
> Fax.: +49 431 600-4446
> email: sk...@ifm-geomar.de

--
Jim Holtman
Cincinnati, OH
+1 513 646 9390

What is the problem that you are trying to solve?
[R] Tables
Hi,

I am just starting to use R. Hence, sorry for asking probably rather easy questions.

I used "read.table" to bring a .txt table into R. Unfortunately the columns do not have the same length. I tried "fill=TRUE" to fill the blank space with "NA". In a certain kind of way it worked, but before filling the spaces it moved data from later columns to the first (input on the left, result on the right):

1 a b c      1 a b c
2 a b c      2 a b c
3 a b c      3 a b c
4   b c      4 b c NA
5     c      5 c NA NA

What is my mistake?

Thank you
Sebastian

--
***

Dipl. Biol. Sebastian Krug
PhD - student
IFM - GEOMAR
Leibniz Institute of Marine Sciences
Research Division 2 - Marine Biogeochemistry
Düsternbrooker Weg 20
D - 24105 Kiel
Germany

Tel.: +49 431 600-4282
Fax.: +49 431 600-4446
email: sk...@ifm-geomar.de
[R] arima, xreg, and the armax model
Hello all, I'm having fun again with the arima function. This time I read in: http://www.stat.pitt.edu/stoffer/tsa2/R_time_series_quick_fix.htm <> (by R.H. Shumway & D.S. Stoffer) This is quite surprising... Does anybody know anything about it? Marc Vinyes (AleaSoft) [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] package installation on OSX --- suggestion
dear R experts:

I am trying to install packages in OSX, R 2.8.1. Since I do this about every 2 years, I have completely forgotten it. However, this should not be difficult:

http://wiki.r-project.org/rwiki/doku.php?id=getting-started:installation:packages

nice document. beautiful method. so, I start with update.packages(). the final message tells me that it saved all the packages into /var/folders/Ia/IaQbr8K+GQ8DqdaGMAC18yU/-Tmp-/RtmpjRkMV7/downloaded_packages/ . not exactly user-friendly. at this point, I don't know whether they were also installed or just downloaded. the same happens when I do an install.packages("plm", dependencies=TRUE).

would it not make sense if the package were installed in the standard R library location at this point, and the final message were to tell me that the package was indeed installed, and not about the temporary directory? [I suspect that it actually did the install, so this is just a "final message" issue.]

just a suggestion... [and thanks everybody for all the help yesterday. now back to my moments.]

regards,

/ivo
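Until the final message is clearer, one way to confirm that a package was actually installed, and not merely downloaded, is to ask R directly. This is a sketch, not part of the original post; "stats" is used as a stand-in so the snippet runs anywhere, and the package of interest (for example "plm") would be substituted.

```r
pkg <- "stats"   # stand-in; replace with the package just installed, e.g. "plm"

# Library trees where install.packages() puts packages by default
.libPaths()

# TRUE if the package is present in one of those libraries
is_installed <- pkg %in% rownames(installed.packages())
is_installed

# Where it lives on disk ("" would mean it was not found) and which version
system.file(package = pkg)
if (is_installed) packageDescription(pkg)$Version
```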
[R] programming creating different functions in a loop
Hi

I want to create the following functions in a loop

    f1<-function(x){x+1}
    f2<-function(x){x+2}
    f3<-function(x){x+3}

Output f1(2)=3
       f2(2)=4
       f3(2)=5

I tried to create them in a loop as below, but I get wrong answers because the value of i changes

    for(i in 1:3){
      assign(paste("f",i,sep="")
             ,function(x)
                 x+i
            )
    } # end for

Output f1(2)=5
       f2(2)=5
       f3(2)=5

But it is not what I want. The question is how I can fix in R the value of "i" in my functions? I tried to use assign() and get(), but I did not manage.

Thanks in advance,
Florin

--
Florin Maican
==
Department of Economics,
School of Business, Economics and Law,
Gothenburg University, Sweden
---
P.O. Box 640 SE-405 30,
Gothenburg, Sweden

Mobil: +46 76 235 3039
Phone: +46 31 786 4866
Fax:   +46 31 786 4154
Home Page: http://maicanfg.googlepages.com/index.html
E-mail: florin.mai...@handels.gu.se

"Not everything that counts can be counted, and not everything that can be counted counts."
--- Einstein ---
[R] Travel funding for DSC/useR 2009 for young researchers at U.S. institutions
*** Travel and Accommodation Support ***
[Deadline extended to April 15, 2009]

Funds from the U.S. National Science Foundation may be available to provide partial support for travel and accommodation for some graduate students and junior faculty at U.S. post-secondary institutions to attend DSC 2009 and useR 2009. If you wish to apply for this support, please send an application to l...@stat.uiowa.edu. Your application should include

  * a brief CV
  * a copy of your abstract if you have submitted a paper or a poster
  * a statement that demonstrates your eligibility, your need for support, and an amount of support requested
  * students should include a brief letter of support from their supervisor
  * faculty or post-doc applicants should include a brief statement about other funding sources

The allocation will be based on merit and need; women and minority candidates are encouraged to apply. Applications must be made by April 15 (2009-04-15) and successful applicants will be notified by email soon thereafter.

Please visit the conference web pages at http://www.r-project.org/dsc-2009/ and http://www.agrocampus-rennes.fr/math/useR-2009/ for conference details.

--
Luke Tierney
Chair, Statistics and Actuarial Science
Ralph E. Wareham Professor of Mathematical Sciences
University of Iowa                  Phone:             319-335-3386
Department of Statistics and        Fax:               319-335-3017
   Actuarial Science
241 Schaeffer Hall                  email:      l...@stat.uiowa.edu
Iowa City, IA 52242                 WWW:  http://www.stat.uiowa.edu

___
r-annou...@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-announce
Re: [R] Competing risks Kalbfleisch & Prentice method
Hi Eleni,

I would like to take a look at your R function for obtaining the cumulative incidence function (CIF) from individual Cox models for cause-specific hazards (CSH). Does your code predict the CIF (with pointwise confidence intervals and global confidence bands) for different sub-groups? Have you seen the paper by Cheng, Fine, and Wei (Biometrics 1998) that does this?

A major advantage of the F&G model is that you can get a direct, numerical measure of the effect of a covariate on the CIF. This cannot be obtained by modeling all the CSHs and then combining them. The idiosyncratic assumption concerning the risk set in the F&G model is made mainly for mathematical purposes so that a proportional hazards form may be obtained for the CIF. You can test this assumption by plotting Schoenfeld-type residuals (this is available in cmprsk). Fine (Biostatistics 2006) provides a different approach that relaxes this assumption (it also uses a different estimation approach), but I don't know if there is an R implementation for that.

Thanks,
Ravi.

---
Ravi Varadhan, Ph.D.
Assistant Professor, The Center on Aging and Health
Division of Geriatric Medicine and Gerontology
Johns Hopkins University
Ph: (410) 502-2619
Fax: (410) 614-9625
Email: rvarad...@jhmi.edu
Webpage: http://www.jhsph.edu/agingandhealth/People/Faculty/Varadhan.html

-Original Message-
From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Eleni Rapsomaniki
Sent: Thursday, March 26, 2009 10:18 AM
To: Arthur Allignol
Cc: r-help@r-project.org
Subject: Re: [R] Competing risks Kalbfleisch & Prentice method

Thank you for your reply. It wasn't too hard to code actually, which is probably why it doesn't have a special package dedicated to it. The results are almost identical to Fine & Gray regression model. The problem with the latter is that my colleagues are not convinced that the model assumptions (people who die from competing causes remaining in the risk set) are theoretically sound.
If anybody is interested in the Kalbfleisch & Prentice based cumulative incidence adjusting for competing risks with covariates, I'm happy to supply the code. Eleni Rapsomaniki Research Associate Tel: +44 (0) 1223 740273 Strangeways Research Laboratory Department of Public Health and Primary Care University of Cambridge -Original Message- From: Arthur Allignol [mailto:arthur.allig...@fdm.uni-freiburg.de] Sent: 26 March 2009 10:36 To: Eleni Rapsomaniki Cc: r-help@r-project.org Subject: Re: [R] Competing risks Kalbfleisch & Prentice method I don't think there is a package to do that. But you could have a look at ?predict.crr. Best regards, Arthur Allignol Eleni Rapsomaniki wrote: > > > Dear R users > > > > I would like to calculate the Cumulative incidence for an event > adjusting for competing risks and adjusting for covariates. One way to > do this in R is to use the cmprsk package, function crr. This uses the > Fine & Gray regression model. However, a simpler and more classical > approach would be to implement the Kalbfleisch & Prentice method (1980, > p 169), where one fits cause specific cox models for the event of > interest and each type of competing risk, and then calculates incidence > based on the overall survival. I believe that this is what the cuminc > function in the aforementioned package does, but it does not allow to > adjust for a vector of covariates. > > > > My question is, is there an R package that implements the Kalbfleisch & > Prentice method for competing risks with covariates? > > > > for example, if k1 is the cause of interest among k competing causes: > > P_k1(t; x)=P(T<=t, cause=k1|x)=Sum(u=0, ..., u=t) {hazard_k(u;x)*S(u;x)} > > where S(u;x) = exp{-sum_of_k(sum(hazard_k(u))} > > > > I have searched extensively for an implementation of this in many > packages, but it appears that more complex approaches are more commonly > implemented, such as timereg package. 
> > > > Eleni Rapsomaniki > > > > Research Associate > > Strangeways Research Laboratory > > Department of Public Health and Primary Care > > > > University of Cambridge > > > > > > > [[alternative HTML version deleted]] > > __ > R-help@r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. > __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. ___
Re: [R] Competing risks Kalbfleisch & Prentice method
Thank you for your reply. It wasn't too hard to code actually, which is probably why it doesn't have a special package dedicated to it. The results are almost identical to Fine & Gray regression model. The problem with the latter is that my colleagues are not convinced that the model assumptions (people who die from competing causes remaining in the risk set) are theoretically sound. If anybody is interested in the Kalbfleisch & Prentice based cumulative incidence adjusting for competing risks with covariates, I'm happy to supply the code. Eleni Rapsomaniki Research Associate Tel: +44 (0) 1223 740273 Strangeways Research Laboratory Department of Public Health and Primary Care University of Cambridge -Original Message- From: Arthur Allignol [mailto:arthur.allig...@fdm.uni-freiburg.de] Sent: 26 March 2009 10:36 To: Eleni Rapsomaniki Cc: r-help@r-project.org Subject: Re: [R] Competing risks Kalbfleisch & Prentice method I don't think there is a package to do that. But you could have a look at ?predict.crr. Best regards, Arthur Allignol Eleni Rapsomaniki wrote: > > > Dear R users > > > > I would like to calculate the Cumulative incidence for an event > adjusting for competing risks and adjusting for covariates. One way to > do this in R is to use the cmprsk package, function crr. This uses the > Fine & Gray regression model. However, a simpler and more classical > approach would be to implement the Kalbfleisch & Prentice method (1980, > p 169), where one fits cause specific cox models for the event of > interest and each type of competing risk, and then calculates incidence > based on the overall survival. I believe that this is what the cuminc > function in the aforementioned package does, but it does not allow to > adjust for a vector of covariates. > > > > My question is, is there an R package that implements the Kalbfleisch & > Prentice method for competing risks with covariates? 
> > > > for example, if k1 is the cause of interest among k competing causes: > > P_k1(t; x)=P(T<=t, cause=k1|x)=Sum(u=0, ..., u=t) {hazard_k(u;x)*S(u;x)} > > where S(u;x) = exp{-sum_of_k(sum(hazard_k(u))} > > > > I have searched extensively for an implementation of this in many > packages, but it appears that more complex approaches are more commonly > implemented, such as timereg package. > > > > Eleni Rapsomaniki > > > > Research Associate > > Strangeways Research Laboratory > > Department of Public Health and Primary Care > > > > University of Cambridge > > > > > > > [[alternative HTML version deleted]] > > __ > R-help@r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. > __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
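The formula quoted above can be sketched numerically as follows. This is an illustration under stated assumptions, not the poster's code: the hazard increments are made-up constants for a single covariate pattern x, whereas in practice they would come from fitted cause-specific Cox models. The sketch also uses survival just before each time point, S(u-), inside the sum, which is a common convention and differs slightly from the S(u;x) written in the post.

```r
# Hypothetical cause-specific hazard increments at 10 ordered event times,
# for one fixed covariate pattern x (rows = causes, columns = times)
h <- rbind(cause1 = rep(0.02, 10),
           cause2 = rep(0.01, 10))

# Overall survival from the summed cause-specific cumulative hazards:
# S(u) = exp(-sum over causes of the cumulative hazard up to u)
H_total <- cumsum(colSums(h))
S <- exp(-H_total)

# Survival just before each event time (1 before the first time point)
S_prev <- c(1, S[-length(S)])

# Cumulative incidence for cause 1: running sum of hazard_k1(u) * S(u-)
cif1 <- cumsum(h["cause1", ] * S_prev)
cif1
```

Repeating the last step with h["cause2", ] gives the cumulative incidence of the competing cause on the same time grid.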
[R] Some install package fixes for Ubuntu Hardy
I encountered some failures in using install.packages() to install rgl and rJava on some of my (multiple) Ubuntu Hardy systems. A quick search of the 'Net did not show any Debian packages for these. The install.packages messages said header or other files were missing, suggesting path and related woes. Email with Duncan Murdoch (thanks!) pointed the way with rgl and led to a fix for rJava in similar fashion. It may save others some frustration to know my resolution. See below.

However, I do have a question which a brief rummage of r-project did not answer. Where should information like this be put? My opinion is that it should go on the wiki, but possibly there is a better solution if we can get the right messages into the package installers, though I recognize the load that puts on maintainers.

Cheers, JN

Ubuntu Hardy rgl install fix:

The headers gl.h and glu.h are installed with the "dev" packages libgl1-mesa-dev and libglu1-mesa-dev. So the fix is to run (in a terminal as root)

  apt-get install libgl1-mesa-dev
  apt-get install libglu1-mesa-dev

then, in R,

  install.packages("rgl")

etc.

Ubuntu Hardy rJava install fix:

Needed to get the Sun JDK (not the JRE). Then add new links

  ln -s /usr/java/jdkx/bin/java java

and

  ln -s /usr/java/jdkx/bin/javac javac

where xx is the version information on the jdk directory name -- in my case 1.6.0_13 (see below).

Then R CMD javareconf still fails to find the java compiler. It seems $JAVA_HOME may not be defined. Try

  export JAVA_HOME="/usr/java/jdk1.6.0_13/"

Then (as root) R CMD javareconf seems to work, and rJava installed OK. I was then able to install RWeka (my original objective) and it seems to run OK.
[R] Minimal R-installation (and it's size)
Hello,

for very small systems (embedded devices) I am thinking about a minimalistic R installation. When I look at the R binary, it is just 5472 bytes in size. I think this is only a small initial program that afterwards starts other executables and also loads the standard/base package(s).

Can you give me a hint on how to maintain a minimalistic R installation, and how much (or how little) memory/disk space it will presumably need? Or, asked the other way around: how much space (memory/disk) does R need when it is used only with the packages that are initially loaded when I start it up (in a distribution like Debian, for example)?

Ciao, Oliver
[R] Plot the highest point in a contour plot
I'm plotting contour plots with contourplot. which.max gives me the index of the highest point of a matrix. I can find the point in the matrix from here, convert it to a point on a graph and add it with points, but you'd think someone's already written this. How do I plot the highest point in a contourplot automatically? Tom
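The manual route described in the question is short enough to sketch. The example below is an illustration only: the surface z and the grids x and y are made up, and it uses base graphics contour() rather than lattice's contourplot so that it needs no extra package.

```r
# Hypothetical surface on a regular grid; x and y are the axis coordinates.
x <- seq(-2, 2, length.out = 41)
y <- seq(-1, 3, length.out = 41)
z <- outer(x, y, function(a, b) exp(-((a - 0.5)^2 + (b - 1)^2)))

# which.max() returns one linear index; arrayInd() turns it into
# (row, column), which index the x and y grids respectively.
peak   <- arrayInd(which.max(z), dim(z))
peak_x <- x[peak[1, 1]]
peak_y <- y[peak[1, 2]]

# Draw the contours, then mark the maximum on top.
contour(x, y, z)
points(peak_x, peak_y, pch = 19, col = "red")
```

With lattice, the same two coordinates could presumably be drawn from a custom panel function that calls panel.points after the contours; that variant is not shown here.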
Re: [R] problem with "choose.files" command & interactive functions
On 3/26/2009 9:09 AM, Shay O'Farrell wrote:
> Hi all,
>
> I'm new to R, and I'm having a small but annoying problem with the choose.files command, using version 2.8.1 in Windows XP. I am holding my data text files in C:\\Data\\, and I'm writing a short script that first resets the default directory to this directory, then allows me to interactively select which text file to import and attach, using the Windows widget. Here is my script:
>
>   setwd("C:\\Data\\")
>   data<-read.table(choose.files(),header=T,as.is=T)
>   attach(data)
>
> If I copy and paste the script into R Editor (or directly into RGui) and run it, it works perfectly. However, if I have imported the script using the Open Script button in RGui, then I have a glitch... when I run the script, the widget looks in the last folder that I opened interactively (i.e., the folder from which I imported my script file) instead of looking in the new working directory. BUT... if I run line 3 again straight after, it works perfectly the second time.
>
> What am I doing wrong? Any help would be appreciated.

If you don't specify where to open the dialog, it will open in the last location where it was opened. This saves time if you are running it more than once, because often you will open files from the same directory. If you don't want that, you can specify which directory to open, and get it to always open in the current directory using choose.files("./*").

Duncan Murdoch
[R] problem with "choose.files" command & interactive functions
Hi all,

I'm new to R, and I'm having a small but annoying problem with the choose.files command, using version 2.8.1 in Windows XP. I am holding my data text files in C:\\Data\\, and I'm writing a short script that first resets the default directory to this directory, then allows me to interactively select which text file to import and attach, using the Windows widget. Here is my script:

  setwd("C:\\Data\\")
  data<-read.table(choose.files(),header=T,as.is=T)
  attach(data)

If I copy and paste the script into R Editor (or directly into RGui) and run it, it works perfectly. However, if I have imported the script using the Open Script button in RGui, then I have a glitch... when I run the script, the widget looks in the last folder that I opened interactively (i.e., the folder from which I imported my script file) instead of looking in the new working directory. BUT... if I run line 3 again straight after, it works perfectly the second time.

What am I doing wrong? Any help would be appreciated.

Thanks
Shay
Re: [R] same value in column-->delete
Patrizio Frederic wrote:
> this works
>
>   which.is.not.unique <- apply(x,2,function(x)ifelse(length(unique(x))==1,F,T))
>   x[,which.is.not.unique]

or you simplify that idea and say

  x[, apply(x, 2, function(x) length(unique(x)) > 1)]

Uwe Ligges

> patrizio
>
> 2009/3/26 Duijvesteijn, Naomi:
>> Hi Readers,
>>
>> I have a question.
>>
>> I have a large dataset and want to throw away columns that have the same value in the column itself and I want to know which column this was.
>>
>> For example
>>
>>   > x<-data.frame(id=c(1,2,3), snp1=c("A","G","G"),snp2=c("G","G","G"),snp3=c("G","G","A"))
>>   > x
>>     id snp1 snp2 snp3
>>   1  1    A    G    G
>>   2  2    G    G    G
>>   3  3    G    G    A
>>
>> Now I want to know that snp2 is monomorphic (the same value for the column) and after I know which column it is I want to take these columns out.
>>
>> Thanks,
>> Naomi
[R] Column name assignment problem
Dear all,

I'm trying to assign a name to the fourth column whilst using 'assign', but keep encountering errors. What have I done wrong?!

  > assign(colnames(c(paste("arunoff_",table_year, sep="")[4]), "COUNT"))
  Error in if (do.NULL) NULL else if (nc > 0) paste(prefix, seq_len(nc), :
    argument is not interpretable as logical

Hope someone is able to help. Thanks for any pointers,

Steve
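The error message comes from colnames() itself: its second argument is do.NULL, so the string "COUNT" ends up being tested as a logical. One way round this, sketched below, is to fetch the object by name with get(), rename the column, and store it back with assign(). The value of table_year and the contents of the data frame are invented for illustration; only the "arunoff_" prefix and the "COUNT" name come from the post.

```r
table_year <- 1999                                   # hypothetical year
obj_name   <- paste("arunoff_", table_year, sep = "")

# Hypothetical stand-in for the real table.
assign(obj_name, data.frame(a = 1:2, b = 3:4, c = 5:6, d = 7:8))

# Fetch the object by name, rename its fourth column, and store it back.
tmp <- get(obj_name)
colnames(tmp)[4] <- "COUNT"
assign(obj_name, tmp)

colnames(get(obj_name))
```

Keeping the yearly tables in a named list instead of separate variables would avoid the get()/assign() round trip, at the cost of restructuring the surrounding script.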
Re: [R] R: Fortran-90 and R
Perhaps you should also consider simply exporting the data from the R workspace to files on the system and having your Fortran code read those files.

> On Thu, 2009-03-26 at 11:01 +0100, mau...@alice.it wrote:
>> I do hope the chapter about "Interfacing R and Fortran" is sufficient.
>> I have a heavy load of work from two projects and no previous experience in either case (analysis packages, algorithms, and so on ...). I am a postdoc.
>
> Ranjan Maitra posted this link
>
>   http://www.stat.umn.edu/~charlie/rc/
>
> last week. I found this quite simple to follow, so it might be a useful companion to the Writing R Extensions manual already suggested. The link above doesn't specifically address Fortran 90, but if special handling of f90 code is required, the Writing R Extensions manual will tell you what needs to be done differently.
>
> HTH
>
> G
>
>> Thank you.
>> Maura
>>
>> -----Original Message-----
>> From: Peter Dalgaard [mailto:p.dalga...@biostat.ku.dk]
>> Sent: Thu 26/03/2009 8.57
>> To: mau...@alice.it
>> Cc: r-help@r-project.org
>> Subject: Re: [R] Fortran-90 and R
>>
>> mau...@alice.it wrote:
>> > I have some code in Fortran-90 that outperforms my R implementation of the same algorithm (in terms of speed).
>> > I wonder whether it is possible to interface R with Fortran-90; that is, I would like to call a Fortran routine from my R script, passing to the Fortran routine some arrays of real numbers, some scalar real numbers, and an integer.
>> > I have no idea how to do that because Fortran is a compiled language whereas R is an interpreted language, so I do not know whether it makes any sense to build a make file for the linker ... Is the linker necessary?
>> >
>> > Thank you very much.
>> > Maura
>>
>> You need to read this:
>>
>> http://cran.r-project.org/doc/manuals/R-exts.pdf
>>
>> -p
>
> --
> Dr. Gavin Simpson             [t] +44 (0)20 7679 0522
> ECRC, UCL Geography,          [f] +44 (0)20 7679 0565
> Pearson Building,             [e] gavin.simpsonATNOSPAMucl.ac.uk
> Gower Street, London          [w] http://www.ucl.ac.uk/~ucfagls/
> UK. WC1E 6BT.                 [w] http://www.freshwaters.org.uk
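The file-based route suggested at the top of this reply can be sketched from the R side as follows. Everything here is an assumption made for illustration: the values, the one-number-per-line layout (chosen because it is easy to read with a list-directed READ), and the commented-out program name. The Fortran side is not shown.

```r
x     <- c(1.5, 2.25, 3)   # hypothetical real array
alpha <- 0.1               # hypothetical real scalar
n     <- length(x)         # integer count, written first so the reader can size its array

infile <- tempfile(fileext = ".txt")

# One number per line: n, then alpha, then the array elements.
# digits = 17 keeps enough precision to round-trip doubles.
writeLines(c(as.character(n),
             format(alpha, digits = 17),
             format(x, digits = 17)),
           infile)

# A compiled program would be launched here, e.g.
# system2("./my_fortran_prog", args = infile)   # hypothetical executable

# Reading the file back shows what the Fortran program would see.
back <- scan(infile, quiet = TRUE)
```

This avoids compiling against R altogether; the trade-off is the cost of writing and parsing text files, which calling the routine through .Fortran() as described in Writing R Extensions would not incur.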
Re: [R] same value in column-->delete
this works

  which.is.not.unique <- apply(x,2,function(x)ifelse(length(unique(x))==1,F,T))
  x[,which.is.not.unique]

patrizio

2009/3/26 Duijvesteijn, Naomi:
> Hi Readers,
>
> I have a question.
>
> I have a large dataset and want to throw away columns that have the same value in the column itself and I want to know which column this was.
>
> For example
>
>   > x<-data.frame(id=c(1,2,3), snp1=c("A","G","G"),snp2=c("G","G","G"),snp3=c("G","G","A"))
>   > x
>     id snp1 snp2 snp3
>   1  1    A    G    G
>   2  2    G    G    G
>   3  3    G    G    A
>
> Now I want to know that snp2 is monomorphic (the same value for the column) and after I know which column it is I want to take these columns out.
>
> Thanks,
> Naomi
Re: [R] Centring variables in Cox Proportional Hazards Model
Laura Bonnett wrote:
> Dear All,
>
> I am contemplating centering the covariates in my Cox model to reduce multicollinearity between the predictors and the interaction term and to render a more meaningful interpretation of the regression coefficient. Suppose I have two indicator variables, x1 and x2, which represent age categories (x1 is patients less than 16 while x2 is for patients older than 65). If I use the following Cox model, is there any way I can centre the variables? Do I have to do it before I fit them into the model and if so, how?
>
>   fit2=coxph(Surv(rem.Remtime,rem.Rcens)~x1(partial)+x2(partial),data=partial,method="breslow")
>
> Thank you,
> Laura

There is no need to center the variables. Also, you are going to find a huge lack of fit for the shape of the age effect you are using, as opposed to using a smooth function in continuous age. Your notation is treating x1 and x2 as functions of the data frame, which is strange. With data=partial you would ordinarily just have something like x1+x2 in the model.

Frank
--
Frank E Harrell Jr
Professor and Chair, School of Medicine
Department of Biostatistics
Vanderbilt University
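For a model without interaction terms, the remark that centring is unnecessary can be checked numerically: subtracting a constant from a covariate cancels out of the Cox partial likelihood, so the estimated coefficient should not move. The sketch below uses the lung data that ships with the survival package purely as stand-in data; it is not Laura's dataset, and the variable age_c is introduced here for illustration. It also shows the plain x1 + x2 style of formula, with covariates named directly and looked up in the data argument.

```r
library(survival)

# Stand-in data: lung is distributed with the survival package.
d <- lung
d$age_c <- d$age - mean(d$age)   # centred copy of age (hypothetical name)

# Covariates are named directly; coxph() finds them in `data`.
fit_raw <- coxph(Surv(time, status) ~ age,   data = d, method = "breslow")
fit_ctr <- coxph(Surv(time, status) ~ age_c, data = d, method = "breslow")

# Both fits report the same log hazard ratio per year of age.
c(raw = unname(coef(fit_raw)), centred = unname(coef(fit_ctr)))
```

When a product term is added, centring does change what the main-effect coefficients refer to, so this check speaks only to the main-effects case.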
Re: [R] same value in column-->delete
On Thu, Mar 26, 2009 at 12:15 PM, Duijvesteijn, Naomi wrote:
> Hi Readers,
>
> I have a question.
>
> I have a large dataset and want to throw away columns that have the same value in the column itself and I want to know which column this was.
>
> For example
>
>   > x<-data.frame(id=c(1,2,3), snp1=c("A","G","G"),snp2=c("G","G","G"),snp3=c("G","G","A"))
>   > x
>     id snp1 snp2 snp3
>   1  1    A    G    G
>   2  2    G    G    G
>   3  3    G    G    A
>
> Now I want to know that snp2 is monomorphic (the same value for the column) and after I know which column it is I want to take these columns out.
>
> Thanks,
> Naomi

Another, perhaps slightly more intuitive solution than Jim's would be the following:

  x<-data.frame(id=c(1,2,3), snp1=c("A","G","G"),snp2=c("G","G","G"),snp3=c("G","G","A"))
  is.monovalued<-function(df){
    sapply(df,function(x){
      length(unique(x))==1
    })
  }
  monovaluedCols<-is.monovalued(x)
  which(monovaluedCols)
  x[!monovaluedCols]

/Gustaf

--
Gustaf Rydevik, M.Sci.
tel: +46(0)703 051 451
address: Essingetorget 40, 112 66 Stockholm, SE
skype: gustaf_rydevik
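The approaches in this thread can be condensed into a variant that works column by column on the data frame (avoiding the matrix conversion that apply() performs) and keeps the result a data frame even if only one column survives. This is a sketch along the same lines, not a replacement for the posted answers; the names is_mono and x_poly are made up here.

```r
x <- data.frame(id   = c(1, 2, 3),
                snp1 = c("A", "G", "G"),
                snp2 = c("G", "G", "G"),
                snp3 = c("G", "G", "A"))

# TRUE for columns holding a single distinct value.
# vapply() guarantees one logical per column.
is_mono <- vapply(x, function(col) length(unique(col)) == 1, logical(1))

names(x)[is_mono]                      # which columns are monomorphic

# drop = FALSE keeps a data frame even when one column remains.
x_poly <- x[, !is_mono, drop = FALSE]
```

Note that unique() counts NA as a value of its own, so a column that is constant apart from missing entries would not be flagged; whether that is desirable depends on the data.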
Re: [R] Getting started,
Try this:

  library(zoo)
  library(chron)
  my.url <- "http://www.federalreserve.gov/releases/h15/data/Business_day/H15_ED_M1.txt"
  z <- read.zoo(my.url, skip = 8, header = TRUE, sep = ",", na.strings = "ND", strip.white = TRUE, FUN = as.chron)
  plot(z[1:50,])
  # or to plot with NAs interpolated
  plot(na.approx(z[1:50,]))

See the three zoo vignettes:

  vignette(package = "zoo")
  vignette("zoo") # etc.

and for more on dates see R News 4/1.

On Thu, Mar 26, 2009 at 6:58 AM, gug wrote:
> Hello,
>
> First of all, I'm new to R and I don't have anyone who already knows the language to ask for tips, so please excuse my ignorance.
>
> I'm trying to download data direct from the Federal Reserve statistics website and graph it, using the following:
>
>   #This downloads the data from the Fed website
>   download.file("http://www.federalreserve.gov/releases/h15/data/Business_day/H15_ED_M1.txt","Eurodollar_deposits1M.dat")
>
>   #This reads that data into a table
>   data_download=read.table("Eurodollar_deposits1M.dat",skip=8,sep=",",head=T)
>
>   #This is attempting to get R to read the date column, which is in "mm/dd/" format
>   #I have restricted it to the first 50 data points (to keep it simple)
>   date_vec = as.Date(data_download[1:50,1],"%m/%d/%y")
>
>   #This reads the values again for just the first 50 data points
>   eurodollar_vec = data_download[1:50,2]
>
>   #This plots the date and data values.
>   plot(date_vec,eurodollar_vec)
>
> A few issues with the output from this:
>
> 1) The date_vec data ends up looking like this "2019-01-04" - for some reason all year 2019, irrespective of the actual year (e.g. 1971, 1972). Any suggestions as to where I am going wrong?
>
> 2) The chart ends up as a scatter diagram, not surprisingly. I initially tried to use the ts.plot() function but when I did, I found that it doesn't use the date column in the data as the x axis. Is there a way of creating a time series that uses the date column, or alternatively, how would I turn these two columns (date & value) into a line graph?
>
> 3) The y-axis output should be from about 4.5 to about 8.5, but the y-axis shows as being about 430-670. There also seems to be a line of points across the top which are all of the same value - as though anything above about 670 gets charted as 670.
>
> I'm sorry for this being a "what am I doing wrong" post, but I have looked through the manuals without success, and I am new enough to R that I don't have a feel for ways round these sorts of problems. Any suggestions would be greatly appreciated. Thanks.
> --
> View this message in context: http://www.nabble.com/Getting-started%2C-tp22719735p22719735.html
> Sent from the R help mailing list archive at Nabble.com.
Re: [R] same value in column-->delete
Try this:

  > x
    id snp1 snp2 snp3
  1  1    A    G    G
  2  2    G    G    G
  3  3    G    G    A
  > str(x)
  'data.frame': 3 obs. of 4 variables:
   $ id  : num 1 2 3
   $ snp1: Factor w/ 2 levels "A","G": 1 2 2
   $ snp2: Factor w/ 1 level "G": 1 1 1
   $ snp3: Factor w/ 2 levels "A","G": 2 2 1
  > # test for which columns are the same
  > apply(x, 2, function(.col) all(head(.col, -1) == tail(.col, -1)))
     id  snp1  snp2  snp3
  FALSE FALSE  TRUE FALSE

On Thu, Mar 26, 2009 at 7:15 AM, Duijvesteijn, Naomi wrote:
> Hi Readers,
>
> I have a question.
>
> I have a large dataset and want to throw away columns that have the same value in the column itself and I want to know which column this was.
>
> For example
>
>   > x<-data.frame(id=c(1,2,3), snp1=c("A","G","G"),snp2=c("G","G","G"),snp3=c("G","G","A"))
>   > x
>     id snp1 snp2 snp3
>   1  1    A    G    G
>   2  2    G    G    G
>   3  3    G    G    A
>
> Now I want to know that snp2 is monomorphic (the same value for the column) and after I know which column it is I want to take these columns out.
>
> Thanks,
> Naomi

--
Jim Holtman
Cincinnati, OH
+1 513 646 9390

What is the problem that you are trying to solve?
Re: [R] Getting started,
In answer to your first question, your statement should be:

  date_vec = as.Date(data_download[1:50,1],"%m/%d/%Y")

Notice the capital "Y"; lower case says the year is only 2 digits, so you were picking up the '19' from the date.

Use 'plot(..., type='l')' for a line plot.

Your data is being read in as factors:

  > str(data_download)
  'data.frame': 9972 obs. of 2 variables:
   $ DATE: Factor w/ 9972 levels "01/01/1973","01/01/1974",..: 84 112 140 167 195 279 307 335 362 390 ...
   $ EDM1: Factor w/ 673 levels " 0.70"," 0.75",..: 599 597 593 594 591 586 583 571 574 586 ...

You need to convert EDM1 to numeric

  data_download$EDM1 <- as.numeric(as.character(data_download$EDM1))

and then your data will plot as you like.

On Thu, Mar 26, 2009 at 6:58 AM, gug wrote:
> Hello,
>
> First of all, I'm new to R and I don't have anyone who already knows the language to ask for tips, so please excuse my ignorance.
>
> I'm trying to download data direct from the Federal Reserve statistics website and graph it, using the following:
>
>   #This downloads the data from the Fed website
>   download.file("http://www.federalreserve.gov/releases/h15/data/Business_day/H15_ED_M1.txt","Eurodollar_deposits1M.dat")
>
>   #This reads that data into a table
>   data_download=read.table("Eurodollar_deposits1M.dat",skip=8,sep=",",head=T)
>
>   #This is attempting to get R to read the date column, which is in "mm/dd/" format
>   #I have restricted it to the first 50 data points (to keep it simple)
>   date_vec = as.Date(data_download[1:50,1],"%m/%d/%y")
>
>   #This reads the values again for just the first 50 data points
>   eurodollar_vec = data_download[1:50,2]
>
>   #This plots the date and data values.
>   plot(date_vec,eurodollar_vec)
>
> A few issues with the output from this:
>
> 1) The date_vec data ends up looking like this "2019-01-04" - for some reason all year 2019, irrespective of the actual year (e.g. 1971, 1972). Any suggestions as to where I am going wrong?
>
> 2) The chart ends up as a scatter diagram, not surprisingly. I initially tried to use the ts.plot() function but when I did, I found that it doesn't use the date column in the data as the x axis. Is there a way of creating a time series that uses the date column, or alternatively, how would I turn these two columns (date & value) into a line graph?
>
> 3) The y-axis output should be from about 4.5 to about 8.5, but the y-axis shows as being about 430-670. There also seems to be a line of points across the top which are all of the same value - as though anything above about 670 gets charted as 670.
>
> I'm sorry for this being a "what am I doing wrong" post, but I have looked through the manuals without success, and I am new enough to R that I don't have a feel for ways round these sorts of problems. Any suggestions would be greatly appreciated. Thanks.
> --
> View this message in context: http://www.nabble.com/Getting-started%2C-tp22719735p22719735.html
> Sent from the R help mailing list archive at Nabble.com.

--
Jim Holtman
Cincinnati, OH
+1 513 646 9390

What is the problem that you are trying to solve?
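The three fixes in this reply can be tried without downloading anything. The small data frame below is a made-up stand-in for the first rows of the file: the column names DATE and EDM1 follow the str() output in the reply, the "ND" marker is the missing-value code mentioned elsewhere in the thread, and the rate values are invented.

```r
# Hypothetical stand-in for the downloaded table, read in as factors.
raw <- data.frame(DATE = c("01/04/1971", "01/05/1971", "01/06/1971", "01/07/1971"),
                  EDM1 = c(" 5.50", " 5.56", "ND", " 5.63"),
                  stringsAsFactors = TRUE)

# %y consumes only two digits of the year ("19"), hence the year 2019.
wrong_date <- as.Date("01/04/1971", "%m/%d/%y")
# %Y reads the full four-digit year.
dates <- as.Date(as.character(raw$DATE), "%m/%d/%Y")

# as.numeric() on a factor returns the level codes, not the rates.
codes <- as.numeric(raw$EDM1)
# Going through character gives the values; "ND" becomes NA (with a warning).
rates <- suppressWarnings(as.numeric(as.character(raw$EDM1)))

# Line plot against the real dates; the NA leaves a gap in the line.
plot(dates, rates, type = "l")
```

The level codes explain the odd y-axis in the question: values in the hundreds are positions in the sorted list of distinct strings, not interest rates.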
[R] Centring variables in Cox Proportional Hazards Model
Dear All,

I am contemplating centering the covariates in my Cox model to reduce multicollinearity between the predictors and the interaction term and to render a more meaningful interpretation of the regression coefficient. Suppose I have two indicator variables, x1 and x2, which represent age categories (x1 is patients less than 16 while x2 is for patients older than 65). If I use the following Cox model, is there any way I can centre the variables? Do I have to do it before I fit them into the model and if so, how?

  fit2=coxph(Surv(rem.Remtime,rem.Rcens)~x1(partial)+x2(partial),data=partial,method="breslow")

Thank you,
Laura
[R] Using JRclient in java application
Hi,

I'm wondering if anyone can help me. I'm writing a Java application that uses JRclient and Rserve to communicate with R. I want to get the response from R for these commands:

  x <- rbind(matrix(rnorm(100, sd = 0.3), ncol = 2),matrix(rnorm(100, mean = 1, sd = 0.3), ncol = 2))
  kmeans(x, 2, 22, algorithm = "Hartigan-Wong")

and put the result of the kmeans() function in my TextArea. I've tried something like this:

  zz <- file("ex.txt", "w")
  capture.output(kmeans(x, 2, 22, algorithm = "Hartigan-Wong"), file = zz, append = TRUE)

and then I get the "ex.txt" file from R and rewrite it to my local file using this Java method:

  public void saveFileFromServer(Rconnection rConnection, String fileName, String pathToSave) throws Exception {
      RFileInputStream ris = rConnection.openFile(fileName);
      FileOutputStream outs = new FileOutputStream(pathToSave + "/" + new File(fileName).getName());
      byte[] buf = new byte[1024];
      int n = 0;
      while ((n = ris.read(buf)) != -1) {
          outs.write(buf, 0, n);
      }
      outs.flush();
      ris.close();
      outs.close();
  }

It works fine (I have the file "ex.txt" with results) when I run my application from Eclipse. The problems start when I put the application in a jar. In that case my "ex.txt" file is empty. Can anyone tell me what I'm doing wrong?

Thanks in advance
Tomek Spulak
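On the R side, two things may be worth trying; neither is confirmed as the cause of the empty file in the jar case. First, capture.output() called without a file argument returns the printed lines as a character vector, which can be collapsed into a single string, so no intermediate file is needed at all. Second, if a file is wanted, the connection zz in the post is never closed, and closing it makes sure buffered output is written out before the file is fetched. The sketch below shows both; set.seed() and the temporary file path are additions for illustration.

```r
set.seed(1)   # only to make the illustration repeatable
x <- rbind(matrix(rnorm(100, sd = 0.3), ncol = 2),
           matrix(rnorm(100, mean = 1, sd = 0.3), ncol = 2))
fit <- kmeans(x, 2, 22, algorithm = "Hartigan-Wong")

# Option 1: no file. The printed result becomes one string that the
# client could request as the value of a single evaluated expression.
out <- paste(capture.output(print(fit)), collapse = "\n")

# Option 2: write to a file, then close the connection so that
# everything is flushed to disk before the file is read.
path <- tempfile(fileext = ".txt")   # hypothetical location
zz <- file(path, "w")
capture.output(print(fit), file = zz)
close(zz)
```

If the first option works from the jar, the string can go straight into the TextArea and the file transfer method is no longer required.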
[R] same value in column-->delete
Hi Readers,

I have a question.

I have a large dataset and want to throw away columns that have the same value in the column itself and I want to know which column this was.

For example

  > x<-data.frame(id=c(1,2,3), snp1=c("A","G","G"),snp2=c("G","G","G"),snp3=c("G","G","A"))
  > x
    id snp1 snp2 snp3
  1  1    A    G    G
  2  2    G    G    G
  3  3    G    G    A

Now I want to know that snp2 is monomorphic (the same value for the column) and after I know which column it is I want to take these columns out.

Thanks,
Naomi
Re: [R] Manual sort in a for loop
Thanks all - I'm fairly new to R, so I was oblivious to the pros and cons of using a data frame as opposed to a list! The 'get' command also seemed to work successfully. Thanks again, Steve
[R] Warning messages from quantreg
Hi, I keep getting warning messages from quantreg about "tiny diagonals replaced with Inf when calling blkfct". Is there any cause for concern like improper codes, NAs in datasets or missing values ? Thanks Stanley
Re: [R] R: Fortran-90 and R
On Thu, 2009-03-26 at 11:01 +0100, mau...@alice.it wrote:
> I do hope the chapter about "Interfacing R and Fortran" is sufficient.
> I have a heavy load of work from two projects and no previous experience in
> either case (analysis packages, algorithms, and so on ...). I am a postdoc.

Ranjan Maitra posted this link last week:

http://www.stat.umn.edu/~charlie/rc/

I found this quite simple to follow, so it might be a useful companion to the Writing R Extensions manual already suggested. The link above doesn't specifically address Fortran 90, but if special handling of f90 code is required, the Writing R Extensions manual will tell you what needs to be done differently.

HTH

G

> Thank you.
> Maura
>
> -----Original Message-----
> From: Peter Dalgaard [mailto:p.dalga...@biostat.ku.dk]
> Sent: Thu 26/03/2009 8.57
> To: mau...@alice.it
> Cc: r-help@r-project.org
> Subject: Re: [R] Fortran-90 and R
>
> mau...@alice.it wrote:
> > I have some code in Fortran-90 that outperforms my R implementation of the
> > same algorithm (in terms of speed).
> > I wonder whether it is possible to interface R with Fortran-90, that is, I
> > would like to call a Fortran routine from my R
> > script, passing to the Fortran routine an array of real numbers, some scalar
> > real numbers, and an integer.
> > I have no idea how to do that because Fortran is a compiled language
> > whereas R is an interpreted language,
> > therefore I do not know whether it makes any sense to build a makefile for
> > the linker ... Is the linker necessary?
> >
> > Thank you very much.
> > Maura
>
> You need to read this:
>
> http://cran.r-project.org/doc/manuals/R-exts.pdf
>
> -p

--
Dr. Gavin Simpson             [t] +44 (0)20 7679 0522
ECRC, UCL Geography,          [f] +44 (0)20 7679 0565
Pearson Building,             [e] gavin.simpsonATNOSPAMucl.ac.uk
Gower Street, London          [w] http://www.ucl.ac.uk/~ucfagls/
UK. WC1E 6BT.                 [w] http://www.freshwaters.org.uk
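For readers who want to see the shape of the workflow before opening the manual, here is a minimal sketch of calling a Fortran 90 routine from R with .Fortran(). It is not taken from the thread: the subroutine scalevec and the temporary build directory are made up for illustration, and it assumes a Fortran compiler that R CMD SHLIB can find. R CMD SHLIB drives both the compiler and the linker, so no hand-written makefile should be needed for a case this small.

```r
# Sketch only: assumes a Fortran compiler that `R CMD SHLIB` can find.
build_dir <- tempfile("f90demo")
dir.create(build_dir)
old_wd <- setwd(build_dir)

# A tiny, hypothetical Fortran 90 subroutine that scales a real array in place.
writeLines(c(
  "subroutine scalevec(x, n, a)",
  "  implicit none",
  "  integer, intent(in) :: n",
  "  double precision, intent(inout) :: x(n)",
  "  double precision, intent(in) :: a",
  "  x = a * x",
  "end subroutine scalevec"
), "scalevec.f90")

# Compile and link into a shared library.
status <- system2(file.path(R.home("bin"), "R"), c("CMD", "SHLIB", "scalevec.f90"))
stopifnot(status == 0)

# Load the shared library into the running R session.
lib <- file.path(build_dir, paste0("scalevec", .Platform$dynlib.ext))
dyn.load(lib)
setwd(old_wd)

# .Fortran() passes arguments by reference, so coerce the types explicitly.
x <- c(1, 2, 3)
res <- .Fortran("scalevec",
                x = as.double(x),
                n = as.integer(length(x)),
                a = as.double(2))
res$x   # the scaled array as returned by the Fortran routine
```

The return value of .Fortran() is a list of the (possibly modified) arguments, which is why the result is read from res$x rather than from x itself.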
[R] Getting started,
Hello,

First of all, I'm new to R and I don't have anyone who already knows the language to ask for tips, so please excuse my ignorance.

I'm trying to download data direct from the Federal Reserve statistics website and graph it, using the following:

#This downloads the data from the Fed website
download.file("http://www.federalreserve.gov/releases/h15/data/Business_day/H15_ED_M1.txt","Eurodollar_deposits1M.dat")
#This reads that data into a table
data_download=read.table("Eurodollar_deposits1M.dat",skip=8,sep=",",head=T)
#This is attempting to get R to read the date column, which is in "mm/dd/" format
#I have restricted it to the first 50 data points (to keep it simple)
date_vec = as.Date(data_download[1:50,1],"%m/%d/%y")
#This reads the values again for just the first 50 data points
eurodollar_vec = data_download[1:50,2]
#This plots the date and data values.
plot(date_vec,eurodollar_vec)

A few issues with the output from this:

1) The date_vec data ends up looking like this "2019-01-04" - for some reason every date falls in 2019, irrespective of the actual year (e.g. 1971, 1972). Any suggestions as to where I am going wrong?

2) The chart ends up as a scatter diagram, not surprisingly. I initially tried to use the ts.plot() function, but when I did, I found that it doesn't use the date column in the data as the x axis. Is there a way of creating a time series that uses the date column, or alternatively, how would I turn these two columns (date & value) into a line graph?

3) The y-axis output should be from about 4.5 to about 8.5, but the y-axis shows as being about 430-670. There also seems to be a line of points across the top which are all of the same value - as though anything above about 670 gets charted as 670.

I'm sorry for this being a "what am I doing wrong" post, but I have looked through the manuals without success, and I am new enough to R that I don't have a feel for ways round these sorts of problems. Any suggestions would be greatly appreciated.

Thanks.
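The three symptoms above are consistent with two common causes, sketched below on made-up data rather than the real download. The assumptions are that the file stores four-digit years and that the value column contains some non-numeric marker (written here as "ND", a guess) which made read.table() store the column as a factor. With "%y", as.Date() reads only the first two digits of the year ("19", interpreted as 2019); "%Y" reads all four. Plotting a factor plots its integer level codes, which would explain a y-axis in the hundreds.

```r
# Made-up rows in the same shape as the downloaded table (values are illustrative only).
raw <- data.frame(
  date  = c("01/04/1971", "01/05/1971", "01/06/1971", "01/07/1971"),
  value = c("6.25", "ND", "6.13", "6.00"),   # "ND" is an assumed missing-data marker
  stringsAsFactors = TRUE                     # mimics the factor columns the poster likely had
)

# %Y (capital) parses a four-digit year; %y would stop after two digits.
date_vec <- as.Date(as.character(raw$date), format = "%m/%d/%Y")

# Convert factor -> character -> numeric; non-numeric entries become NA.
value_vec <- suppressWarnings(as.numeric(as.character(raw$value)))

# type = "l" joins the points into a line; NA values leave a gap.
plot(date_vec, value_vec, type = "l", xlab = "Date", ylab = "Rate")
```

If the missing-data marker is known, passing it to read.table() through na.strings is a tidier route than converting afterwards.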
Re: [R] Density estimation: scale back for calendar time
Thanks, David, for your response. I had done that. The problem is not with the computations, but with the interpretation.

Assume that the x ordinates are 'time' [1:3472], and y are the associated parameter values. Now density gives me 512/1024/2048 x-ordinates, of which some (7 to be precise) are negative. What do I make of the non-zero probability at **negative** times? If you look at your rnorm case, the first few items are negative. If the x items are bounded non-negative, what would density estimates at negative x-points mean?

I can't do a linear scaling because that will disturb the density estimates on different days.

Regards,
pradeep

On Wed, Mar 25, 2009 at 11:39 PM, David Winsemius wrote:

> I am afraid your notion of a "concrete idea" is less concrete than what I
> would need to understand what you are requesting. Your first lines of
> example code should be:
>
> library()
>
> density(x, )
>
> ... since stats::density() by default would return 512 y estimates, even
> if the length of x were longer.
>
> > x <- rnorm(3471)
> > plot(density(x))
> > str(density(x))
> List of 7
>  $ x: num [1:512] -3.98 -3.96 -3.94 -3.93 -3.91 ...
>  $ y: num [1:512] 7.98e-06
>
> --
> David Winsemius
>
>
> On Mar 25, 2009, at 9:30 AM, Pradeep Raje wrote:
>
>> Dear all: Request your indulgence. The econophysics gurus do this stuff all
>> the time: all their PDFs are smooth, with neat log x axis.
>> 1. The kernel density estimate (KDE) function returns the empirical
>> probability density at 2^n points (min: 512). The big question is how do I
>> scale back the x-values (say, density$x) to x-values in terms of the
>> original dataset?
>> 2. To give you a concrete idea, I have a dataset of 3471 obs (x=date
>> index,
>> y=parameter values). Now the density estimate d<-density(x) gives me 2048
>> x-values. When I plot the PDF, the x axis is obviously d$x, length=2048.
>> 3. How can I scale back these 2048 values to get a sense of calendar time
>> (original date index)?
>> 4. Subsidiary question is: how do I bring in the remaining values
>> (3471-2048)?
>>
>
> You seem to have the idea that the original data is "lined up" with the
> density estimates. That is not so.
>
>
>> Thanks very much in advance.
>> pradeep
>>
>
> David Winsemius, MD
> Heritage Laboratories
> West Hartford, CT
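A small sketch may help with the two points under discussion. By default density() evaluates the estimate on an evenly spaced grid that extends cut * bw (cut = 3) beyond the range of the data, which is where the negative x-values come from; the grid is in the same units as the input, not a re-indexing of the observations. The from and to arguments restrict the grid. The calendar below (dates starting 2000-01-01) is invented for illustration, and x is the plain date index 1:3471 as described in the post.

```r
n_obs <- 3471
x <- seq_len(n_obs)                                   # date index, as in the post
dates <- seq(as.Date("2000-01-01"), by = "day",
             length.out = n_obs)                      # hypothetical calendar

# Default grid: extends cut * bw below min(x) and above max(x).
d_default <- density(x)
range(d_default$x)

# Restrict the evaluation grid to the observed range.
d_bounded <- density(x, from = 1, to = n_obs, n = 512)

# d_bounded$x is on the date-index scale, so it can be mapped back to dates.
grid_dates <- dates[round(d_bounded$x)]
head(data.frame(date = grid_dates, density = d_bounded$y))
```

Note that from and to only change where the estimate is evaluated. They do not renormalise the density or correct the bias near a hard boundary, so the curve over the restricted grid need not integrate to 1.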
[R] Sort by timestamp
#Good morning all. I have been using R for a short time to analyse time series and I have the following problem:
#I have a bunch of time series:
#First of all I import them from a txt file
data.input01 <-read.csv("./LD/20081030.txt", header = TRUE, sep = ";", quote="\"", dec=",", fill = TRUE, comment.char="")
data.input02 <-read.csv("./LD/20090305.txt", header = TRUE, sep = ";", quote="\"", dec=",", fill = TRUE, comment.char="")
data.input03 <-read.csv("./LD/20081114.txt", header = TRUE, sep = ";", quote="\"", dec=",", fill = TRUE, comment.char="")
data.input04 <-read.csv("./LD/20081201.txt", header = TRUE, sep = ";", quote="\"", dec=",", fill = TRUE, comment.char="")
data.input05 <-read.csv("./LD/20081219.txt", header = TRUE, sep = ";", quote="\"", dec=",", fill = TRUE, comment.char="")
data.input06 <-read.csv("./LD/20090107.txt", header = TRUE, sep = ";", quote="\"", dec=",", fill = TRUE, comment.char="")

#After the import they look like this:
                   V1       V2
1 2008-10-14 08:45:00 92130.68
2 2008-10-14 08:50:00 94051.70
3 2008-10-14 08:55:00 97050.85
4 2008-10-14 09:00:00 81133.81
5 2008-10-14 09:05:00 70705.40
6 2008-10-14 09:10:00 75213.92
7 2008-10-14 09:15:00 90876.14
8 2008-10-14 09:20:00 85995.17

#The next steps are to combine them with rbind and remove duplicates
data.troughput01 <- rbind(data.input03,data.input01,data.input04,data.input02,data.input05,data.input06)
data.troughput02 <- unique(data.troughput01)

#The problem is that the dates are mixed and I want to sort/order them by date and time.
#The class of the date/time column is as follows:
class(data.input01$V1)
[1] "factor"

#I've already tried sort and order but it didn't work
#Are there any suggestions for how I can solve this issue?

Thanks in advance
Johannes
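One way to approach this, sketched on a few made-up rows in the same layout: convert the factor column to a real date-time class first, then use order() on it to index the rows. The time zone "UTC" is an assumption made only to keep the example reproducible; the real data may need a different one.

```r
# A few made-up rows, deliberately out of order and with one duplicate.
df <- data.frame(
  V1 = factor(c("2008-10-14 09:00:00", "2008-10-14 08:45:00",
                "2008-10-14 08:50:00", "2008-10-14 08:45:00")),
  V2 = c(81133.81, 92130.68, 94051.70, 92130.68)
)

# Drop exact duplicate rows, as in the original post.
df <- unique(df)

# Factor -> character -> POSIXct, so that comparisons are chronological.
df$V1 <- as.POSIXct(as.character(df$V1), format = "%Y-%m-%d %H:%M:%S", tz = "UTC")

# order() returns the row permutation; use it to index the data frame.
df_sorted <- df[order(df$V1), ]
df_sorted
```

sort() on its own only sorts a single vector, which is why it does not carry the V2 column along; indexing the whole data frame with order() keeps the rows together.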
Re: [R] loop problem
Muhammad Azam wrote:
> Dear R members
> I have a problem regarding storing the lists.
> Let L=number of distinct values of any predictor (say L=5)
> P=number of predictors (say P=20)
>
> g1 <- c()
> for(i in 1:P){
>   if(L > 1){
>     for(j in 1:(L-1)){
>       g <- g1[j] <- g
>     }
>   }
>   g2[]=sort.list(g1)
> }
>
> Now the question is: What should we use inside brackets of g2[], whether "i"
> or some thing else? If L is not greater than 1 then there will be a "NULL"
> for g2. We don't want to store it in g2, so how can we handle this problem.
> Looking forward for some help. Thanks and

Hi Muhammad,
The first thing I would ask is "Why not store NULL in g2[[i]] if that element is NULL?". If you do this:

for(i in 1:P) {
 if(L > 1) {
  g1<-rep(NA,L)
  for(j in 1:(L-1)) g1[j]<-...
  g2[[i]]<-sort.list(g1)
 }
}

you will have fewer elements in g2 than you have sets of predictors and will then have to match up the sets of predictors with the elements in g2. I think you could just test for NULL

is.null(g2[[i]])

later on if you don't want to process these in a subsequent step.

Jim
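A runnable sketch of the suggestion above, under some assumptions that are not in the thread: L is taken to vary by predictor (a vector here), and runif() stands in for whatever the real per-value computation is. Preallocating g2 with vector("list", P) gives one slot per predictor, each starting as NULL, so skipped predictors keep their position and can be picked out later with is.null().

```r
set.seed(42)
P <- 4
L <- c(5, 1, 3, 1)          # hypothetical: distinct values per predictor

g2 <- vector("list", P)     # one slot per predictor, all NULL to start with

for (i in seq_len(P)) {
  if (L[i] > 1) {
    g1 <- numeric(L[i] - 1)
    for (j in seq_len(L[i] - 1)) {
      g1[j] <- runif(1)     # placeholder for the real computation
    }
    g2[[i]] <- sort.list(g1)
  }
  # when L[i] <= 1 nothing is assigned, so g2[[i]] stays NULL
}

# Identify the predictors that actually produced a result.
has_result <- !vapply(g2, is.null, logical(1))
which(has_result)
```

One thing to watch: inside the loop, writing g2[[i]] <- NULL would delete the element and shift the later ones, so leaving the slot untouched (or using g2[i] <- list(NULL)) is the safer way to keep a placeholder.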
Re: [R] Manual sort in a for loop
I would suggest avoiding the function ?assign inside a loop. I used to use this until someone here kindly pointed out that it was much easier to catch the data of interest in a list, e.g.

df.list <- vector("list", 10)
for (i in 1:10) {
  table_year <- 1950 + i
  # look up the table named arunoff_<table_year>_temp and reorder its rows
  df.list[[i]] <- get(paste("arunoff_", table_year, "_temp", sep=""))[c(10,7,9,5,4,12,1,3,2,8,11,6),]
}

HTH
Simon.

----- Original Message -----
From: "Steve Murray"
To:
Sent: Wednesday, March 25, 2009 5:58 PM
Subject: [R] Manual sort in a for loop

Dear all,

I am trying to manually re-sort rows in a number of tables. The rows aren't sorted on any particular values but are simply ordered by user choice (as shown by the row numbers in the code). I have been able to carry out each re-arrangement without the use of the 'for' loop, but cannot seem to successfully execute the statements when incorporated into the loop.

The code I have is as follows:

table_year=1951
for (i in (paste("arunoff_",year,"_temp",sep=""))) {
assign(paste("arunoff_",table_year, sep=""),paste("arunoff_",table_year,"_temp")[c(10,7,9,5,4,12,1,3,2,8,11,6),])
table_year = table_year+1
}

The error I get is:

Error in paste("arunoff_", table_year, "_temp")[c(10, 7, 9, 5, 4, 12, :
  incorrect number of dimensions

...despite this not occurring when I do each table individually (so it can't be a case of there not being enough rows, as

> dim(arunoff_1951_temp)

gives

[1] 12 11)

I have a feeling that it may be a syntax error, possibly between 'temp' and the square bracket, but I can't be sure of this.

Any solutions or advice offered would be gratefully received.

Many thanks,

Steve
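To make the cause of the error concrete: paste() returns a character string (the name of the table), not the table itself, and a string has no rows or columns, hence "incorrect number of dimensions". Without sep="" the pasted name also contains spaces. The sketch below uses get() to fetch each table by name and stores the reordered copies in a list; the two small tables and the year range are invented stand-ins for the real arunoff_*_temp objects.

```r
rows <- c(10, 7, 9, 5, 4, 12, 1, 3, 2, 8, 11, 6)

# Invented stand-ins for the real tables (12 rows each).
arunoff_1951_temp <- data.frame(month = 1:12, runoff = 101:112)
arunoff_1952_temp <- data.frame(month = 1:12, runoff = 201:212)
years <- 1951:1952

# paste() gives a name, not a data frame; note the spaces from the default sep.
paste("arunoff_", 1951, "_temp")

arunoff <- list()
for (yr in years) {
  tab <- get(paste0("arunoff_", yr, "_temp"))   # look the table up by name
  arunoff[[as.character(yr)]] <- tab[rows, ]    # reorder rows, keep all columns
}

arunoff[["1951"]]$month
```

Keeping the results in a named list means later steps can loop over arunoff directly instead of rebuilding object names with paste().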