[R] The KJV

2010-02-05 Thread Charlotte Maia
Hey all, Does anyone know if there are any R packages with a copy of the KJV? I'm guessing the answer is no... So the next question, and the more important one is: Does anyone think it would be useful (e.g. for text-mining purposes)? I know almost nothing about theology, so I'm not sure what kind

Re: [R] duplicating records

2010-02-05 Thread Jim Lemon
On 02/06/2010 12:53 PM, El-Tahtawy, Ahmed wrote: Dear friends, I need to fill in (duplicate the whole record) the missing days with the same record values as long as AE is the same value (i.e. "1"), once AE value changes, the process of duplication should proceed with the new AE value till it

[R] embedFonts with pdf files and Windows 7

2010-02-05 Thread James M. Curran
I am trying to embed fonts in my PDF images so that they are embedded for the publisher of my book. I am running: Windows 7 - 64 Enterprise R 2.10.1 Ghostscript 8.70 Ghostview 4.9 MiKTeX 2.8 I have this tiny test script: pdf("test.pdf") plot(matrix(rnorm(200),nc=2)) graphics.off() myCall = e

Re: [R] question about bigmemory: releasing RAM from a big.matrix that isn't used anymore

2010-02-05 Thread Steve Lianoglou
Hi, On Fri, Feb 5, 2010 at 9:24 PM, Matthew Keller wrote: > Hi all, > > I'm on a Linux server with 48Gb RAM. I did the following: > > x <- > big.matrix(nrow=2,ncol=50,type='short',init=0,dimnames=list(1:2,1:50)) > #Gets around the 2^31 issue - yeah! > > in Unix, when I hit the "t

Re: [R] number of Excel worksheets

2010-02-05 Thread Kevin Wright
Sorry, I thought the code was clear, but probably not. As far as I know, the "gdata" package requires perl. My read.xls function requires RODBC. If you don't have perl, my function works well, but if you have perl, the gdata package can do a better job of reading mixed-type (character/numeric)

[R] question about bigmemory: releasing RAM from a big.matrix that isn't used anymore

2010-02-05 Thread Matthew Keller
Hi all, I'm on a Linux server with 48Gb RAM. I did the following: x <- big.matrix(nrow=2,ncol=50,type='short',init=0,dimnames=list(1:2,1:50)) #Gets around the 2^31 issue - yeah! in Unix, when I hit the "top" command, I see R is taking up about 18Gb RAM, even though the object x

Re: [R] Reading large files

2010-02-05 Thread Gabor Grothendieck
Note that the filter= argument on read.csv.sql can be used to pass the input through a filter written in perl, [g]awk or other language. For example: read.csv.sql(..., filter = "gawk -f myfilter.awk") gawk has the FIELDWIDTHS variable for automatically parsing fixed width fields, e.g. http://www.d

[R] embedFonts with pdf files and Windows 7

2010-02-05 Thread James M. Curran
I am trying to embed fonts in my PDF images so that they are embedded for the publisher of my book. I am running: Windows 7 - 64 Enterprise R 2.10.1 Ghostscript 8.70 Ghostview 4.9 MiKTeX 2.8 I have this tiny test script: pdf("test.pdf") plot(matrix(rnorm(200),nc=2)) graphics.off() myCall = e

[R] duplicating records

2010-02-05 Thread El-Tahtawy, Ahmed
Dear friends, I need to fill in (duplicate the whole record) the missing days with the same record values as long as AE is the same value (i.e. "1"), once AE value changes, the process of duplication should proceed with the new AE value till it changes again. e.g. I need to fill in records: day

Re: [R] Reading large files

2010-02-05 Thread Vadlamani, Satish {FLNA}
Hi Gabor: Thanks. My files are all in fixed width format. They are a lot of them. It would take me some effort to convert them to CSV. I guess this cannot be avoided? I can write some Perl scripts to convert fixed width format to CSV format and then start with your suggestion. Could you let me k

[R] parallel xyplot, different y scales, conditioned

2010-02-05 Thread Jacob Wegelin
The syntax below creates parallel time-series plots of three different y variables conditioned by a dichotomous factor. (Thanks to several people who answered an earlier post from Thursday, http://tolstoy.newcastle.edu.au/R/e9/help/10/02/3848.html. The syntax below is based on their helpful

[R] Multinomial glm with nested and random Effects

2010-02-05 Thread hugo Mathe
Hi, I'd like to do a multinomial glm with nested and random Effects The "multinom" function from (nnet) give me this: > multinom(mc$c ~ (1|mc$date)+mc$lma + mc$poid+(mc$pop) +mc$male %in% > mc$pop,family="multinomial" ) # weights: 60 (44 variable)

Re: [R] Incompatible types error

2010-02-05 Thread Don MacQueen
There appear to be quite a few problems. The first and most glaring to my eyes is that you initialize four vectors, Et, fx, Tx, and Fitness inside the loop. Since you set their lengths to 1000, one might assume that the 1000 values are to be calculated as the loop index, i, goes from 1 to 1000

Re: [R] reading csv files

2010-02-05 Thread Jim Lemon
On 02/06/2010 09:05 AM, analys...@hotmail.com wrote: On Feb 5, 8:57 am, Barry Rowlingson wrote: On Fri, Feb 5, 2010 at 10:23 AM, analys...@hotmail.com wrote: the csv files are downloaded from a database and it looks like some character fields contain the CR-LF sequence within them. This

Re: [R] Strange "rownames"

2010-02-05 Thread Peter Ehlers
Zhang Jian wrote: I find one row in my large dataset. But when I use the "rownames" for the data on the 100,000 row, the result show nothing. I try it by the following example, it still likes that. tst[rownames(tst)==10,] [1] x y <0 rows> (or 0-length row.names) ## tst=data.

[R] Append multiple optim result with a loop (or apply?)

2010-02-05 Thread Guillaume Théroux Rancourt
Hello R-helpers, Thanks to Ravi Varadhan, I have improve the function I am working on top optimize two equations. Now, my objective is to do a series of optimization from a data table, where each row is one data serie (i.e data from one plant) to be optimized. The function below works up to th

Re: [R] reading csv files

2010-02-05 Thread analys...@hotmail.com
On Feb 5, 8:57 am, Barry Rowlingson wrote: > On Fri, Feb 5, 2010 at 10:23 AM, analys...@hotmail.com > > wrote: > > the csv files are downloaded from a database and it looks like some > > character fields contain the CR-LF sequence within them. > > > This causes R to see a new record/row and the

[R] Strange "rownames"

2010-02-05 Thread Zhang Jian
I find one row in my large dataset. But when I use the "rownames" for the data on the 100,000 row, the result show nothing. I try it by the following example, it still likes that. > tst[rownames(tst)==10,] [1] x y <0 rows> (or 0-length row.names) ## > tst=data.frame(x=1:20,y=

Re: [R] maximum elements in an ff object?

2010-02-05 Thread Benilton Carvalho
Oh... I forgot to mention that you can also use the ffdf() approach. For example, if you're able to load one column at a time, you could do something like: res = vector("list", nColumns) for (i in 1:nColumns) res[[i]] = ff(, ...) finalFfObj = do.call(ffdf, res) Of course you can use other aproa

Re: [R] maximum elements in an ff object?

2010-02-05 Thread Benilton Carvalho
Hi Matt, you're correct: length(ffObject) must be smaller than 2^31-1... at least until R has a 64bit integer type, it seems... in the meantime, use the bigmemory package. ;-) b On Fri, Feb 5, 2010 at 4:18 PM, Matthew Keller wrote: > Hello all, > > I hate to add to the daily queries regarding

Re: [R] ecdf error

2010-02-05 Thread Frank E Harrell Jr
El-Tahtawy, Ahmed wrote: Dear friends, Ecdf used to work nicely before. e.g. ecdf(df1v[,c(1:6)],group=df1v$trt,q=.5,col=1:2,label.curves=list(keys="l ines"), datadensity="rug") but now it does not work at all, even for a simpler code??? By the way, I used the same libraries.

[R] ecdf error

2010-02-05 Thread El-Tahtawy, Ahmed
Dear friends, Ecdf used to work nicely before. e.g. ecdf(df1v[,c(1:6)],group=df1v$trt,q=.5,col=1:2,label.curves=list(keys="l ines"), datadensity="rug") but now it does not work at all, even for a simpler code??? By the way, I used the same libraries. Thanks for your help. Ahmed

Re: [R] Reading large files

2010-02-05 Thread Gabor Grothendieck
If your problem is just how long it takes to load the file into R try read.csv.sql in the sqldf package. A single read.csv.sql call can create an SQLite database and table layout for you, read the file into the database (without going through R so R can't slow this down), extract all or a portion

Re: [R] Spider Plot color problem

2010-02-05 Thread Peter Ehlers
Ioan, The beauty of R is that you get the code and you can make (often simple) changes to suit your needs. Attached is stars1.R which makes a few minor changes to stars() to let you produce starplots with or without the polygon and with arbitrary colours for the radii. (This might already exist i

Re: [R] Spider Plot color problem

2010-02-05 Thread Jim Lemon
On 02/06/2010 02:21 AM, IoanLoft wrote: Hi all, I have encountered a problem which appears to have defeated my (admittedly nascent) R skills. I want to draw a spider plot with many cases (just over 300). I am primarily interested in the difference between 4 categories of cases, and want to disp

Re: [R] Random number quality

2010-02-05 Thread Daniel Nordlund
> -Original Message- > From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] > On Behalf Of b k > Sent: Friday, February 05, 2010 2:04 PM > To: r-help@r-project.org > Subject: [R] Random number quality > > Hello, > > I'm running R 2.10.1 on Windows Vista. I'm selectin

[R] metafor package: effect sizes are not fully independent

2010-02-05 Thread Gang Chen
In a classical meta analysis model y_i = X_i * beta_i + e_i, data {y_i} are assumed to be independent effect sizes. However, I'm encountering the following two scenarios: (1) Each source has multiple effect sizes, thus {y_i} are not fully independent with each other. (2) Each source has multiple e

Re: [R] reading csv files

2010-02-05 Thread Jim Lemon
On 02/06/2010 12:57 AM, Barry Rowlingson wrote: On Fri, Feb 5, 2010 at 10:23 AM, analys...@hotmail.com wrote: the csv files are downloaded from a database and it looks like some character fields contain the CR-LF sequence within them. This causes R to see a new record/row and the number of ro

[R] Incompatible types error

2010-02-05 Thread apellegrini001
I'm trying to write a loop for a series of nested functions. and I get an incompatible types error when trying to run it. It's supposed to be a simulation (1000 iterations) with a random value for "rand" being chosen each time. After each rand value is chosen, the rest of the functions are evaluat

Re: [R] multicore children not quiting

2010-02-05 Thread Simon Urbanek
On Feb 5, 2010, at 10:04 AM, Markus Loecher wrote: > Hi Simon, > I recently switched to a different flavor of Linux and find that the children > that are spawn off by e.g. mclapply() seem to persist forever until I kill > them explicitly. > Is there anything I can do ? > I can reproduce the i

[R] Random number quality

2010-02-05 Thread b k
Hello, I'm running R 2.10.1 on Windows Vista. I'm selecting a random sample of several hundred items out of a larger population of several thousand. I realize there is srswor() in package sampling for exactly this purpose, but as far as I can tell it uses the native PRNG which may or may not be ra

Re: [R] a function

2010-02-05 Thread David Winsemius
On Feb 5, 2010, at 3:33 PM, li li wrote: Dear all, I need to write the following function in R. Can anyone give me some hint? Thank you! Please see the attachment for the function. Hannah __ A three argument function with two instances of sap

Re: [R] t.test

2010-02-05 Thread Peter Dalgaard
Erik Iverson wrote: See the identical example in ?t.test I.e., t.test(Score~Group, data=mydata) parallel to the sleep data example. Actually, that example is unfortunate, because the sleep data are really paired, not independent groups. -- O__ Peter Dalgaard Øster Fa

[R] a function

2010-02-05 Thread li li
Dear all, I need to write the following function in R. Can anyone give me some hint? Thank you! Please see the attachment for the function. Hannah question to R-heip.pdf Description: Adobe PDF document __ R-help@r-project.org mailing list https

Re: [R] (Another) Bates fortune?

2010-02-05 Thread Peter Dalgaard
Peter Ehlers wrote: I vote to 'fortunize' Doug Bates on Hierarchical data sets: which software to use? "The widespread use of spreadsheets or SPSS data sets or SAS data sets which encourage the "single table with a gargantuan number of columns, most of which are missing data in most cases" app

Re: [R] Filling a logical matrices with values

2010-02-05 Thread Paul-
If I understand your question, this may work for you: dat <- matrix(as.logical(sample(T:F, 30, T)),5,6) colnames(dat) <- letters[1:6] rownames(dat) <- paste(letters[1:5],1:5, sep="") dat1 <- matrix(NA,5,6) colnames(dat1) <- colnames(dat) rownames(dat1) <- rownames(dat) dat1[dat] <- unlist(sapply

Re: [R] Filling a logical matrices with values

2010-02-05 Thread Paul-
Sorry, I noticed my previous code does not work if a column has all NAs. Try this instead: dat <- matrix(as.logical(sample(T:F, 30, T)),5,6) colnames(dat) <- letters[1:6] rownames(dat) <- paste(letters[1:5],1:5, sep="") dat1 <- matrix(NA,5,6) colnames(dat1) <- colnames(dat) rownames(dat1) <- row

Re: [R] sum a particular column by group

2010-02-05 Thread Fang (Betty) Yang
Thanks for your help. Finally, I got it. From: Dennis Murphy [mailto:djmu...@gmail.com] Sent: Friday, February 05, 2010 12:20 PM To: Fang (Betty) Yang Cc: r-help@r-project.org Subject: Re: [R] sum a particular column by group Hi: This is not an elegant solution by any means, but it gets w

Re: [R] ggplot stacked bar chart help

2010-02-05 Thread Ryan Garner
Almost there! So my previous data example was just a small subset of my true data. I have all the months from 2007-2009 in order. So when I create the barplot following your script, it bins all 12 months for each year -- a stacked barplot with 3 bars (2007,2008,2009), which I don't want. I want e

Re: [R] Reading large files

2010-02-05 Thread jim holtman
What you need to do is to take a smaller sample of you data (e.g. 50-100MB) and load that data and determine how big the resulting object is. Depends a lot on how you are loading it. Are you using 'scan' or 'read.table'; if 'read.table' have you define the class of the columns? I typically read i

Re: [R] t.test

2010-02-05 Thread Dennis Murphy
That typically assumes equal population variances, and not everybody wants that in a two-sample test. t.test() provides more options with less work. Dennis On Fri, Feb 5, 2010 at 9:38 AM, Jacobs wrote: > > I would use aov( ) > > mod<-aov(Score~Group, data=x) > summary(mod) > > JJ > -- > View th

Re: [R] ggplot2 / time series with different scales

2010-02-05 Thread Chuck White
Thank you both for your response. As the names suggest, I am plotting the sales & price data for items over time to understand the how certain items may be more responsive than others to price changes. Another way of displaying this information on the same chart as the one showing sales, would

Re: [R] Importing data coming from Splus into R.

2010-02-05 Thread William Dunlap
> -Original Message- > From: gerald.j...@dgag.ca [mailto:gerald.j...@dgag.ca] > Sent: Friday, February 05, 2010 10:58 AM > To: William Dunlap > Cc: Uwe Ligges; r-help@r-project.org > Subject: RE: [R] Importing data coming from Splus into R. > > Hello Bill, > > here is what I tried with

Re: [R] ggplot stacked bar chart help

2010-02-05 Thread Dennis Murphy
Hi: Is this what you were expecting? Calling your data set below 'data', # order months chronologically data$month <- ordered(data$month, levels = c('SEP09', 'OCT09', 'DEC09')) # Plot it... p <- ggplot(data, aes(x = month, fill = type)) p + geom_bar(aes(weight = volume), position = 'stack') HTH,

Re: [R] Reading large files

2010-02-05 Thread Charlie Sharpsteen
On Thu, Feb 4, 2010 at 5:27 PM, Vadlamani, Satish {FLNA} wrote: > Folks: > I am trying to read in a large file. Definition of large is: > Number of lines: 333, 250 > Size: 850 MB Perhaps this post by JD Long will provide an example that is suitable to your situation: http://www.cerebralmasticat

Re: [R] Reading large files

2010-02-05 Thread Matthew Dowle
I can't help you further than whats already been posted to you. Maybe someone else can. Best of luck. "Satish Vadlamani" wrote in message news:1265397089104-1470667.p...@n4.nabble.com... > > Matthew: > If it is going to help, here is the explanation. I have an end state in > mind. It is given b

Re: [R] sum a particular column by group

2010-02-05 Thread Dennis Murphy
Hi: This is not an elegant solution by any means, but it gets what you want...using the data frame from your bootstrap sample, # All combinations of the three factors xx <- with(beds, expand.grid(Region = levels(Region), Gender = levels(Gender), Agegr = levels(Agegr)) ) > dim(xx) [

Re: [R] Reading large files

2010-02-05 Thread Satish Vadlamani
Matthew: If it is going to help, here is the explanation. I have an end state in mind. It is given below under "End State" header. In order to get there, I need to start somewhere right? I started with a 850 MB file and could not load in what I think is reasonable time (I waited for an hour). The

Re: [R] Importing data coming from Splus into R.

2010-02-05 Thread gerald . jean
Hello Bill, here is what I tried with the Splus built-in data set "claims". In Splus: apply(claims, 2, class) age car.age type costnumber "ordered" "ordered" "factor" "numeric" "numeric" dump(list = "claims", fileout = "/home/jeg002/splus/R/Exemples/R/myclaims.txt",

Re: [R] Using coxph with Gompertz-distributed survival data.

2010-02-05 Thread Göran Broström
On Fri, Feb 5, 2010 at 5:48 PM, Terry Therneau wrote: > Before being helpful let me raise a couple of questions: > >  1. "I know I'm looking at longevity data (which is believed to have a > Gompertz distribution for mammals dying from 'old age')". >   I'm not as convinced.  The Gompertz is a nice

[R] ggplot stacked bar chart help

2010-02-05 Thread Ryan Garner
I'm trying to create a stacked bar chart with x=month, y=volume, and factor=type. volume type month 100A SEP09 200A OCT09 300A DEC09 400B SEP09 500B OCT09 600B DEC09 700C SEP09 800C

Re: [R] t.test

2010-02-05 Thread Jacobs
I would use aov( ) mod<-aov(Score~Group, data=x) summary(mod) JJ -- View this message in context: http://n4.nabble.com/t-test-tp1470539p1470558.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https:/

Re: [R] glm models with more than one response

2010-02-05 Thread Dennis Murphy
Hi Steve: On Fri, Feb 5, 2010 at 10:10 AM, wrote: > > Hi everyone, > > I am trying to construct a glm and am running into a couple of questions. > > The data set I am using consists of 6 categories for the response and 6 > independent predictors representing nutrient concentrations at sample poi

Re: [R] 2 dimensional interpolation from matlab to R

2010-02-05 Thread Charlie Sharpsteen
On Thu, Feb 4, 2010 at 4:03 AM, karine heerah wrote: > > Hi everybody. > > > > i have 2 datasets : one with a long grid a lat grid and a bathymetry grid > > the second one only has the long and lat coordinates. > > So i want to know the bathymetry associated to the second dataset. I thought > it

Re: [R] histogram scott

2010-02-05 Thread Duncan Murdoch
On 05/02/2010 12:21 PM, maram salem wrote: Dear all, I want to use the histogtam as a density estimator, with the binwidths calculated using scott's formula which is binwidth = 3.49*ST.dev.*n^(-1/3) for the following data (30 data points) 12-9-3-6-1-23-21-7-18-16-15-4-19-22-20-2-3-18-8-10-1-7-5

Re: [R] lattice barchart using a time scale in x axis

2010-02-05 Thread Fran Velasco
Thanks a lot for your help, the examples worked fine, just have to change the colours to produce a b&w plot, and add the legend On 05/02/2010 17:11, RICHARD M. HEIBERGER wrote: Fran, The trick is to use box.width, not box.ratio. xyplot(Perc ~ as.POSIXct(hora,format="%d-%m-%Y %H:%M"), d

Re: [R] glm models with more than one response

2010-02-05 Thread Gavin Simpson
On Fri, 2010-02-05 at 13:10 -0500, steve_fried...@nps.gov wrote: > Hi everyone, > > I am trying to construct a glm and am running into a couple of questions. > > The data set I am using consists of 6 categories for the response and 6 > independent predictors representing nutrient concentrations a

Re: [R] Reg: Rgraphviz installation

2010-02-05 Thread Ista Zahn
Also, a general suggestion about errors like this: google them! Searching for "libcdt-4.dll was not found" brings up http://n4.nabble.com/Rgraphviz-install-td878526.html as the first hit, and appears to describe the solution. Best, Ista On Fri, Feb 5, 2010 at 2:44 PM, Martin Morgan wrote: > On 0

[R] Non-linearity test in R

2010-02-05 Thread Kim Jung Hwa
Hi All, I'm fitting a linear (multiple) regression model with 3 predictors + their interactions. Can anyone suggest some test in R which can help me know whether I need a non-linear (regression) model or some transformation? I'm mostly concerned about finding a way to know whether I should fit a

[R] glm models with more than one response

2010-02-05 Thread Steve_Friedman
Hi everyone, I am trying to construct a glm and am running into a couple of questions. The data set I am using consists of 6 categories for the response and 6 independent predictors representing nutrient concentrations at sample point locations. Ultimately I'd like to use the probabilities for

[R] RJDBC with MS Access

2010-02-05 Thread Paul-
Has anybody used RJDBC to read tables from an MS Access mdb file? I know how to do it with RODBC, and I have used RJDBC with SQL Server. I am, however interested now in this particular combination. Is it possible? -- View this message in context: http://n4.nabble.com/RJDBC-with-MS-Access-tp1470

Re: [R] mysterious extra spaces appearing in expression paste

2010-02-05 Thread Charlie Sharpsteen
On Fri, Feb 5, 2010 at 9:41 AM, George Locke wrote: > > Hey, > > So I found workaround for what I'm now pretty sure is a bug. > > the reason I'm sure it's a bug is that i found that I found that > linebreaks are handled in a wonky but regular way.  There's a rule, > and it's a bad rule, ie a bug.

[R] how to draw two series in bwplot

2010-02-05 Thread Weidong Gu
Hi, I have two numeric columns (dat1 and dat2) to bwplot data=expand.grid(dat1=rnorm(20,10,5),class=1:5,school=letters[1:3])) data$dat2=rnorm(300,13,6) What I want is to plot two boxes side by side for class, bwplot((dat1+dat2)~as.factor(class)|school,data=data) but the code does not work thi

[R] (Another) Bates fortune?

2010-02-05 Thread Peter Ehlers
I vote to 'fortunize' Doug Bates on Hierarchical data sets: which software to use? "The widespread use of spreadsheets or SPSS data sets or SAS data sets which encourage the "single table with a gargantuan number of columns, most of which are missing data in most cases" approach to organization

Re: [R] t.test

2010-02-05 Thread Gavin Simpson
On Fri, 2010-02-05 at 17:44 +0100, ErikRH wrote: > Hello! I have a fairly simple (I guess) question. Suppose I have a table > like this: > > Score Group > 1 Ctrl > 2 Ctrl > ....... > 10Treat > 11Treat > .. ... ... > 25Treat > > What is(are) t

Re: [R] open script file from command line

2010-02-05 Thread Charlie Sharpsteen
> > >> On Linux and Unix machines, such as OS X, a the following hashbang line > could be added to the top of the script: > > #!/usr/bin/env Rscript > > Then the script can be run from a terminal using: > > cd path/to/script/files > I forgot to mention that in order to use the hashbang method

Re: [R] open script file from command line

2010-02-05 Thread Charlie Sharpsteen
On Thu, Feb 4, 2010 at 4:53 PM, Wade Wall wrote: > Hi all, > > Is there a function to open a script file from the command line? I have > several students who are Mac users and when they open up a script file it > does not send commands to the console, and unfortunately I don't know how > to > so

Re: [R] mysterious extra spaces appearing in expression paste

2010-02-05 Thread George Locke
Hey, So I found workaround for what I'm now pretty sure is a bug. the reason I'm sure it's a bug is that i found that I found that linebreaks are handled in a wonky but regular way. There's a rule, and it's a bad rule, ie a bug. The wonky rule is as follows: text(x,y,expression(paste("thing1 \n

Re: [R] Importing data coming from Splus into R.

2010-02-05 Thread William Dunlap
For a data.frame with only numeric and factor columns using dump() on the S+ end and source() on the R end ought to work. If you have timeDate columns you will need to convert them to character data before exporting and convert them to your favorite R time/date class after importing them. If you

Re: [R] t.test

2010-02-05 Thread Erik Iverson
See the identical example in ?t.test ErikRH wrote: Hello! I have a fairly simple (I guess) question. Suppose I have a table like this: Score Group 1 Ctrl 2 Ctrl ....... 10Treat 11Treat .. ... ... 25Treat What is(are) the simplest command(

[R] Quasi-binomial GLM and model selection

2010-02-05 Thread Heather Kharouba
Hi, I'm using a GLM with a quasi binomial error distribution and I would like to do a model selection method similar to step(AIC) to carry out a restricted search for the "best" model. I would like to know which of my 5 predictor variables would be included in the "best" model if I start with a 'f

Re: [R] Importing data coming from Splus into R.

2010-02-05 Thread gerald . jean
Uwe Ligges a écrit sur 2010/02/05 11:04:44 : > 1. I am stuck with a copy of S-PLUS 4.x. At that time I used dump() in > S-PLUS and source() to get things into R afterwards ... > > 2. Why do you think that 32-bit vs. 64-bit issues matter? The file > format does not change (well, this is guessed s

[R] t.test

2010-02-05 Thread ErikRH
Hello! I have a fairly simple (I guess) question. Suppose I have a table like this: Score Group 1 Ctrl 2 Ctrl ....... 10Treat 11Treat .. ... ... 25Treat What is(are) the simplest command(s) in R to perform an ordinary t-test for significant d

[R] Spider Plot color problem

2010-02-05 Thread IoanLoft
Hi all, I have encountered a problem which appears to have defeated my (admittedly nascent) R skills. I want to draw a spider plot with many cases (just over 300). I am primarily interested in the difference between 4 categories of cases, and want to display them as different colors. the col.star

[R] histogram scott

2010-02-05 Thread maram salem
Dear all, I want to use the histogtam as a density estimator, with the binwidths calculated using scott's formula which is binwidth = 3.49*ST.dev.*n^(-1/3) for the following data  (30 data points) 12-9-3-6-1-23-21-7-18-16-15-4-19-22-20-2-3-18-8-10-1-7-5-4-11-12-3-9-19-7 so first,I' ve tried this

[R] sum a particular column by group

2010-02-05 Thread Fang (Betty) Yang
Dear all, I have a table like this: > eds R.ID Region Gender Agegr Time nvisits 11 A F 60--64 1:00 1 22 OF 55--591:20 1 33 OF 55--59 3:45 3 44 S

Re: [R] Extract p-value from lm for the whole model

2010-02-05 Thread S Ellison
> I would like to ask how to extract the p-value for the whole model > from > summary(lm). If you mean the p-value given at the end of the summary() printout, it isn;t held in the summary object. But information to get it is. Using the ?lm example: ctl <- c(4.17,5.58,5.18,6.11,4.50,4.61,

Re: [R] Appending results via for loop

2010-02-05 Thread jim holtman
The element that you store in the list can be anything. If you have a matrix, or dataframe or anything else, it handles it just fine. Also each of the elements can have different dimensions. A list in this instance is probably more efficient than a dataframe that you would be adding to. Also if

Re: [R] Extract p-value from lm for the whole model

2010-02-05 Thread David Winsemius
On Feb 5, 2010, at 12:00 PM, Turchin, Michael wrote: You should be able to access the p-value using the $coefficients variable, which is part of summary. Try: results <- summary(lm(speed~dist, cars)) results$coefficients and then: results$coefficients[x] The "coefficients" element of th

Re: [R] Appending results via for loop

2010-02-05 Thread Turchin, Michael
Thanks Jim. If I want to store multiple columns of data in my list, how would I do that? result[j][[i]] where j could be 1, 2 or 3? And out of curiosity, why would I use a list over a dataframe in the instance were I only collecting one column of data? Is list more efficient than a dataframe at

Re: [R] Appending results via for loop

2010-02-05 Thread jim holtman
use a list: result <- list() for (i in 1:limit){ computation # store results result[[i]] <- yourData } On Fri, Feb 5, 2010 at 11:14 AM, Turchin, Michael wrote: > Hey all, > > So I'm manually conducting a sliding window test, and I would like to tack on > results to some var

Re: [R] Extract p-value from lm for the whole model

2010-02-05 Thread Turchin, Michael
You should be able to access the p-value using the $coefficients variable, which is part of summary. Try: results <- summary(lm(speed~dist, cars)) results$coefficients and then: results$coefficients[x] where x is the location of particular p-value, or coefficient supplied, you want, from the

Re: [R] Extract p-value from lm for the whole model

2010-02-05 Thread David Winsemius
On Feb 5, 2010, at 11:54 AM, Trafim Vanishek wrote: Dear all, I would like to ask how to extract the p-value for the whole model from summary(lm). This didn't help a lot summary.lm I disagree. The help page tells you that "coefficients" has the p- values sogo ahead and get them: > summ

Re: [R] String Manipulation- Extract numerical and alphanumerical segment

2010-02-05 Thread jim holtman
The '[[' is just the index access to an object. type: ?'[[' to see the help page. Actually I should have used '[' in this case: > sapply(y, '[', 1) [1] "1234567" "1234567" "1234567" is equivalent to: > sapply(y, function(a) a[1]) [1] "1234567" "1234567" "1234567" > So set a value based o

Re: [R] Importing data coming from Splus into R.

2010-02-05 Thread Uwe Ligges
1. I am stuck with a copy of S-PLUS 4.x. At that time I used dump() in S-PLUS and source() to get things into R afterwards ... 2. Why do you think that 32-bit vs. 64-bit issues matter? The file format does not change (well, this is guessed since I do not have any 64-bit S-PLUS version availabl

[R] Extract p-value from lm for the whole model

2010-02-05 Thread Trafim Vanishek
Dear all, I would like to ask how to extract the p-value for the whole model from summary(lm). This didn't help a lot summary.lm summary(lm(speed~dist, cars)) Thanks a lot! [[alternative HTML version deleted]] __ R-help@r-project.org mailing

Re: [R] About graphics

2010-02-05 Thread Silvano
It was exactly what I needed, thank you. -- Silvano Cesar da Costa Departamento de Estatística Universidade Estadual de Londrina Fone: 3371-4346 -- - Original Message - From: "Duncan Murdoch" To: "Silvano" Cc:

Re: [R] Using coxph with Gompertz-distributed survival data.

2010-02-05 Thread Terry Therneau
Before being helpful let me raise a couple of questions: 1. "I know I'm looking at longevity data (which is believed to have a Gompertz distribution for mammals dying from 'old age')". I'm not as convinced. The Gompertz is a nice story, but is confounded by individual risk or 'frailty'. B

Re: [R] ifelse on a series of rows for multiple criteria

2010-02-05 Thread David Winsemius
On Feb 5, 2010, at 10:48 AM, Steve Murray wrote: Dear all, I am attempting to perform a calculation which counts the number of positive (or negative) values based on the sample mean (on a per-row basis). If the mean is>0 then only positive values should be counted, and if the mean is <0

Re: [R] ifelse on a series of rows for multiple criteria

2010-02-05 Thread Thomas Liebig
Hi Steve, as far as i understood, you're trying to do this: direction_func <- function(combdframe) { ifelse(mean(combdframe==0), -9, sum((sign(mean(combdframe))*combdframe)>0)) } direction<-apply(combdframe, 1, direction_func) direction cheers, thomas -- Thomas Liebig Fraunhofe

Re: [R] About graphics

2010-02-05 Thread Duncan Murdoch
On 05/02/2010 10:48 AM, Silvano wrote: Hi, I'm building a graph (barplot) in which the X axis label disappears. I tried to use the option mgp of par() and I could not get the desired result. Note that want the axis labels horizontally. caes = c(37,20,19,16,75,103) names(caes) = c("Pinscher"

Re: [R] Hierarchical data sets: which software to use?

2010-02-05 Thread Douglas Bates
On Sun, Jan 31, 2010 at 10:24 PM, Anton du Toit wrote: > Dear R-helpers, > > I’m writing for advice on whether I should use R or a different package or > language. I’ve looked through the R-help archives, some manuals, and some > other sites as well, and I haven’t done too well finding relevant in

Re: [R] String Manipulation- Extract numerical and alphanumerical segment

2010-02-05 Thread Su C.
Yes, that was perfect! Thank you so much! Just to clarify, since I'm kind of new to string manipulation-- is that '[[' in the sapply function what is designating splits/elements within the string? So that's the part that says "I want this particular element" and the "1" or "2" or "number" is what

Re: [R] Importing data coming from Splus into R.

2010-02-05 Thread RICHARD M. HEIBERGER
Have you tried dput/dget or dump/source? On the S-Plus side, you need to tell it to use the older format. Rich __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/p

[R] maximum elements in an ff object?

2010-02-05 Thread Matthew Keller
Hello all, I hate to add to the daily queries regarding R's handling of large datsets ;), but... I read in an online powerpoint about the ff package something about the "length of an ff object" needing to be smaller than .Machine$integer.max. Does anyone know if this means that the # of elements

Re: [R] random slope models with lme --> failured to converge

2010-02-05 Thread Luc Villandré
Hi, I have been working with LMMs for a while now and convergence has proved to be a very common problem. Right now, your model has two random effect terms, that is, one for the intercept and one for the coefficient of the "treat" variable. This implies that the default variance-covariance mat

[R] Appending results via for loop

2010-02-05 Thread Turchin, Michael
Hey all, So I'm manually conducting a sliding window test, and I would like to tack on results to some variable as the results are outputted. I'm using a for loop, and I am currently using an array as my 'output collector' variable, though I know I should be using a dataframe of some sort. My q

Re: [R] lattice barchart using a time scale in x axis

2010-02-05 Thread RICHARD M. HEIBERGER
Fran, The trick is to use box.width, not box.ratio. xyplot(Perc ~ as.POSIXct(hora,format="%d-%m-%Y %H:%M"), data=digrate, groups=Drate, ## key=leg, xlab="time of the day", horizontal=FALSE, scales=list(alternating=FALSE, tck=c(1,0), x=list(at=seq(r

[R] Importing data coming from Splus into R.

2010-02-05 Thread gerald . jean
Hello there, I spent all day yesterday trying to get a small data set from Splus into R, no luck! Both, Splus and R, are run on a 64-bit RedHat Linux machine, the versions of the softwares are 64-bit and are as what follows: Splus: TIBCO Software Inc. Confidential Information Copyright (c) 1988

[R] ifelse on a series of rows for multiple criteria

2010-02-05 Thread Steve Murray
Dear all, I am attempting to perform a calculation which counts the number of positive (or negative) values based on the sample mean (on a per-row basis). If the mean is>0 then only positive values should be counted, and if the mean is <0 then only negative values should be counted. In cases w

  1   2   >