Re: [R] Significant performance difference between split of adata.frame and split of vectors

2009-12-18 Thread Matthew Dowle
() itself. Please note there is no split.data.table method and the default method does not appear efficient. methods(split) shows there is no split method for data.table but data.frame has its own special split method. The following method has been added as a feature request in the data.table proje

Re: [R] by function ??

2009-12-21 Thread Matthew Dowle
or if Dataset is a data.table : > Dataset = data.table(Dataset) > Dataset[,abs(ratio-median(ratio)),by="LEAID"] LEAIDV1 [1,] 6307 0.0911905 [2,] 6307 0.0488095 [3,] 6307 0.0488095 [4,] 6307 0.1088095 [5,] 8300 0.2021538 [6,] 8300 0.000 [7,] 8300 0.060 rather than : > Da

Re: [R] by function ??

2009-12-22 Thread Matthew Dowle
Maybe this (with enough data for a CI) ? : > Dataset = data.table(Dataset) > Dataset[,as.list(wilcox.test(ratio,conf.int=TRUE)$conf.int),by="LEAID"] LEAID V1 V2 [1,] 6307 0.720 0.92 [2,] 8300 0.5678462 0.83 Warning messages: 1: In switch(alternative, two.sided

Re: [R] by function ??

2009-12-23 Thread Matthew Dowle
221-977494.p...@n4.nabble.com... > > Thanks, but that produces what I think is an estimated interval. > I really want to use the above formula. I just can't figure out how to > get it to run by the LEAID. > It does require 9 observations to produce an interval, but I was showin

Re: [R] function in aggregate applied to specific columns only

2010-01-04 Thread Matthew Dowle
> That makes eight solutions. Any others? :) A ninth was detailed in two other threads last month. The first link compares to ave(). http://tolstoy.newcastle.edu.au/R/e8/help/09/12/9014.html http://tolstoy.newcastle.edu.au/R/e8/help/09/12/8830.html "Dennis Murphy" wrote in message news:9a8a6c6

Re: [R] by function ??

2010-01-05 Thread Matthew Dowle
ly and some other ways to aggregate in R, but more compact, more natural, easier and more convenient (and therefore quicker) to write, debug and maintain. "Matthew Dowle" wrote in message news:hgnjev$3h...@ger.gmane.org... > or if Dataset is a data.table : > >> Dataset =

Re: [R] R matching lat/lon pairs from two datasets?

2010-01-05 Thread Matthew Dowle
Or if there is a requirement for speed or shorter more convenient syntax then there is a data.table join. Basically setkey(data1,V1,V2) and setkey(data2,V1,V2), then "data1[data2]" does the merge very quickly. You probably then want to do something with the merged data set, which you just add

Re: [R] mean for subset

2010-01-06 Thread Matthew Dowle
As can data.table (i.e. do 'having' in one statement) : > DT = data.table(DF) > DT[,list(n=length(NAME),mean(SCORE)),by="NAME"][n==3] NAME n V2 [1,] James 3 64.0 [2,] Tom 3 78.7 > but data.table isn't restricted to SQL functions (such as avg), any R functions can be used,

Re: [R] mean for subset

2010-01-07 Thread Matthew Dowle
t recently received a new laptop > computer, and now I have an occassional problem with Word's "pretty print > quotes," but if you know about that problem, it is easy to fix. > > Jerry Floren > Minnesota Department of Agriculture > > > > > > Matthew Do

[R] Error: object of type 'closure' is not subsettable

2010-01-13 Thread Matthew Walker
return(TRUE) } What am I doing wrong when I use rep()? Thanks in advance, Matthew Walker __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide

Re: [R] Advantages of using SQLite for data import in comparison to csv files

2010-01-15 Thread Matthew Dowle
Just to comment on this bit : > For one thing, you cannot index a csv file or a data.frame. If you have to > repeatedly select subsets of your large data set, creating an index on the > relevant column in the sqlite table is an absolute life saver. This is one reason the data.table package was

Re: [R] problem of data manipulation

2010-01-20 Thread Matthew Dowle
The user wrote in their first post : > I have a lot of observations in my dataset Heres one way to do it with a data.table : a=data.table(a) ans = a[ , list(dt=dt[dt-min(dt)<7]) , by="var1,var2,var3"] class(ans$dt) = "Date" Timings are below comparing the 3 methods. In

Re: [R] problem of data manipulation

2010-01-20 Thread Matthew Dowle
Sounds like a good idea. Would it be possible to give an example of how to combine plyr with data.table, and why that is better than a data.table only solution ? "hadley wickham" wrote in message news:f8e6ff051001200624r2175e38xf558dc8fa3fb6...@mail.gmail.com... > Note that in the documentaton

Re: [R] problem of data manipulation

2010-01-20 Thread Matthew Dowle
; On Wed, Jan 20, 2010 at 8:43 AM, Matthew Dowle > wrote: >> Sounds like a good idea. Would it be possible to give an example of how >> to >> combine plyr with data.table, and why that is better than a data.table >> only >> solution ? > > Well, ideally, you

Re: [R] loop on list levels and names

2010-01-21 Thread Matthew Dowle
One way is : dataset = data.table(ssfamed) dataset[, < whatever "some functions" are on Asfc, Smc, epLsar, etc >, by="SPECSHOR,BONE"] Your SPECSHOR and BONE names will be in your result alongside the results of the Or try package plyr which does this sort of thing too. And sqldf may be bett

Re: [R] Mutliple sets of data in one dataset....Need a loop?

2010-01-21 Thread Matthew Dowle
> but I have thousands of results so it would be really hand to find away of > doing this quickly > its a little difficult to follow those examples Given your data in data.frame DF, maybe add the following to your list to investigate : > dat = data.table(DF) > dat[, cor(Score1,Score2), by="Expe

Re: [R] Once again: Error: cannot allocate vector of size

2010-01-22 Thread Matthew Dowle
Please re-read the posting guide e.g. you didn't provide an example data set or a way to generate one, or any R version information. "Werner W." wrote in message news:646146.32238...@web23002.mail.ird.yahoo.com... > Hi, > > I have browsed the help list and looked at the FAQ but I don't find >

Re: [R] Merging and extracting data from list

2010-01-22 Thread Matthew Dowle
?merge plyr data.table sqldf crantastic "Dr. Viviana Menzel" wrote in message news:4b58a0e9.3050...@gmx.de... Hello R-help group, I have a question about merging lists. I have two lists: Genes list (hSgenes) namechrstrandstartendtransStarttransEnd symboldescription

Re: [R] loop on list levels and names

2010-01-22 Thread Matthew Dowle
#x27;m just using the ones I was told for one specific function), but don't worry I won't forget. As you said "It only works if users contribute to it". That makes the power of R! Ivan Le 1/21/2010 19:01, Matthew Dowle a écrit : > One way is : > > dataset = data.ta

Re: [R] Once again: Error: cannot allocate vector of size

2010-01-22 Thread Matthew Dowle
Fantastic. You're much more likely to get a response now. Best of luck. "werner w" wrote in message news:1264175935970-1100164.p...@n4.nabble.com... > > Thanks Matthew, you are absolutely right. > > I am working on Windows XP SP2 32bit with R versions 2.9.1. &

Re: [R] loop on list levels and names

2010-01-22 Thread Matthew Dowle
-forge like this : install.packages("data.table",repos="http://r-forge.r-project.org";) Best of luck. "Ivan Calandra" wrote in message news:4b59d93c.5080...@uni-hamburg.de... Thanks for your advice, I will work on it then! Just one last question. In which package can

[R] PNG resolution

2010-01-26 Thread Matthew Walker
nits="in", res=72) plot(0,0, main="Title Text") dev.off() png("300dpi.png", width=6+2/3, height=6+2/3, units="in", res=300) plot(0,0, main="Title Text") dev.off() png("600dpi.png", width=6+2/3, heig

Re: [R] PNG resolution

2010-01-26 Thread Matthew Walker
Dieter Menne wrote: Matthew Walker wrote: I expect that if I change only the resolution of an image, although the image would have more pixels, if viewed in the same physical size, the elements in the image would have the same physical size but with more detail. The sample you

Re: [R] RMySQL - Bulk loading data and creating FK links

2010-01-27 Thread Matthew Dowle
How many columns, and of what type are the columns ? As Olga asked too, it would be useful to know more about what you're really trying to do. 3.5m rows is not actually that many rows, even for 32bit R. Its depends on the columns and what you want to do with those columns. At the risk of sugge

Re: [R] RMySQL - Bulk loading data and creating FK links

2010-01-27 Thread Matthew Dowle
uot; wrote in message news:971536df1001270629w4795da89vb7d77af6e4e8b...@mail.gmail.com... On Wed, Jan 27, 2010 at 8:56 AM, Matthew Dowle wrote: > How many columns, and of what type are the columns ? As Olga asked too, it > would be useful to know more about what you're really trying to do. > > 3.5m rows is not a

Re: [R] RMySQL - Bulk loading data and creating FK links

2010-01-28 Thread Matthew Dowle
uld not be important as long as you can do what you want. SQL is declarative so you just specify what you want rather than how to get it and invisibly to the user it automatically draws up a query plan and then uses that plan to get the result. On Wed, Jan 27, 2010 at 12:48 PM, Matthew Dowle wrote:

Re: [R] RMySQL - Bulk loading data and creating FK links

2010-01-28 Thread Matthew Dowle
is to hide the implementation and focus on the problem. That is why we use high level languages, object orientation, etc. On Thu, Jan 28, 2010 at 4:37 AM, Matthew Dowle wrote: > How it represents data internally is very important, depending on the real > goal : > http://en.wikipedia.org/w

Re: [R] RMySQL - Bulk loading data and creating FK links

2010-01-28 Thread Matthew Dowle
age provides links to their benchmarks. Thus if R is fast enough then its likely that the database approach is fast enough too since its even faster. On Thu, Jan 28, 2010 at 8:52 AM, Matthew Dowle wrote: > Are you claiming that SQL is that utopia? SQL is a row store. It ca

Re: [R] PNG resolution

2010-01-29 Thread Matthew Walker
__ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.

Re: [R] PNG resolution

2010-01-29 Thread Matthew Walker
version, please email me. Cheers, Matthew Walker Joshua Wiley wrote: Hello Matthew, I do not have a direct answer to your problem, but you could try saving it as an EPS and then rasterizing it using a graphics program (e.g., GIMP) to whatever resolution you wanted. Best luck finding a real

Re: [R] merging columns

2010-02-03 Thread Matthew Dowle
Yes. data.df[,wcol,drop=FALSE] For an explanation of drop see ?"[.data.frame" "Chuck White" wrote in message news:20100202212800.o8xbu.681696.r...@mp11... > Additional clarification: the problem only comes when you have one column > selected from the original dataframe. You need to make the fo

Re: [R] Reading large files

2010-02-05 Thread Matthew Dowle
I agree with Jim. The term "do analysis" is almost meaningless, the posting guide makes reference to statements such as that. At least he tried to define large, but inconsistenly (first of all 850MB, then changed to 10-20-15GB). > Satish wrote: "at one time I will need to load say 15GB into R"

[R] maximum elements in an ff object?

2010-02-05 Thread Matthew Keller
#x27;t let me store it because 19e9 is >> .Machine$integer.max = 2^31. Anyone else have suggestions on how to deal with such massive datasets like the ones I'm using? I'm exploring ncdf as we speak. Best, Matt -- Matthew C Keller Asst. Professor of Psychology Un

Re: [R] Reading large files

2010-02-05 Thread Matthew Dowle
I can't help you further than whats already been posted to you. Maybe someone else can. Best of luck. "Satish Vadlamani" wrote in message news:1265397089104-1470667.p...@n4.nabble.com... > > Matthew: > If it is going to help, here is the explanation. I have an end stat

[R] question about bigmemory: releasing RAM from a big.matrix that isn't used anymore

2010-02-05 Thread Matthew Keller
0, but it remains at 18Gb. How do I return RAM to the system? Thanks, Matt -- Matthew C Keller Asst. Professor of Psychology University of Colorado at Boulder www.matthewckeller.com __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/

Re: [R] question about bigmemory: releasing RAM from a big.matrix that isn't used anymore

2010-02-06 Thread Matthew Keller
datasets to still use R. Matt On Fri, Feb 5, 2010 at 9:27 PM, Steve Lianoglou wrote: > Hi, > > On Fri, Feb 5, 2010 at 9:24 PM, Matthew Keller wrote: >> Hi all, >> >> I'm on a Linux server with 48Gb RAM. I did the following: >> >> x <- >> b

Re: [R] question about bigmemory: releasing RAM from a big.matrix that isn't used anymore

2010-02-06 Thread Matthew Keller
the OS to free up memory, even after a gc(). >>>> But it's available for re-use; if you want to be really sure, have a >>>> look >>>> in /dev/shm to make sure the shared memory segments have been >>>> deleted. > > Thanks

Re: [R] Ideal (possible) configuration for an exalted R system

2009-03-09 Thread Matthew Keller
-help >> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html >> and provide commented, minimal, self-contained, reproducible code. >> > > __ > R-help@r-project.org mailing list > https://stat.ethz.ch/mailm

Re: [R] popular R packages

2009-03-09 Thread Matthew Keller
elate their experience with it, good or bad." Find the whole thread here: https://stat.ethz.ch/pipermail/r-help/2007-December/147323.html. Matt -- Matthew C Keller Asst. Professor of Psychology University of Colorado at Boulder www.matthewckeller.com

[R] non-positive definite matrix remedies?

2009-03-11 Thread Matthew Keller
esses of different approaches are? Thanks in advance, Matt -- Matthew C Keller Asst. Professor of Psychology University of Colorado at Boulder www.matthewckeller.com __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEA

Re: [R] non-positive definite matrix remedies?

2009-03-11 Thread Matthew Keller
er.ca/jfox > >> -Original Message- >> From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] > On >> Behalf Of Matthew Keller >> Sent: March-11-09 6:20 PM >> To: r-help@r-project.org >> Subject: [R] non-positive definite ma

[R] fwrite() not found in data.table package

2017-10-02 Thread Matthew Keller
a a namespace (and not attached): [1] tools_3.2.0 chron_2.3-47 tcltk_3.2.0 -- Matthew C Keller Asst. Professor of Psychology University of Colorado at Boulder www.matthewckeller.com [[alternative HTML version deleted]] __ R-help@r-project.org mailing l

Re: [R] fwrite() not found in data.table package

2017-10-02 Thread Matthew Keller
ble example (check it with package reprex) and try again here, or > ask one of the maintainers of that package. > -- > Sent from my phone. Please excuse my brevity. > > On October 2, 2017 8:56:46 AM PDT, Matthew Keller > wrote: > >Hi all, > > > >I used to use fwri

Re: [R] Joining data frames

2021-06-29 Thread Matthew McCormack
nging df.) df_final<- full_join(df, df1, by = c(“Sample”, "Plot")) Matthew On 6/29/21 7:15 PM, Jim Lemon wrote: External Email - Use Caution Hi Esthi, Have you tried something like: df2<-merge(df,df1,by.x="Sample",by.y="Plot",all.y=TRUE) Thi

Re: [R] Differential Gene Expression in R

2021-08-22 Thread Matthew McCormack
tle and you will see that edgeR is available. You will still have to learn a little about edgeR analysis, so reading the vignette will be very helpful.    Also, for the comparisons you want to do, statistical help is recommended. Matthew On 8/22/21 2:13 PM, Anas Jamshed wrote: Exte

[R] Why does residuals.coxph use naive.var?

2017-03-02 Thread Matthew Burgess
Hi all, I noticed that the scaled Schoenfeld residuals produced by residuals.coxph(fit, type="scaledsch") were different from those returned by cox.zph for a model where robust standard errors have been estimated. Looking at the source code for both functions suggests this is because residuals.cox

[R] Defining Variables from a Matrix for 10-Fold Cross Validation

2018-10-09 Thread matthew campbell
me across that specific message before, and am not exactly sure how to interpret its meaning. What exactly is this error message trying to tell me? Any suggestions or insights are appreciated! Thank you all, Matthew Campbell > library (ElemStatLearn) > library(kknn) > data(zip.train)

[R] Define pch and color based on two different columns

2019-04-09 Thread Matthew Snyder
ilter rule match" Thanks, Matt *Matthew R. Snyder* *~* PhD Candidate University Fellow University of Toledo Computational biologist, ecologist, and bioinformatician Sponsored Guest Researcher at NOAA PMEL, Seattle, WA. matthew.snyd...@rockets.utoledo.edu msnyder...@gmail.co

Re: [R] Define pch and color based on two different columns

2019-04-09 Thread Matthew Snyder
for single plots in ggplot. Maybe I should contact the authors of lattice and see if this is something they can help me with or if they would like to add this as a feature in the future... Matt *Matthew R. Snyder* *~* PhD Candidate University Fellow University of Toledo Computational

Re: [R] Define pch and color based on two different columns

2019-04-09 Thread Matthew Snyder
unique in the whole plot. But when you add cyl as a factor. Those two points are only unique within their respective panels, and not across the whole plot. Matt *Matthew R. Snyder* *~* PhD Candidate University Fellow University of Toledo Computational biologist, ecologist, and

Re: [R] Define pch and color based on two different columns

2019-04-09 Thread Matthew Snyder
I tried this too: xyplot(mpg ~ wt | cyl, data=mtcars, # groups = carb, subscripts = TRUE, col = as.factor(mtcars$gear), pch = as.factor(mtcars$carb) ) Same problem... *Matthew R. Snyder* *~* PhD Candidate University Fellow University of Toledo

Re: [R] Define pch and color based on two different columns

2019-04-09 Thread Matthew Snyder
You are not late to the party. And you solved it! Thank you very much. You just made my PhD a little closer to reality! Matt *Matthew R. Snyder* *~* PhD Candidate University Fellow University of Toledo Computational biologist, ecologist, and bioinformatician Sponsored Guest

Re: [R] RNA Seq Analysis in R

2020-08-01 Thread Matthew McCormack
Excel and then inspect the logFC and p-values for the top 1250 genes. Matthew On 8/1/20 1:13 PM, Jeff Newmiller wrote: > External Email - Use Caution > > https://www.bioconductor.org/help/ > > On August 1, 2020 4:01:08 AM PDT, Anas Jamshed > wrote: >> I choo

Re: [R] analyzing results from Tuesday's US elections

2020-11-08 Thread Matthew McCormack
om/statistical-anomalies-in-biden-votes-analyses-indicate_3570518.html?utm_source=newsnoe&utm_medium=email&utm_campaign=breaking-2020-11-08-5 Matthew On 11/8/20 11:25 PM, Bert Gunter wrote: > External Email - Use Caution > > NYT had interactive maps that reported votes by c

Re: [R] analyzing results from Tuesday's US elections

2020-11-09 Thread Matthew McCormack
Benford Analysis for Data Validation and Forensic Analytics Provides tools that make it easier to validate data using Benford's Law. https://www.rdocumentation.org/packages/benford.analysis/versions/0.1.5 Matthew On 11/9/20 9:23 AM, Alexandra Thorn wrote: > External Ema

Re: [R] analyzing results from Tuesday's US elections

2020-11-15 Thread Matthew McCormack
both agree with Benfords Law. However, he uses the last digit and not the first. A word of caution before you click on that link: he uses Excel ! Matthew On 11/13/20 9:59 PM, Rolf Turner wrote: External Email - Use Caution On Thu, 12 Nov 2020 01:23:06 +0100 Martin Møller Skarbiniks P

Re: [R] analyzing results from Tuesday's US elections

2020-11-16 Thread Matthew McCormack
e explored. A oncee-in-a-while dive into a practical application of statistics that has current interest can be fun and enlightening for those interested. Matthew On 11/16/20 9:01 PM, Abby Spurdle wrote: External Email - Use Caution I've come to the conclusion this whole

Re: [R] analyzing results from Tuesday's US elections

2020-11-16 Thread Matthew McCormack
Bye the way, I thought I had checked my e-mail before sending it, but my last e-mail had an unfortunate typo with an 'I' that originally belonged to the beginning of a deleted sentence. Matthew On 11/17/20 1:54 AM, Matthew McCormack wrote: External Email - Use Caution No

[R] dataframe, transform, strsplit

2010-10-25 Thread Matthew Pettis
Hi, I have a dataframe that has a column of vectors that I need to extract off the character string before the first '.' character and put it into a separate column. I thought I could use 'strsplit' for it within 'transform', but I can't seem to get the right invocation. Here is a sample datafra

Re: [R] dataframe, transform, strsplit

2010-10-25 Thread Matthew Pettis
way I think it should... Thank you both again, Matt On Mon, Oct 25, 2010 at 12:09 PM, Gabor Grothendieck < ggrothendi...@gmail.com> wrote: > On Mon, Oct 25, 2010 at 12:53 PM, Matthew Pettis > wrote: > > Hi, > > > > I have a dataframe that has a column of vectors t

[R] ggplot2: facet_grid with only one level does not display the graph with the facet_grid level in title

2010-10-28 Thread Matthew Pettis
Hi All, Here is the code that I'll be referring to: p <- ggplot(wastran.data, aes(PER_KEY, EVENTS)) (p <- p + facet_grid( pool.short ~ .) + stat_summary(aes(y=EVENTS), fun.y = sum, geom="line") + opts(axis.text.x = theme_text(angle = 90, hjust=1), title="Events (15min.) vs. Time: Face

[R] Printing data.frame data: alternatives to print?

2010-10-29 Thread Matthew Pettis
Hi, I have a data frame with two factors (well, more, but 2 for simple consideration), and I want to display the different combinations of the them that actually occur in the data. In reality, there are too many of them to do to do a 'table' call and have one col vertical and one col horizontal (

Re: [R] Printing data.frame data: alternatives to print?

2010-10-29 Thread Matthew Pettis
inor B > 7 Maj III Minor B > 8 Maj III Minor C > 9 Maj III Minor C >> df[!duplicated(df),] >       f1      f2 > 1   Maj I Minor A > 4  Maj II Minor A > 5  Maj II Minor B > 7 Maj III Minor B > 8 Maj III Minor C >> > > > On Fri, Oct 29, 2010 at 9:53 AM,

[R] remove data frame from list of data frames

2010-11-07 Thread Matthew Finkbeiner
I have a list of data frames like this: a<- data.frame(x=runif(10), y = runif(10), Acc = 1) b<- data.frame(x=runif(10), y = runif(10), Acc = 0) ls<- list(a,b) and I want to remove the data frames from ls that have Acc values other than 1. How do I do that? Thanks for any help!

Re: [R] remove data frame from list of data frames

2010-11-07 Thread Matthew Finkbeiner
Thank you Jim (and others who responded off list). This does the trick for me perfectly: ls[sapply(ls, function(x) all(x$Acc == 1))] Thanks again! Matthew On Sun, Nov 7, 2010 at 11:34 PM, jim holtman wrote: > Is this what you are asking; this accepts any dataframe that has at > lea

[R] Catmap package and forest plots

2010-11-09 Thread Matthew Law
1<-catmap('rs2305764.txt', 0.95, TRUE, TRUE) catmap.forest(catmapobject1, TRUE, TRUE) I think I am failing to assign the results of the meta anlaysis - catmap('rs2305764.txt', 0.95, TRUE, TRUE)--- to an object in R, and then have catmap.forest act on that object. thanks

[R] socket blocking with readBin

2011-06-30 Thread Matthew Maycock
eem to have that option. Is there any way to make readBin not block? ~Matthew Maycock [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting gui

Re: [R] ASCII values to Decimal

2011-07-08 Thread Matthew Maycock
You can do: as.numeric(charToRaw(paste(df[,4], collapse=""))) If you somehow want each row to be its own sequence of integers, you could do something like: lapply(df[,4], function(c) as.numeric(charToRaw(c))); ~Matthew Maycock -Original Message- From: r-help-boun...@r-p

Re: [R] manipulating "by" lists and "ave()" functions

2011-07-11 Thread Matthew Dowle
Users of package 'unknownR' already know simplify2array was added in R 2.13.0. They also know what else was added. Do you? http://unknownr.r-forge.r-project.org/ "Joshua Wiley" wrote in message news:canz9z_j+trwoim3scayuaruors+8hyc30pmt_thiex6qmto...@mail.gmail.com... > On Sat, Jul 9, 2011

Re: [R] EXTERNAL: Re: subset with aggregate key

2011-07-13 Thread Matthew Dowle
To close this thread on-list : packageVersion() was added to R in 2.12.0. data.table's dependency on 2.12.0 is updated, thanks. Matthew "Jesse Brown" wrote in message news:4e1b21a8.8090...@atl.lmco.com... > Matthew Dowle wrote: >> Hi, >> >> Try package &#

[R] gls yields much smaller std. errors with different base for contrasts

2011-07-20 Thread Matthew Wolak
91.74 1> coefficients(summary(cs2))[1]+coefficients(summary(cs2))[3] #3m:9f intercept in cs2 (Intercept) 91.74 1> coefficients(summary(cs1))[1]+coefficients(summary(cs1))[3] #6m:0f intercept in cs1 (Intercept) 114.195 1> coefficients(summary(cs2))[1] #6m:0f intercept in cs2 (

Re: [R] Sequential Naming of ggplot .pngs using plyr

2011-08-11 Thread Matthew Dowle
=site,y=val))+geom_boxplot()+opts(title=paste('plot number',ctr,sep=' '))) dev.off() ctr<-ctr+1 }, by=site] Btw, there was a new feature in 1.6.3, where you can subassign into data.table 500 times faster than <-. See the NEWS from 1.6.3 for an e

[R] how to merge distance data based on location

2011-08-19 Thread Matthew Keller
t how to accomplish this? Thank you, Matthew Keller -- Matthew C Keller Asst. Professor of Psychology University of Colorado at Boulder www.matthewckeller.com __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do

Re: [R] Pass By Value Questions

2010-08-20 Thread Matthew Dowle
stand correctly. Matthew http://datatable.r-forge.r-project.org/ -- View this message in context: http://r.789695.n4.nabble.com/Pass-By-Value-Questions-tp2331565p2332330.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.o

Re: [R] Sorting and subsetting

2010-09-21 Thread Matthew Dowle
5,] 1 1 0.6674880 [6,] 2 2 1.2236383 [7,] 2 2 0.9606766 [8,] 2 2 0.8654497 [9,] 2 2 0.5404112 [10,] 2 2 0.3373457 > As you can see it currently repeats the group column which is a shame (on the to do list to fix). Matthew http://d

Re: [R] Sorting and subsetting

2010-09-21 Thread Matthew Dowle
s.Rnw). Perhaps this example could be added there; PDF is on the homepage. One test is 340 times faster and the other is 13 times faster. More examples would be good. Matthew http://datatable.r-forge.r-project.org/ "Joshua Wiley" wrote in message news:aanlktimyuvl9suj65ktzqvpnyn

Re: [R] Sorting and subsetting

2010-09-21 Thread Matthew Dowle
See data.table:::duplist which does that (or at least very similar) in C, for multiple columns too. Matthew http://datatable.r-forge.r-project.org/ "peter dalgaard" wrote in message news:660991c3-b52b-4d58-b819-eadc95ecc...@gmail.com... > > On Sep 21, 2010, at 16:27 , Jo

[R] POSIXct: Extract the hour for a list of elements

2010-09-24 Thread Matthew Pettis
Hi, I have a list/data.frame 'pk' of POSIXct dates, and I'd like to extract the hour for each row. I know that if I have an individual POSIXct object, I can extract the hour by converting to a new object with: new.lt <- as.POSIXlt() new.lt$hour But I can't figure out how to apply this for a lis

Re: [R] POSIXct: Extract the hour for a list of elements

2010-09-24 Thread Matthew Pettis
hat I want. Thank you again! Matt On Fri, Sep 24, 2010 at 5:50 PM, Phil Spector wrote: > Matthew - > It's a bit simpler than you think: > > as.POSIXlt(pk)$hour > > should return what you want. (If not, please provide a reproducible > example.) >

[R] sample from very large distribution

2010-09-29 Thread Matthew Finkbeiner
by selecting every 100th item, it introduces a systematicity that may be problematic. I've tried this: x<- seq(1,2^32, by=sample(1:200, 1)) but that yields some unpredictable behavior so, any suggestions? Thank you kindly, Matthew [[alter

[R] problem with spline

2010-10-01 Thread Matthew Finkbeiner
461.692, 464.045, 466.409, 468.787, 471.254, 473.328) s<- spline(x,y, n=length(x)) plot(x,y) lines(s, col="blue") Is this a bug? Or am I doing something very wrong? Thanks for any help. Matthew __ R-help@r-project.org mailing list https://stat.

Re: [R] problem with spline

2010-10-02 Thread Matthew Finkbeiner
Thank you Michael, yes, perhaps that is the problem. I've since figured out a solution using approx() that is working just fine for me. Thanks again. Matthew On Sat, Oct 2, 2010 at 4:45 PM, Michael Bedward wrote: > I'm not overly familiar with the spilne function in R but I wa

[R] Offscreen rendering in RGL?

2010-06-27 Thread Matthew Neilson
Hi there, I've written a script for reading 3D simulation data into R, rendering it using RGL, and then saving the resulting plot using the snapshot3d() function. The results are fantastic! However, whenever RGL plots anything it automatically brings the viewing window into focus. Since I

Re: [R] Offscreen rendering in RGL?

2010-06-28 Thread Matthew Neilson
On 27 Jun 2010, at 22:19, Duncan Murdoch wrote: On 27/06/2010 12:58 PM, Matthew Neilson wrote: Hi there, I've written a script for reading 3D simulation data into R, rendering it using RGL, and then saving the resulting plot using the snapshot3d() function. The results are fant

Re: [R] Performance enhancement for ave

2010-06-29 Thread Matthew Dowle
> dt = data.table(d,key="grp1,grp2") > system.time(ans1 <- dt[ , list(mean(x),mean(y)) , by=list(grp1,grp2)]) user system elapsed 3.890.003.91# your 7.064 is 12.23 for me though, so this 3.9 should be faster for you However, Rprof() shows that 3.9 is mostly dispatch of mea

[R] Stacked Restricted Boltzmann Machine

2010-07-01 Thread Matthew OKane
Hi, Are there any implementations of stacked RBMs either complete or planned in R? Thanks, Matthew [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the

Re: [R] Query about using timestamps returned by SQL as 'factor' forsplit

2010-07-09 Thread Matthew Dowle
?sort.list. That is ultra fast for integers. Again radix is something Tom added to data.table. The radix algorithm (see wikipedia) is specifically designed to sort integers only. We would use Date, but that is stored as numeric. IDate is the same as Date but stor

[R] Not nice behaviour of nlminb (windows 32 bit, version 2.11.1)

2010-07-09 Thread Matthew Killeya
nlminb( obj = function(x) x, start=1, lower=-Inf, upper=Inf ) $par [1] 0 $objective [1] 0 $convergence [1] 0 $message [1] "absolute function convergence (6)" $iterations [1] 1 $evaluations function gradient 22 [[alternative HTML version deleted]] _

Re: [R] Not nice behaviour of nlminb (windows 32 bit, version 2.11.1)

2010-07-09 Thread Matthew Killeya
33 >> >> >> This is clearly a bug. >> >> >> Ravi. >> >> -Original Message- >> From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] >> On >> Behalf Of Ravi Varadhan >> Sent: Friday, July 09

Re: [R] Not nice behaviour of nlminb (windows 32 bit, version 2.11.1)

2010-07-09 Thread Matthew Killeya
And further to those two: nlminb( obj = function(x) 2*(x+3), start=-2, lower=-Inf, upper=Inf ) On 9 July 2010 23:09, Matthew Killeya wrote: > Yes clearly a bug... there are numerous variations ... problem seems to be > for a linear function whenever the first function valuation is 1. &

Re: [R] Not nice behaviour of nlminb (windows 32 bit, version 2.11.1)

2010-07-11 Thread Matthew Killeya
>> >> Ph. (410) 502-2619 >> email: rvarad...@jhmi.edu >> >> >> - Original Message - >> From: Duncan Murdoch >> Date: Saturday, July 10, 2010 7:32 am >> Subject: Re: [R] Not nice behaviour of nlminb (windows 32 bit, version >> 2.11.1)

Re: [R] long to wide on larger data set

2010-07-12 Thread Matthew Dowle
o that code as I didn't test it, but I think it should be possible in 32bit using R 2.12. Is it an option to just keep it in long format and use a data.table ? testDate[, somecomplexrfunction(onecolumn, anothercolumn), by=list(V2) ] Why you you need to reshape from long to wide ? HT

Re: [R] long to wide on larger data set

2010-07-12 Thread Matthew Dowle
he rest still applies, and you might have a much easier time than I thought since you are on 64bit. I was working on the basis of squeezing into 32bit. Matthew "Matthew Dowle" wrote in message news:i1faj2$lv...@dough.gmane.org... > > Hi Juliet, > > Thanks for the info. &g

[R] MplusAutomation

2010-07-13 Thread Gushta, Matthew
vely locks me out of my computer for the duration of the simulation. my question is this: can anyone suggest how i might pass the "quiet" command to the DOS program? is there a way to generally specify this from R? or any specific recommendations/experience with this package?

[R] Cforest mincriterion

2010-07-27 Thread Matthew OKane
riterion = qnorm(0.95) which would obviously feed the function a different value. Thanks in advance, Matthew [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the

Re: [R] Cforest mincriterion

2010-07-27 Thread Matthew OKane
le? I will stick to other "testype"s as a work around for this. Thanks again, Matt On 27 July 2010 16:34, Achim Zeileis wrote: > On Tue, 27 Jul 2010, Matthew OKane wrote: > > Hi, >> >> Could anyone help me understand how the mincriterion threshold works in &g

Re: [R] How to deal with more than 6GB dataset using R?

2010-07-27 Thread Matthew Keller
https://stat.ethz.ch/mailman/listinfo/r-help >> PLEASE do read the posting guide http://www.R-project.org/posting- >> guide.html >> and provide commented, minimal, self-contained, reproducible code. > > ______ > R-help@r-proj

Re: [R] Introductory statistics and introduction to R

2010-07-28 Thread Matthew Keller
ce (voice mail) > -3302 Department > -3170 FAX > VoIP: gjke...@ekiga.net > E-mail: gke...@ysu.edu > http://people.ysu.edu/~gkerns/ > > __ > R-help@r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLE

Re: [R] Finding points where two timeseries cross over

2010-08-04 Thread Matthew Dowle
Is this what you mean? x=c(1,2,2,3,4,5,6,3,2,1) y=c(2,3,4,2,1,2,3,4,5,6) matplot(cbind(x,y),type="l") which(diff(sign(x-y))!=0)+1 [1] 4 8 -- View this message in context: http://r.789695.n4.nabble.com/Finding-points-where-two-timeseries-cross-over-tp2313257p2313510.html Sent from the R help ma

Re: [R] limits of a data frame size for reading into R

2010-08-05 Thread Matthew Keller
rs, so they're limited to >> >> 2^31-1 >> >> elements in total. >> >> This is only likely to be a limitation on a 64 bit machine; in 32 bits >> >> you'll run out of memory first. >> >> >> >> Duncan Murdoch >>

<    1   2   3   4   5   >