Re: [R] qqnorm & huge datasets

2011-12-22 Thread cberry
Bert Gunter writes: > Chuck: > > A bad idea, I think: Rounding to unique values loses data density, > while sampling preserves it (to display resolution -- also a form of > rounding). Bert, The subject is "qqnorm & huge datasets". Downsampling for qqplots produces unreliable tails. Try downsam

Re: [R] qqnorm & huge datasets

2011-12-22 Thread Bert Gunter
Chuck: A bad idea, I think: Rounding to unique values loses data density, while sampling preserves it (to display resolution -- also a form of rounding). -- Bert On Thu, Dec 22, 2011 at 11:10 AM, wrote: > Sam Steingold writes: > >> Hi, >> When qqnorm on a vector of length 10M+ I get a huge pd

Re: [R] qqnorm & huge datasets

2011-12-22 Thread cberry
Sam Steingold writes: > Hi, > When qqnorm on a vector of length 10M+ I get a huge pdf file which > cannot be loaded by acroread or evince. > Any suggestions? (apart from sampling the data). > Thanks. Following the other suggestions, I did not notice mention of another trick for slimming down gra

Re: [R] qqnorm & huge datasets

2011-12-22 Thread peter dalgaard
On Dec 22, 2011, at 18:17 , Sam Steingold wrote: >> * peter dalgaard [2011-12-21 23:59:18 +0100]: >> On Dec 21, 2011, at 23:10 , Sam Steingold wrote: >>> When qqnorm on a vector of length 10M+ I get a huge pdf file which >>> cannot be loaded by acroread or evince. >>> Any suggestions? (apart fro

Re: [R] qqnorm & huge datasets

2011-12-22 Thread Sam Steingold
> * peter dalgaard [2011-12-21 23:59:18 +0100]: > On Dec 21, 2011, at 23:10 , Sam Steingold wrote: >> When qqnorm on a vector of length 10M+ I get a huge pdf file which >> cannot be loaded by acroread or evince. >> Any suggestions? (apart from sampling the data). > > Sample intelligently? Things l

Re: [R] qqnorm & huge datasets

2011-12-21 Thread R. Michael Weylandt
I'd second Peter's suggestion, but if you need every data point for whatever reason, you might also try passing the pch = "." option to qqnorm. On a test with 1e7 data points, it more than halved the resulting file size and with that many points, there's no loss in clarity with the different marker

Re: [R] qqnorm & huge datasets

2011-12-21 Thread peter dalgaard
On Dec 21, 2011, at 23:10 , Sam Steingold wrote: > Hi, > When qqnorm on a vector of length 10M+ I get a huge pdf file which > cannot be loaded by acroread or evince. > Any suggestions? (apart from sampling the data). Sample intelligently? Things like > qq <- seq(-4,4,,10001) > qqplot(qq,quantil

Re: [R] qqnorm & huge datasets

2011-12-21 Thread Marc Schwartz
On Dec 21, 2011, at 4:10 PM, Sam Steingold wrote: > Hi, > When qqnorm on a vector of length 10M+ I get a huge pdf file which > cannot be loaded by acroread or evince. > Any suggestions? (apart from sampling the data). > Thanks. Depending upon what your end product needs to be, generate a png fil

[R] qqnorm & huge datasets

2011-12-21 Thread Sam Steingold
Hi, When qqnorm on a vector of length 10M+ I get a huge pdf file which cannot be loaded by acroread or evince. Any suggestions? (apart from sampling the data). Thanks. -- Sam Steingold (http://sds.podval.org/) on Ubuntu 11.10 (oneiric) X 11.0.11004000 http://mideasttruth.com http://honestreporting