my earlier comment is probably irrelevant since you are fitting only
one qss component and have no other covariates.
A word of warning though when you go back to this on your new machine
-- you are almost surely going to want to specify
a large lambda for the qss component in the rqss call. The default
of 1 is likely to produce something very very rough with
such a large dataset.
url: www.econ.uiuc.edu/~roger Roger Koenker
email rkoen...@uiuc.edu Department of Economics
vox: 217-333-4558 University of Illinois
fax: 217-244-6678 Urbana, IL 61801
On Jun 24, 2009, at 5:04 PM, Jonathan Greenberg wrote:
Yep, its looking like a memory issue -- we have 6GB RAM and 1GB swap
-- I did notice that the analysis takes far less memory (and runs)
if I:
tahoe_rq <-
rqss(ltbmu_4_stemsha_30m_exp.img~ltbmu_eto_annual_mm.img,tau=.
99,data=boundary_data)
(which I assume fits a line to the quantiles)
vs.
tahoe_rq <-
rqss(ltbmu_4_stemsha_30m_exp.img~qss(ltbmu_eto_annual_mm.img),tau=.
99,data=boundary_data)
(which is fitting a spline)
Unless anyone else has any hints as to whether or not I'm making a
mistake in my call (beyond randomly subsetting the data -- I'd like
to run the analysis on the full dataset to begin with) -- I'd like
to fit a spline to the upper 1% of the data, I'll just wait until my
new computer comes in next week which has more RAM. Thanks!
--j
roger koenker wrote:
Jonathan,
Take a look at the output of sessionInfo(), it should say x86-64 if
you have a 64bit installation, or at least I think this is the case.
Regarding rqss(), my experience is that (usually) memory problems
are due to the fact that early on the processing there is
a call to model.matrix() which is supposed to create a design, aka
X, matrix for the problem. This matrix is then coerced to
matrix.csr sparse format, but the dense form is often too big for
the machine to cope with. Ideally, someone would write an
R version of model.matrix that would permit building the matrix in
sparse form from the get-go, but this is a non-trivial task.
(Or at least so it appeared to me when I looked into it a few years
ago.) An option is to roll your own X matrix: take a smalller
version of the data, apply the formula, look at the structure of X
and then try to make a sparse version of the full X matrix.
This is usually not that difficult, but "usually" is based on a
rather small sample that may not be representative of your problems.
Hope that this helps,
Roger
url: www.econ.uiuc.edu/~roger Roger Koenker
email rkoen...@uiuc.edu Department of Economics
vox: 217-333-4558 University of Illinois
fax: 217-244-6678 Urbana, IL 61801
On Jun 24, 2009, at 4:07 PM, Jonathan Greenberg wrote:
Rers:
I installed R 2.9.0 from the Debian package manager on our amd64
system that currently has 6GB of RAM -- my first question is
whether this installation is a true 64-bit installation (should R
have access to > 4GB of RAM?) I suspect so, because I was running
an rqss() (package quantreg, installed via install.packages() -- I
noticed it required a compilation of the source) and watched the
memory usage spike to 4.9GB (my input data contains > 500,000
samples).
With this said, after 30 mins or so of processing, I got the
following error:
tahoe_rq <-
rqss(ltbmu_4_stemsha_30m_exp.img~qss(ltbmu_eto_annual_mm.img),tau=.
99,data=boundary_data)
Error: cannot allocate vector of size 1.5 Gb
The dataset is a bit big (300mb or so), so I'm not providing it
unless necessary to solve this memory problem.
Thoughts? Do I need to compile either the main R "by hand" or
the quantreg package?
--j
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.