On Tue, 2008-09-30 at 18:56 -0500, Frank E Harrell Jr wrote:
Bernardo Rangel Tura wrote:
On Sat, 2008-09-27 at 10:51 -0700, milicic.marko wrote:
I have a huge data set with thousands of variables and one binary
variable. I know that most of the variables are correlated and are not
Bernardo Rangel Tura wrote:
On Tue, 2008-09-30 at 18:56 -0500, Frank E Harrell Jr wrote:
Bernardo Rangel Tura wrote:
On Sat, 2008-09-27 at 10:51 -0700, milicic.marko wrote:
I have a huge data set with thousands of variables and one binary
variable. I know that most of the variables are
It would not be possible to answer your original
question until you specify your goal.
Is it to develop a model with external validity
that will generalize to new data? (You are not
likely to succeed, if you are starting with a
boil-the-ocean approach with 44,000+ covariates
and millions of
From: Frank E Harrell Jr
Bernardo Rangel Tura wrote:
On Tue, 2008-09-30 at 18:56 -0500, Frank E Harrell Jr wrote:
Bernardo Rangel Tura wrote:
On Sat, 2008-09-27 at 10:51 -0700, milicic.marko wrote:
I have a huge data set with thousands of variables and one binary
variable. I know
:[EMAIL PROTECTED] On Behalf Of Liaw, Andy
Sent: Wednesday, October 01, 2008 12:01 PM
To: Frank E Harrell Jr; [EMAIL PROTECTED]
Cc: r-help@r-project.org
Subject: Re: [R] Logistic regression problem
From: Frank E Harrell Jr
Bernardo Rangel Tura wrote:
On Tue, 2008-09-30 at 18:56 -0500, Frank E
The only solution I can see is fitting all possible 2-factor models enabling
interactions and then assessing if the interaction term is significant...
any more ideas?
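
To get a feel for the scale of the proposal above, here is a rough back-of-the-envelope sketch in Python. The predictor counts and the 0.05 threshold are illustrative assumptions (the thread only says "thousands" of variables), and the function name is made up for this example. It assumes each interaction test is correctly calibrated, so that under a global null (no real interactions at all) each test rejects with probability alpha.

```python
from math import comb


def pairwise_screen_size(n_predictors, alpha=0.05):
    """Return (number of 2-factor models, expected spurious hits under a global null).

    Hypothetical helper for illustration only; not from the thread.
    """
    # One model per unordered pair of predictors.
    n_models = comb(n_predictors, 2)
    # By linearity of expectation, the expected count of falsely
    # "significant" interaction terms is alpha * n_models, whether or
    # not the individual tests are independent.
    expected_false_positives = alpha * n_models
    return n_models, expected_false_positives


if __name__ == "__main__":
    # Assumed predictor counts, chosen only to show how the numbers grow.
    for p in (100, 1_000, 5_000):
        n_models, expected_fp = pairwise_screen_size(p)
        print(f"{p:>6} predictors: {n_models:>12,} models, "
              f"about {expected_fp:,.0f} spurious hits at alpha = 0.05")
```

The count grows quadratically in the number of predictors, so even with no real signal a pairwise screen would be expected to flag a very large number of interactions. This is only arithmetic, not a simulation of predictive performance.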
Milicic B. Marko wrote:
I have a huge data set with thousands of variables and one binary
variable. I know that most of the
Milicic B. Marko wrote:
The only solution I can see is fitting all possible 2-factor models enabling
interactions and then assessing if the interaction term is significant...
any more ideas?
Please don't suggest such a thing unless you do simulations to back up
its predictive performance, type
On Sat, 2008-09-27 at 10:51 -0700, milicic.marko wrote:
I have a huge data set with thousands of variables and one binary
variable. I know that most of the variables are correlated and are not
good predictors... but...
It is very hard to start modeling with such a huge dataset. What would
, September 30, 2008 2:54 PM
To: Milicic B. Marko
Cc: r-help@r-project.org
Subject: Re: [R] Logistic regression problem
Milicic B. Marko wrote:
The only solution I can see is fitting all possible 2-factor models enabling
interactions and then assessing if the interaction term is significant...
any more
Bernardo Rangel Tura wrote:
On Sat, 2008-09-27 at 10:51 -0700, milicic.marko wrote:
I have a huge data set with thousands of variables and one binary
variable. I know that most of the variables are correlated and are not
good predictors... but...
It is very hard to start modeling with such a
I have a huge data set with thousands of variables and one binary
variable. I know that most of the variables are correlated and are not
good predictors... but...
It is very hard to start modeling with such a huge dataset. What would
be your suggestion? How to make a first cut... how to eliminate