I need to write up a case study of ordinary least squares
multiple regression modeling using regression splines and
bootstrap model validation. I have a lot of medical examples
but would like an example from a discipline that is a big
non-medical user of statistical modeling. I need a public-
domain dataset with at least 200 observations, almost no
missing data (my other case studies cover the more usual
case where there is a significant number of missings), and
several independent variables, at least two of which are
continuous. A dataset that many readers can connect with
would be especially good. I appreciate any suggestions and
pointers to where the dataset can be downloaded.
Thanks in advance.
To return the favor, we have lots of good datasets on our
web site. One of special note for teaching binary logistic
regression is the "titanic3" dataset with data on 1300
of the Titanic passengers, including survival status.
See http://hesweb1.med.virginia.edu/biostat/s/data/.
Many of our datasets are in S-Plus format. If many non-S-Plus
users who do not possess a data conversion utility such as
DBMS/Copy need a specific dataset on our Web site I can convert it
to another format.
--
Frank E Harrell Jr Prof. of Biostatistics & Statistics
Div. of Biostatistics & Epidem. Dept. of Health Evaluation Sciences
U. Virginia School of Medicine http://hesweb1.med.virginia.edu/biostat
=================================================================
Instructions for joining and leaving this list and remarks about
the problem of INAPPROPRIATE MESSAGES are available at
http://jse.stat.ncsu.edu/
=================================================================