Greetings Everybody:
I generated a 1.2MB dta file based on the general social survey with Stata8 for linux. The file can be re-opened with Stata, but when I bring it into R, it says all the values are missing for most of the variables.
This dataset is called "morgen.dta" and I dropped a copy online in case you are interested
http://www.ku.edu/~pauljohn/R/morgen.dta
looks like this to R (I tried various options on the read.dta command):
> myDat <- read.dta("morgen.dta") > summary(myDat) CASEID year id hrs1 hrs2 Min. : 19721 Min. :1972 Min. : 1 NAP : 0 NAP : 0 1st Qu.: 1983475 1st Qu.:1978 1st Qu.: 445 DK : 0 DK : 0 Median : 1996808 Median :1987 Median : 905 NA : 0 NA : 0 Mean : 9963040 Mean :1986 Mean : 990 NA's:40933 NA's:40933 3rd Qu.:19872187 3rd Qu.:1994 3rd Qu.:1358 Max. :20002817 Max. :2000 Max. :3247
prestige agewed age educ paeduc DK,NA,NAP: 0 NAP : 0 DK : 0 NAP : 0 NAP : 0 NA's :40933 DK : 0 NA : 0 DK : 0 DK : 0 NA : 0 NA's:40933 NA : 0 NA : 0 NA's:40933 NA's:40933 NA's:40933
maeduc speduc income NAP : 0 NAP : 0 $25000 OR MORE:14525 DK : 0 DK : 0 $10000 - 14999: 5022 NA : 0 NA : 0 $15000 - 19999: 3869 NA's:40933 NA's:40933 $20000 - 24999: 3664 REFUSED : 1877 (Other) : 8523 NA's : 3453 >
Here's what Stata sees when I load the same thing:
summarize, detail
Case identification number ------------------------------------------------------------- Percentiles Smallest 1% 197432 19721 5% 199649 19722 10% 1974116 19723 Obs 40933 25% 1983475 19724 Sum of Wgt. 40933
50% 1996808 Mean 9963040 Largest Std. Dev. 9006352 75% 1.99e+07 2.00e+07 90% 2.00e+07 2.00e+07 Variance 8.11e+13 95% 2.00e+07 2.00e+07 Skewness .18931 99% 2.00e+07 2.00e+07 Kurtosis 1.045409
GSS YEAR FOR THIS RESPONDENT ------------------------------------------------------------- Percentiles Smallest 1% 1972 1972 5% 1973 1972 10% 1974 1972 Obs 40933 25% 1978 1972 Sum of Wgt. 40933
50% 1987 Mean 1986.421 Largest Std. Dev. 8.61136 75% 1994 2000 90% 1998 2000 Variance 74.15552 95% 2000 2000 Skewness -.0789223 99% 2000 2000 Kurtosis 1.799939
RESPONDENT ID NUMBER ------------------------------------------------------------- Percentiles Smallest 1% 18 1 5% 89 1 10% 178 1 Obs 40933 25% 445 1 Sum of Wgt. 40933
50% 905 Mean 989.9129 Largest Std. Dev. 689.0596 75% 1358 3244 90% 2027 3245 Variance 474803.2 95% 2437 3246 Skewness .8359211 99% 2867 3247 Kurtosis 3.311248
NUMBER OF HOURS WORKED LAST WEEK ------------------------------------------------------------- Percentiles Smallest 1% 6 0 5% 15 0 10% 21 0 Obs 23279 25% 37 0 Sum of Wgt. 23279
50% 40 Mean 41.05206 Largest Std. Dev. 13.95931 75% 48 89 90% 60 89 Variance 194.8624 95% 65 89 Skewness .195045 99% 82 89 Kurtosis 4.448998
NUMBER OF HOURS USUALLY WORK A WEEK ------------------------------------------------------------- Percentiles Smallest 1% 4 0 5% 15 0 10% 20 1 Obs 774 25% 38 2 Sum of Wgt. 774
50% 40 Mean 39.79199 Largest Std. Dev. 13.43383 75% 45 89 90% 55 89 Variance 180.4677 95% 60 89 Skewness -.0002332 99% 80 89 Kurtosis 5.009869
RS OCCUPATIONAL PRESTIGE SCORE (1970) ------------------------------------------------------------- Percentiles Smallest 1% 14 12 5% 17 12 10% 20 12 Obs 24267 25% 30 12 Sum of Wgt. 24267
50% 39 Mean 39.35645 Largest Std. Dev. 14.03712 75% 48 82 90% 60 82 Variance 197.0407 95% 62 82 Skewness .2927414 99% 76 82 Kurtosis 2.775553
AGE WHEN FIRST MARRIED ------------------------------------------------------------- Percentiles Smallest 1% 15 12 5% 17 12 10% 17 12 Obs 25382 25% 19 12 Sum of Wgt. 25382
50% 21 Mean 22.09609 Largest Std. Dev. 4.813944 75% 24 63 90% 28 68 Variance 23.17405 95% 31 73 Skewness 2.002265 99% 39 73 Kurtosis 11.28279
AGE OF RESPONDENT ------------------------------------------------------------- Percentiles Smallest 1% 19 18 5% 21 18 10% 24 18 Obs 40790 25% 30 18 Sum of Wgt. 40790
50% 42 Mean 45.14798 Largest Std. Dev. 17.53519 75% 58 89 90% 71 89 Variance 307.4828 95% 77 89 Skewness .4774907 99% 86 89 Kurtosis 2.239618
HIGHEST YEAR OF SCHOOL COMPLETED ------------------------------------------------------------- Percentiles Smallest 1% 3 0 5% 7 0 10% 8 0 Obs 40806 25% 11 0 Sum of Wgt. 40806
50% 12 Mean 12.48152 Largest Std. Dev. 3.176226 75% 14 20 90% 16 20 Variance 10.08841 95% 18 20 Skewness -.3389303 99% 20 20 Kurtosis 3.960311
HIGHEST YEAR SCHOOL COMPLETED, FATHER ------------------------------------------------------------- Percentiles Smallest 1% 0 0 5% 3 0 10% 4 0 Obs 29347 25% 8 0 Sum of Wgt. 29347
50% 11 Mean 10.20994 Largest Std. Dev. 4.342143 75% 12 20 90% 16 20 Variance 18.85421 95% 17 20 Skewness -.1628909 99% 20 20 Kurtosis 2.826482
HIGHEST YEAR SCHOOL COMPLETED, MOTHER ------------------------------------------------------------- Percentiles Smallest 1% 0 0 5% 3 0 10% 6 0 Obs 34151 25% 8 0 Sum of Wgt. 34151
50% 12 Mean 10.41478 Largest Std. Dev. 3.709352 75% 12 20 90% 14 20 Variance 13.75929 95% 16 20 Skewness -.6324499 99% 18 20 Kurtosis 3.605715
HIGHEST YEAR SCHOOL COMPLETED, SPOUSE ------------------------------------------------------------- Percentiles Smallest 1% 4 0 5% 7 0 10% 8 0 Obs 22780 25% 12 0 Sum of Wgt. 22780
50% 12 Mean 12.53095 Largest Std. Dev. 3.103418 75% 14 20 90% 16 20 Variance 9.631203 95% 18 20 Skewness -.287755 99% 20 20 Kurtosis 4.051822
TOTAL FAMILY INCOME ------------------------------------------------------------- Percentiles Smallest 1% 1 1 5% 3 1 10% 5 1 Obs 37480 25% 9 1 Sum of Wgt. 37480
50% 11 Mean 9.75619 Largest Std. Dev. 2.994967 75% 12 13 90% 12 13 Variance 8.969825 95% 13 13 Skewness -1.29205 99% 13 13 Kurtosis 3.759778
.
-- Paul E. Johnson email: [EMAIL PROTECTED] Dept. of Political Science http://lark.cc.ku.edu/~pauljohn 1541 Lilac Lane, Rm 504 University of Kansas Office: (785) 864-9086 Lawrence, Kansas 66044-3177 FAX: (785) 864-5700
______________________________________________ [EMAIL PROTECTED] mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html