Re: [R] Cross tabulation with top one variable and side as multiple variables
They do not match because xtabs() in R produces a multidimensional array (one dimension for each variable). Looking at your spreadsheet on nabble, it appears that SPSS is just creating 4 crosstabulations with TREND against each of the other variables. That is easily done in R, but for tested code, you need to give us reproducible data using dput(). I get an error using read.spss() on your uploaded file. You should also read some of the extensive free documentation available on R. The ftable() function creates a two dimensional representation of that 5-dimensional array. But your spreadsheet is just a stack of two-dimensional tables. You could get there with the margin.table() function, but unless you really need the 5-dimensional array, you probably want something more like: rowvars - c(AGET, SEXT, EDUCRT, JOBRT) table.lst - lapply(rowvars, function(x) xtabs(~x+TREND)) That would give you a list containing a crosstabulation table between each of the variables and TREND. A spreadsheet with 2000 tables seems a bit unwieldy so you might want to give some thought to what you really want as output. - David L Carlson Department of Anthropology Texas AM University College Station, TX 77840-4352 -Original Message- From: R-help [mailto:r-help-boun...@r-project.org] On Behalf Of jagadishpchary Sent: Thursday, June 18, 2015 1:46 AM To: r-help@r-project.org Subject: Re: [R] Cross tabulation with top one variable and side as multiple variables I think my explanation in the post is not giving the full details on the job to be done. Sorry for that. Here is what I am doing.. 1. I have a SPSS data set with more than 2000 variables. However for test purpose I have created a temporary data set with 5 variables which I am reading it to R environment (Attached the test.sav file). 2. There is a variable called “TREND” which has the year data. So all I need to do is cross tabulate the variables with this Trend variable. In SPSS the syntax would be CTABLES /VLABELS VARIABLES =ALL DISPLAY=LABEL /TABLES (AGET +SEXT +EDUCRT +JOBRT ) [COUNT F40.0] by TREND. The final cross tabulation results are placed in the attached excel report with sheet name “Results”. As I am new to R - I tried searching the forums for the cross tabulation with top variable constant and multiple variables as side however I could not find it. Anyhow I tried using the below syntax : Xtabs ( ~ AGET +SEXT +EDUCRT +JOBRT + TREND, data=mydata) summary(~AGET +SEXT +EDUCRT +JOBRT, data= mydata, fun=table) ftable (mydata, row.vars=c(AGET , SEXT , EDUCRT , “JOBRT”), col.vars=TREND) the results are not identical to what I am getting in SPSS Hence I would request to suggest me a R code that helps me in getting the results as shown in the attached excel report with sheet name “Results”. Test.sav http://r.789695.n4.nabble.com/file/n4708799/Test.sav Cross_tabulation.xlsx http://r.789695.n4.nabble.com/file/n4708799/Cross_tabulation.xlsx -- View this message in context: http://r.789695.n4.nabble.com/Cross-tabulation-with-top-one-variable-and-side-as-multiple-variables-tp4708379p4708799.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Cross tabulation with top one variable and side as multiple variables
I think my explanation in the post is not giving the full details on the job to be done. Sorry for that. Here is what I am doing.. 1. I have a SPSS data set with more than 2000 variables. However for test purpose I have created a temporary data set with 5 variables which I am reading it to R environment (Attached the test.sav file). 2. There is a variable called “TREND” which has the year data. So all I need to do is cross tabulate the variables with this Trend variable. In SPSS the syntax would be CTABLES /VLABELS VARIABLES =ALL DISPLAY=LABEL /TABLES (AGET +SEXT +EDUCRT +JOBRT ) [COUNT F40.0] by TREND. The final cross tabulation results are placed in the attached excel report with sheet name “Results”. As I am new to R - I tried searching the forums for the cross tabulation with top variable constant and multiple variables as side however I could not find it. Anyhow I tried using the below syntax : Xtabs ( ~ AGET +SEXT +EDUCRT +JOBRT + TREND, data=mydata) summary(~AGET +SEXT +EDUCRT +JOBRT, data= mydata, fun=table) ftable (mydata, row.vars=c(AGET , SEXT , EDUCRT , “JOBRT”), col.vars=TREND) the results are not identical to what I am getting in SPSS Hence I would request to suggest me a R code that helps me in getting the results as shown in the attached excel report with sheet name “Results”. Test.sav http://r.789695.n4.nabble.com/file/n4708799/Test.sav Cross_tabulation.xlsx http://r.789695.n4.nabble.com/file/n4708799/Cross_tabulation.xlsx -- View this message in context: http://r.789695.n4.nabble.com/Cross-tabulation-with-top-one-variable-and-side-as-multiple-variables-tp4708379p4708799.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Cross tabulation with top one variable and side as multiple variables
Hi: I have a huge data with lot of variables and I need to check the trend variations from year to year. In order to do so, I have to cross tabulate the year variable as top (constant) and all the remaining variables as side (attached the cross tabulation report). I have searched the forums but the syntax I could find for cross tabulation is between 2 or 3 variables. So i would request to provide a code which can print the data in the same way as in the attached. http://r.789695.n4.nabble.com/file/n4708379/Untitled.png -- View this message in context: http://r.789695.n4.nabble.com/Cross-tabulation-with-top-one-variable-and-side-as-multiple-variables-tp4708379.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Cross tabulation with top one variable and side as multiple variables
There are two issues here... calculation and presentation. The table function from base R can work with many variables. If your data set is so large that you have problems with memory then you could investigate data.table or sqldf packages, which perform the computations but do not present the data in cross tabulation form. You could use table or perhaps the tables package to render the data into the desired form. --- Jeff NewmillerThe . . Go Live... DCN:jdnew...@dcn.davis.ca.usBasics: ##.#. ##.#. Live Go... Live: OO#.. Dead: OO#.. Playing Research Engineer (Solar/BatteriesO.O#. #.O#. with /Software/Embedded Controllers) .OO#. .OO#. rocks...1k --- Sent from my phone. Please excuse my brevity. On June 9, 2015 1:40:53 AM PDT, jagadishpchary p.jagad...@inrhythm-inc.com wrote: Hi: I have a huge data with lot of variables and I need to check the trend variations from year to year. In order to do so, I have to cross tabulate the year variable as top (constant) and all the remaining variables as side (attached the cross tabulation report). I have searched the forums but the syntax I could find for cross tabulation is between 2 or 3 variables. So i would request to provide a code which can print the data in the same way as in the attached. http://r.789695.n4.nabble.com/file/n4708379/Untitled.png -- View this message in context: http://r.789695.n4.nabble.com/Cross-tabulation-with-top-one-variable-and-side-as-multiple-variables-tp4708379.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Cross tabulation with top one variable and side as multiple variables
On Jun 9, 2015, at 1:40 AM, jagadishpchary wrote: Hi: I have a huge data with lot of variables and I need to check the trend variations from year to year. In order to do so, I have to cross tabulate the year variable as top (constant) and all the remaining variables as side (attached the cross tabulation report). I have searched the forums but the syntax I could find for cross tabulation is between 2 or 3 variables. So i would request to provide a code which can print the data in the same way as in the attached. http://r.789695.n4.nabble.com/file/n4708379/Untitled.png I think you will find that people on this list expect you to provide data in the form of text rather than pictures. When I looked at the request there were two routes I considered: 1) combine margin.table with ftable and 2) investigate one of (but not both) of plyr or dply packages. View this message in context: http://r.789695.n4.nabble.com/Cross-tabulation-with-top-one-variable-and-side-as-multiple-variables-tp4708379.html Sent from the R help mailing list archive at Nabble.com. Nabble is neither the R help mailing list nor its archive. Nabble also removes this message from replies. You should read the material about the list. R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. David Winsemius Alameda, CA, USA __ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Cross tabulation with top one variable and side as multiple variables
We probably should have a better idea of what the raw data looks like and perhaps a bit better idea of what the analyis is to show. Have a look at http://stackoverflow.com/questions/5963269/how-to-make-a-great-r-reproducible-example and http://adv-r.had.co.nz/Reproducibility.html for some suggestions. In particular see the discussion about dput() for the best way to provide sample data to the help list. John Kane Kingston ON Canada -Original Message- From: p.jagad...@inrhythm-inc.com Sent: Tue, 9 Jun 2015 01:40:53 -0700 (PDT) To: r-help@r-project.org Subject: [R] Cross tabulation with top one variable and side as multiple variables Hi: I have a huge data with lot of variables and I need to check the trend variations from year to year. In order to do so, I have to cross tabulate the year variable as top (constant) and all the remaining variables as side (attached the cross tabulation report). I have searched the forums but the syntax I could find for cross tabulation is between 2 or 3 variables. So i would request to provide a code which can print the data in the same way as in the attached. http://r.789695.n4.nabble.com/file/n4708379/Untitled.png -- View this message in context: http://r.789695.n4.nabble.com/Cross-tabulation-with-top-one-variable-and-side-as-multiple-variables-tp4708379.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Can't remember your password? Do you need a strong and secure password? Use Password manager! It stores your passwords protects your account. __ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.