Re: [R] Getting information encoded in a SAS, SPSS or Stata command file into R.

2012-11-15 Thread Anthony Damico
> What I do not understand is how SAS knows where the variables begin and > end. > I managed to break off a little hunk of the beginning of my file and look > at > it in an editor, and it is numbers without any obvious delimiters. Is the > delimiter a particular numeric string? I thought the SAS co

Re: [R] Getting information encoded in a SAS, SPSS or Stata command file into R.

2012-11-14 Thread Daniel Nordlund
> -Original Message- > From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] > On Behalf Of andrewH > Sent: Wednesday, November 14, 2012 2:34 PM > To: r-help@r-project.org > Subject: Re: [R] Getting information encoded in a SAS, SPSS or Stata >

Re: [R] Getting information encoded in a SAS, SPSS or Stata command file into R.

2012-11-14 Thread David Winsemius
On Nov 14, 2012, at 2:33 PM, andrewH wrote: > Dear Anthony – > > On closer examination, what I am talking about is not factor levels, but > something different (but analogous). The data that is categorical all has > integer codes, so the file is entirely numeric. The SAS proc format then > give

Re: [R] Getting information encoded in a SAS, SPSS or Stata command file into R.

2012-11-14 Thread andrewH
Dear Anthony – On closer examination, what I am talking about is not factor levels, but something different (but analogous). The data that is categorical all has integer codes, so the file is entirely numeric. The SAS proc format then gives text strings for each code for each categorical variable

Re: [R] Getting information encoded in a SAS, SPSS or Stata command file into R.

2012-11-14 Thread Anthony Damico
Hi Andrew, great to hear from you :) You really ought to review the (100% R-specific) US Government Survey Datasets already available at http://usgsd.blogspot.com/ and contact me directly if you hit a problem -- I am furiously working on a few right now (ACS, SIPP, BSAPUFs, BRFSS, MEPS) , and am

Re: [R] Getting information encoded in a SAS, SPSS or Stata command file into R.

2012-11-13 Thread andrewH
Wow! After reading Jan's post, I said "Great, I'll do that," because it was the closest to what I originally had in mind. Then I read Ista's post, and said "I think I'l try that first," because it got me back on the track of following directions in the R Data Import/Export manual. Then I read Anth

Re: [R] Getting information encoded in a SAS, SPSS or Stata command file into R.

2012-11-13 Thread David Winsemius
On Nov 13, 2012, at 7:20 AM, Anthony Damico wrote: > Hi Andrew, to work with the Current Population Survey with R, your best > best is to use a variant of my SAScii package that works with a SQLite > database (and therefore doesn't overload RAM). > > I have written obsessively-documented code ab

Re: [R] Getting information encoded in a SAS, SPSS or Stata command file into R.

2012-11-13 Thread Anthony Damico
Hi Andrew, to work with the Current Population Survey with R, your best best is to use a variant of my SAScii package that works with a SQLite database (and therefore doesn't overload RAM). I have written obsessively-documented code about how to work with the CPS in R here.. http://usgsd.blogspot

Re: [R] Getting information encoded in a SAS, SPSS or Stata command file into R.

2012-11-13 Thread Ista Zahn
Hi Andrew, You may be able to run the SPSS syntax file using pspp (http://www.gnu.org/software/pspp/) Best, Ista On Mon, Nov 12, 2012 at 11:23 PM, andrewH wrote: > Dear folks – > I have a large (26 gig) ASCII flat file in fixed-width format with about 10 > million observations of roughly 400 va

Re: [R] Getting information encoded in a SAS, SPSS or Stata command file into R.

2012-11-13 Thread Jan
Hi, If it is your objective to get your data in an ffdf, I suggest you look at the SAS/SPSS/Stata code to see where each column is starting, next try out the LaF package as it allows you to read in large fixed width format files and once y

[R] Getting information encoded in a SAS, SPSS or Stata command file into R.

2012-11-12 Thread andrewH
Dear folks – I have a large (26 gig) ASCII flat file in fixed-width format with about 10 million observations of roughly 400 variables. (It is 51 years of Current Population Survey micro data from IPUMS, roughly half the fields for each record). The file was produced by automatic process in respo