John Sorkin wrote:
Frank,
A programming language's efficience is a function of several items, including what you are trying to program. Without using SAS proc IML, I have found that it is more efficient to code algorithms (e.g. a least squares linear regression) using R than SAS; we all know that matrix notation leads to more compact syntax than can be had when using non-matrix notation and R implements matrix notation. On the other hand, searching, sub-setting, merging etc. can a times be coded more efficiently, more easily, and in a more easily understood fashion is SAS. I am sure you people who use SAS to set up their datasets and then use R when they are developing an algorithm.
Just as French may be a better language to express love, Italian a better 
language in which to write opera, and English the most efficient language for 
communication (at least for the last 50 years), so too do both R and SAS have a 
place in the larger world.
John

John I'll have to strongly disagree with most of your statement about data manipulation. R is far more powerful, easier to debug dynamically, and concise for merging, reshaping, recoding, etc. But I agree on the "easily understood" portion of your statement.

Cheers
Frank


John David Sorkin M.D., Ph.D.
Chief, Biostatistics and Informatics
University of Maryland School of Medicine Division of Gerontology
Baltimore VA Medical Center
10 North Greene Street
GRECC (BT/18/GR)
Baltimore, MD 21201-1524
(Phone) 410-605-7119
(Fax) 410-605-7913 (Please call phone number above prior to faxing)

Frank E Harrell Jr <f.harr...@vanderbilt.edu> 2/27/2009 12:52 PM >>>
John Sorkin wrote:
Terry's remarks (see below) are well received however, I take issue with one part of his comments. 
As a long time programmer (in both "statistical" programming languages and 
"traditional" programming languages), I miss the ability to write native-languages in R. 
While macros can make for difficult to read code, when used properly, they can also make flexible 
code that, if properly written (including good documentation, which should be a part of any code) 
can be easy to read.

Finally, everyone must remember that SAS code can be difficult to understand or 
"inefficient" just as R code can be difficult to understand or "inefficient". 
In the end, both programming systems have their advantages and disadvantage. No programming 
language is perfect. It is not fair, nor correct to damn one or the other. Accept the fact that 
some things are more easily and more clearly done in one language, other things are more clearly 
and more easily done in another language.  Let's move on to more important issues, viz. improving R 
so it is as good as it possibly can be.
John

Nice points John. My only response is that I learned SAS in 1969 and used it intensively until 1991. I wrote some of the first user-contributed SAS procedures (PROCs PCTL, GRAPH, DATACHK, LOGIST, PHGLM) and wrote extensively in the macro language. After using S-Plus for only one month my productivity was far ahead of my productivity using SAS.

Frank

John David Sorkin M.D., Ph.D.
Chief, Biostatistics and Informatics
University of Maryland School of Medicine Division of Gerontology
Baltimore VA Medical Center
10 North Greene Street
GRECC (BT/18/GR)
Baltimore, MD 21201-1524
(Phone) 410-605-7119
(Fax) 410-605-7913 (Please call phone number above prior to faxing)

Terry Therneau <thern...@mayo.edu> 2/27/2009 10:23 AM >>>
Three comments

I actually think you can write worse code in R than in SAS: more tools = more scope for innovatively bad ideas. The ability to write bad code should not damm a language. I found almost all of the "improvements" to the multi-line SAS recode to be regressions, both the SAS and the S suggestions. a. Everyone, even those of you with no SAS backround whatsoever, immediately understood the code. Most of the replacements are obscure. Compilers are very good these days and computers are fast, fewer typed characters != better. b. If I were writing the S code for such an application, it would look much the same. I worked as a programmer in medical research for several years, and one of the things that moved me on to graduate studies in statistics was the realization that doing my best work meant being as UN-clever as possible in my code. Frank's comments imply that he was reading SAS macro code at the moment of peak frustration. And if you want to criticise SAS code, this is the place to look. SAS macro started out as some simple expansions, then got added on to, then added on again, and again, and .... with no overall blueprint. It is much like the farmhouse of some neighbors of mine growing up: 4 different expansions in 4 eras, and no overall guiding plan. The interior layout was "interesting" to say the least. I was once a bona fide SAS 'wizard' (and Frank was much better than me), and I can't read the stuff without grinding my teeth. S was once headed down the same road. One of the best things ever with the language was documented in the blue book "The New S Language", where Becker et al had the wisdom to scrap the macro processor. Terry Therneau

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.

Confidentiality Statement:
This email message, including any attachments, is for th...{{dropped:6}}

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.





--
Frank E Harrell Jr   Professor and Chair           School of Medicine
                     Department of Biostatistics   Vanderbilt University

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to