Dear Calum,
THanks for the simple and practical answer.
Thanking you,
Yours sincerely
AKSHAY M KULKARNI
________________________________
From: CALUM POLWART <[email protected]>
Sent: Thursday, December 12, 2024 4:22 AM
To: Sorkin, John <[email protected]>
Cc: akshay kulkarni <[email protected]>; R help Mailing list
<[email protected]>; [email protected] <[email protected]>; Ben
Bolker <[email protected]>
Subject: Re: [R] SQL and R
And to answer the dependency question.
Neither is dependent on the other. But both can be complimentary.
If you consider that SQL*may* be a route to accessing your data (if it's in a
database).
And R *may* be a route to analysis of the data.
If the data is in a CSV file, Excel file, API etc. you don't need SQL. IF it
is in a database, you might extract it to CSV etc. or you might directly access
it from R
If ALL you want is some simple number counts you can do that in SQL. NO NEED
FOR R. So for instance you could have a database for a warehouse. There could
be three tables in that database (you'll have far more).
Orders
Customers
Products
Products lists all your products - with a product code.
Customers lists all the customers with a customer code
And orders lists the customer code, date and product ordered.
SQL will let you create a "view" of that data that shows the customer address,
and product name.
SQL can also tell you how many of a product were ordered by postal area in a
month even though the data is not organised like that.
R can do that analysis too. But it could do far more statistical analysis and
say run a stats test to see if male customers are more likely to buy beer than
female. I don't think that's possible in SQL.
Sometimes you could just pull the whole database into R and analyse it. But if
you imagine that database was actually every sale made in Walmart in the last
10 years, the database is huge. It would be better to extract the beer sales
and males and females only which is hopefully smaller to analyse...
On Wed, 11 Dec 2024, 21:39 Sorkin, John,
<[email protected]<mailto:[email protected]>> wrote:
Dear Askay,
I believe my grey hair allows me to help answer your question. SQL, and its
progenitor SEQUEL, were developed specifically to manipulate relational
databases. It was developed in the early 1970s (equivalent to the historical
bronze age) when the concept of a relational database (see
https://en.wikipedia.org/wiki/Relational_database) and Codd's 12-rules were
being developed (see https://en.wikipedia.org/wiki/Codd%27s_12_rules)
At the time, the concept of a relation database and a programming language
dedicated to manipulating them was revolutionary. The concept was clearly
needed, important, and well used; a commercial version of SQL, Oracle, made
Larry Ellison more than a quarter billionaire.
S, one of the progenitors of R, was developed later. In 1975 by John Chambers,
Rick Becker, Trevor Hastie, and William Cleveland (all of whom, I believe
worked at Bell Labs) developed S as a general programming language. It was NOT
developed specifically for the manipulation of relational databases. S had
modest success in academia. S-Plus, a commercial version of R was developed
fairly recently in 1988 by a company Statistical Sciences. The founder of
Statistical Sciences was R. Douglas Marin who was a professor of statistics at
the University of Washington, Seattle.
S was also the progenitor of R. R was developed by Ross Ihaka and Robert
Gentlemen in 1993, faculty members of the University of Auckland. Given the
ubiquity of R in academia, it is clear that S, much like SQL has been
extraordinarily successful.
John
John David Sorkin M.D., Ph.D.
Professor of Medicine, University of Maryland School of Medicine;
Associate Director for Biostatistics and Informatics, Baltimore VA Medical
Center Geriatrics Research, Education, and Clinical Center;
PI Biostatistics and Informatics Core, University of Maryland School of
Medicine Claude D. Pepper Older Americans Independence Center;
Senior Statistician University of Maryland Center for Vascular Research;
Division of Gerontology and Paliative Care,
10 North Greene
Street<https://www.google.com/maps/search/10+North+Greene+Street?entry=gmail&source=g>
GRECC (BT/18/GR)
Baltimore, MD 21201-1524
Cell phone 443-418-5382
________________________________________
From: R-help
<[email protected]<mailto:[email protected]>> on behalf
of akshay kulkarni <[email protected]<mailto:[email protected]>>
Sent: Wednesday, December 11, 2024 8:16 AM
To: R help Mailing list
Subject: [R] SQL and R
dear Members,
I have recently started studying SQL and MySQL. My
question is, what exactly is SQL used for? That is, whatever can be done by
SQL, like subsetting and filtering of data sets, can also be done by R. What's,
then, the advantage of SQL? It is OK if you tag this question as offtopic, but
I could'nt find any info on the web. Can you please refer me
QL complement R? Are both dependent?
THanking you,
Yours sincerely,
AKSHAY M KULKARNI
[https://s-install.avcdn.net/ipm/preview/icons/icon-envelope-tick-round-orange-animated-no-repeat-v1.gif]<https://www.avast.com/sig-email?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail>
Virus-free.http://www.avast.com/<https://www.avast.com/sig-email?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail>
[[alternative HTML version deleted]]
______________________________________________
[email protected]<mailto:[email protected]> mailing list -- To
UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide https://www.r-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
______________________________________________
[email protected]<mailto:[email protected]> mailing list -- To
UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide https://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
[[alternative HTML version deleted]]
______________________________________________
[email protected] mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide https://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.