Re: [R] R on Large Data Sets (again)
On 2009.11.29 14:24:40, Prof Brian Ripley wrote: > >> Windows 64-bit can certainly handle large memory spaces, but unless > >> something has changed recently it my understanding Revolution > >> Computing's 64-bit is the only 64-bit version of R available for > >> Windows (due to the unavailability of adequate open source compilers > >> for 64-bit Windows). So 64-bit R will need to be Revolution's > >> solution or a non-Windows platform. > > Or use a commercial Windows compiler. > > > It appears that GNU does have a project that has had some success at > > compiling 64 bit Windows applications: > > > > http://mingw-w64.sourceforge.net/ > > Well, some interesed people have a project to port GCC and binutils: > as far as I am aware that is not an official GNU project. > > > Not sure if all of the pieces are there for an R build, though. > > You are welcome to show us how to do it (on the R-devel list): several > people have spent man months attempting this (including submitting > many patches to that project), and the rw-FAQ did tell you do so in > http://cran.r-project.org/bin/windows/base/rw-FAQ.html#How-can-I-compile-R-from-source_003f Not a chance :) I got away from Windows 10 years ago for exactly these reasons. I was just trying help point a poor guy in the right direction. -- Jason W. Morgan Graduate Student Department of Political Science *The Ohio State University* 154 North Oval Mall Columbus, Ohio 43210 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] R on Large Data Sets (again)
On Sun, 29 Nov 2009, Jason Morgan wrote: On 2009.11.28 21:50:09, Daniel Nordlund wrote: - Is a Unix-like platform a better option than win-64? Again, would this solve my memory limitation problems? Possibly, but Win64 should provide plenty of memory (I believe Windows 7 Ultimate can use up to 192 GB of memory). You just have to find the system that can take that much... With Unix/Linux you can probably cut back some overhead, and the memory management is most likely better, but unless you need to go over 192GB of memory, you don't necessarily have to move to a different platform. ~Jason Windows 64-bit can certainly handle large memory spaces, but unless something has changed recently it my understanding Revolution Computing's 64-bit is the only 64-bit version of R available for Windows (due to the unavailability of adequate open source compilers for 64-bit Windows). So 64-bit R will need to be Revolution's solution or a non-Windows platform. Or use a commercial Windows compiler. It appears that GNU does have a project that has had some success at compiling 64 bit Windows applications: http://mingw-w64.sourceforge.net/ Well, some interesed people have a project to port GCC and binutils: as far as I am aware that is not an official GNU project. Not sure if all of the pieces are there for an R build, though. You are welcome to show us how to do it (on the R-devel list): several people have spent man months attempting this (including submitting many patches to that project), and the rw-FAQ did tell you do so in http://cran.r-project.org/bin/windows/base/rw-FAQ.html#How-can-I-compile-R-from-source_003f -- Jason W. Morgan Graduate Student Department of Political Science *The Ohio State University* 154 North Oval Mall Columbus, Ohio 43210 -- Brian D. Ripley, rip...@stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] R on Large Data Sets (again)
Jason Morgan wrote: On 2009.11.28 21:50:09, Daniel Nordlund wrote: - Is a Unix-like platform a better option than win-64? Again, would this solve my memory limitation problems? Possibly, but Win64 should provide plenty of memory (I believe Windows 7 Ultimate can use up to 192 GB of memory). You just have to find the system that can take that much... With Unix/Linux you can probably cut back some overhead, and the memory management is most likely better, but unless you need to go over 192GB of memory, you don't necessarily have to move to a different platform. ~Jason Windows 64-bit can certainly handle large memory spaces, but unless something has changed recently it my understanding Revolution Computing's 64-bit is the only 64-bit version of R available for Windows (due to the unavailability of adequate open source compilers for 64-bit Windows). So 64-bit R will need to be Revolution's solution or a non-Windows platform. It appears that GNU does have a project that has had some success at compiling 64 bit Windows applications: http://mingw-w64.sourceforge.net/ Not sure if all of the pieces are there for an R build, though. Last time we tried, it was not sufficient. Best wishes, Uwe Ligges __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] R on Large Data Sets (again)
On 2009.11.28 21:50:09, Daniel Nordlund wrote: > > > - Is a Unix-like platform a better option than win-64? Again, would > > > this solve my memory limitation problems? > > > > Possibly, but Win64 should provide plenty of memory (I believe Windows 7 > > Ultimate can use up to 192 GB of memory). You just have to find the > > system that can take that much... With Unix/Linux you can probably cut > > back some overhead, and the memory management is most likely better, but > > unless you need to go over 192GB of memory, you don't necessarily have > > to move to a different platform. > > > > ~Jason > > Windows 64-bit can certainly handle large memory spaces, but unless > something has changed recently it my understanding Revolution > Computing's 64-bit is the only 64-bit version of R available for > Windows (due to the unavailability of adequate open source compilers > for 64-bit Windows). So 64-bit R will need to be Revolution's > solution or a non-Windows platform. It appears that GNU does have a project that has had some success at compiling 64 bit Windows applications: http://mingw-w64.sourceforge.net/ Not sure if all of the pieces are there for an R build, though. -- Jason W. Morgan Graduate Student Department of Political Science *The Ohio State University* 154 North Oval Mall Columbus, Ohio 43210 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] R on Large Data Sets (again)
On 28/11/2009 6:53 PM, Lars Bishop wrote: Dear R users, I’ve search the R site for help on this topic but it is hard to find a precise answer for my questions. Which are the best options to overcome the RAM memory limitation problems when using R on “large” data sets (such as 2 or 3 million records)? There are several packages for handling datasets without keeping them in RAM: bigmemory, ff, etc. You may find that you need to write functions to handle your data a block at a time, or you may find they have already been written, e.g. biglm. You can also keep your data in a database and just retrieve it a block at a time for processing. - Is the free available version of R (as opposed to the one provided by REvolution Computing) compatible with a windows 64-bit machine? And if I increase the RAM memory enough on win-64, would this virtually solve my memory limitation problems? It is compatible with Win64, but it is a 32 bit application. It benefits from running on 64 bit Windows (because Windows can get out of the way and give it most of 4 GB to work in), but not as much as a true 64 bit application. So it probably doesn't solve your problem. - Is a Unix-like platform a better option than win-64? Again, would this solve my memory limitation problems? There are builds available for 64 bit Linux and MacOS (and maybe others); they'd likely help more than running 32 bit R in Win64. I don't know how they compare to running Revolution's 64 bit R in Win64. Duncan Murdoch - Any better option? Thanks in advance for your help, Lars. [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] R on Large Data Sets (again)
> -Original Message- > From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] > On Behalf Of Jason Morgan > Sent: Saturday, November 28, 2009 6:18 PM > To: Lars Bishop > Cc: r-help@r-project.org > Subject: Re: [R] R on Large Data Sets (again) > > Hello Lars, > > On 2009.11.28 18:53:09, Lars Bishop wrote: > > Dear R users, > > > > I?ve search the R site for help on this topic but it is hard to find a > > precise answer for my questions. > > > > Which are the best options to overcome the RAM memory limitation > problems > > when using R on ?large? data sets (such as 2 or 3 million records)? > > I think you'll have to provide a more precise definition of > "large"---are we talking 1 GB of records or 100 GB? Also, it would help > to know what you are trying to do with the data. The documentation for > the biglm and bigmemory packages may provide some help. > > > - Is the free available version of R (as opposed to the one provided > > by REvolution Computing) compatible with a windows 64-bit machine? > > And if I increase the RAM memory enough on win-64, would this > > virtually solve my memory limitation problems? > > I'm not familiar enough with the commercial version of R, but I do > believe it provides better support for parallelization, which may be of > some help. I don't think, however, that this version will "solve" your > problem. > > > - Is a Unix-like platform a better option than win-64? Again, would > > this solve my memory limitation problems? > > Possibly, but Win64 should provide plenty of memory (I believe Windows 7 > Ultimate can use up to 192 GB of memory). You just have to find the > system that can take that much... With Unix/Linux you can probably cut > back some overhead, and the memory management is most likely better, but > unless you need to go over 192GB of memory, you don't necessarily have > to move to a different platform. > > ~Jason Windows 64-bit can certainly handle large memory spaces, but unless something has changed recently it my understanding Revolution Computing's 64-bit is the only 64-bit version of R available for Windows (due to the unavailability of adequate open source compilers for 64-bit Windows). So 64-bit R will need to be Revolution's solution or a non-Windows platform. Hope this is helpful, Dan Daniel Nordlund Bothell, WA USA __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] R on Large Data Sets (again)
Hello Lars, On 2009.11.28 18:53:09, Lars Bishop wrote: > Dear R users, > > I?ve search the R site for help on this topic but it is hard to find a > precise answer for my questions. > > Which are the best options to overcome the RAM memory limitation problems > when using R on ?large? data sets (such as 2 or 3 million records)? I think you'll have to provide a more precise definition of "large"---are we talking 1 GB of records or 100 GB? Also, it would help to know what you are trying to do with the data. The documentation for the biglm and bigmemory packages may provide some help. > - Is the free available version of R (as opposed to the one provided > by REvolution Computing) compatible with a windows 64-bit machine? > And if I increase the RAM memory enough on win-64, would this > virtually solve my memory limitation problems? I'm not familiar enough with the commercial version of R, but I do believe it provides better support for parallelization, which may be of some help. I don't think, however, that this version will "solve" your problem. > - Is a Unix-like platform a better option than win-64? Again, would > this solve my memory limitation problems? Possibly, but Win64 should provide plenty of memory (I believe Windows 7 Ultimate can use up to 192 GB of memory). You just have to find the system that can take that much... With Unix/Linux you can probably cut back some overhead, and the memory management is most likely better, but unless you need to go over 192GB of memory, you don't necessarily have to move to a different platform. ~Jason -- Jason W. Morgan Graduate Student Department of Political Science *The Ohio State University* 154 North Oval Mall Columbus, Ohio 43210 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] R on Large Data Sets (again)
Dear R users, Ive search the R site for help on this topic but it is hard to find a precise answer for my questions. Which are the best options to overcome the RAM memory limitation problems when using R on large data sets (such as 2 or 3 million records)? - Is the free available version of R (as opposed to the one provided by REvolution Computing) compatible with a windows 64-bit machine? And if I increase the RAM memory enough on win-64, would this virtually solve my memory limitation problems? - Is a Unix-like platform a better option than win-64? Again, would this solve my memory limitation problems? - Any better option? Thanks in advance for your help, Lars. [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.