Re: [R] How to speed up R program

2016-03-14 Thread Amos B. Elberg
There's no way for anyone to be sure without looking directly at the code. As Jim said, the prime suspect is inefficient use of memory and excessive use of virtual memory. The book "Advanced R" by Hadley Wickham has a very detailed and easy to understand section on how to benchmark and optimize

[R] [R-pkgs] rZeppelin: An R notebook that makes Spark easy to use

2016-03-10 Thread Amos B. Elberg
rZeppelin is an R interpreter for Apache (incubating) Zeppelin. Zeppelin is a notebook, sort of like iPython, built on top of Apache Spark. rZeppelin makes it possible, for the first time, to create a single data/ML pipeline that mixes R, scala, and Python code, seamlessly, from a single interfac

[R] rZeppelin: Easy Spark for R Data Scientists

2016-01-04 Thread Amos B. Elberg
rZeppelin is an R Interpreter for the Apache (Incubating) Zeppelin project.   The intention of rZeppelin is to make it possible for regular R-using non-programmer to integrate the power of Spark, and the wide range of ML packages available for Python and scala, into their day-to-day toolbox — w

Re: [R] Problems with tm package, Removeword and trasformations

2015-02-10 Thread Amos B. Elberg
Trying to use t m to analyze tweets, you're going to experience a long stream of issues like the one you found, which generally relate to text formatting. I worked through them over the past few months for a project. If you email me offline I'll try to help and share some example code. > On Fe

Re: [R] Build a package on one Mac but unable to load this on another Mac

2014-10-18 Thread Amos B. Elberg
You could include just that library in your distribution, and change the dynamic link path using install_name_tool. I assume how to do so in detail is beyond the intended scope of this mailing list. > On Oct 18, 2014, at 11:42 AM, althu07 wrote: > > Hi, I build a package on one Mac. I can ins

Re: [R] Data formart

2014-10-09 Thread Amos B. Elberg
The code changes a single row; that's why I said you'd have to adapt it to your application. > On Oct 9, 2014, at 3:10 AM, Frederic Ntirenganya wrote: > > Hi Amos, > > This approach gives a column of NA. > >> On Wed, Oct 8, 2014 at 10:20 PM, Amos B. E

Re: [R] Data formart

2014-10-08 Thread Amos B. Elberg
You can adapt:   format(as.Date(paste(Year, sep = "-", ifelse(Year %% 4 == 0, Start, ifelse(Start > 59, Start - 1, Start))), "%Y-%j"), "%b %d”) But 60% of the dates in your data.frame will then be wrong.   From: Frederic Ntirenganya Reply: Frederic Ntirenganya > Date: October 8, 2014 at 9:38:3

Re: [R] Hadley's book: paper/PDF/etc. versus github

2014-10-06 Thread Amos B. Elberg
For me, it output html from rstudio complete with code etc., with no changes except a tinker to a call to setwd(), downloading some libraries, and no other intervention. I haven't explored the rmarkdown capabilities before, and I was just amazed. > On Oct 6, 2014, at 1:28 PM, Greg Snow <538.

Re: [R] Failure with .Rprofile on Mac OS X

2014-09-18 Thread Amos B. Elberg
://cran.r-project.org/bin/macosx/RMacOSX-FAQ.html#The-current-and-startup-working-directories > > > See ?Startup for more specifics that are generic to all R versions: > > > On Sep 18, 2014, at 7:04 PM, Amos B. Elberg wrote: > >> The only reason that *should* happen is if

Re: [R] Failure with .Rprofile on Mac OS X

2014-09-18 Thread Amos B. Elberg
The only reason that *should* happen is if there's an .Rprofile in the directory you're in when you start R. Where *exactly* is the .Rprofile file you want loaded, what directory are you starting from, and what does R say is the user's home directory? Did you make *any* changes to Rprofile.site

Re: [R] Question about searchTwitter{twitteR}

2014-09-07 Thread Amos B. Elberg
Twitter tweets aren't a stable database. I wouldn't expect the search results to stay stable, as tweets are retweeted, deleted, accounts are closed, privacy settings adjusted, etc. And if there are more than 1000 results, I don't know that twitter is internally ordered so you'd get the same set

Re: [R] Old g++ in Rtools

2014-08-06 Thread Amos B. Elberg
I’m working on a Mac so ymmv, but I’ve been running benchmarks of vanilla R from cran vs. recompiles with different versions of gcc and R, and I see a speedup of 20-30% vs the cran binary after recompiling with gnu 4.9.  Its quite distinct.   Each jump in gcc revision (4.7-4.8, 4.8-4.9) s