Hello, I'm doing some analysis on a rather large data set. In this case, some simple commands are failing. For example, this one:
x$eventtype <- factor(x$eventtype)
Error in unique.default(x) : length 1093574297 is too large for hashing ...I think this is a bug, because "hashing" should not be required for the "factor" function. Am I right? The whole column does not need to be hashed, only the unique keys. Sure, there is the potential to overflow the key register, but this error should be thrown only if that occurs, no? Cordially, Adam D. I. Kramer, Ph.D. Data Scientist, Facebook, Inc. akra...@fb.com ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.