Re: [R] Error in Corpus() in tm package

2013-08-18 Thread Ajinkya Kale
at I have ended up debugging with sub sets of input one too many times. On Aug 18, 2013 9:01 AM, Milan Bouchet-Valat nalimi...@club.fr wrote: Le samedi 17 août 2013 à 11:16 -0700, Ajinkya Kale a écrit : It contains all text files which were converted from doc, docx, ppt etc. using libreoffice

Re: [R] Error in Corpus() in tm package

2013-08-17 Thread Ajinkya Kale
are causing it. On Sat, Aug 17, 2013 at 9:55 AM, Milan Bouchet-Valat nalimi...@club.frwrote: Le vendredi 16 août 2013 à 19:35 -0700, Ajinkya Kale a écrit : I am trying to use the text mining package ... I keep getting this error : rm(list=ls()) library(tm) sourceDir - Z

Re: [R] Error in Corpus() in tm package

2013-08-17 Thread Ajinkya Kale
initially it was. Also, I noticed that VectorSource works way more faster than having a DirSource there. Any particular reason ? On Sat, Aug 17, 2013 at 11:16 AM, Ajinkya Kale kaleajin...@gmail.comwrote: It contains all text files which were converted from doc, docx, ppt etc. using libreoffice. Some

Re: [R] Error in Corpus() in tm package

2013-08-17 Thread Ajinkya Kale
I think I know why it works faster, cause VectorSource in above code only takes the files names as a corpus and not the contents of the files :D duh! Any suggestions to create a vector source out of contents of the txt files ? On Sat, Aug 17, 2013 at 1:59 PM, Ajinkya Kale kaleajin...@gmail.com

[R] Error in Corpus() in tm package

2013-08-16 Thread Ajinkya Kale
I am trying to use the text mining package ... I keep getting this error : rm(list=ls()) library(tm) sourceDir - Z:\\projectk_viz\\docs_to_index ovid - Corpus(DirSource(sourceDir),readerControl = list(language = lat)) Error in if (vectorized (length = 0)) stop(vectorized sources must have