Thanks that worked. Also wanted to make data for my LDA. I am putting in space delimited documents to sequence analyzer is there a way i can verify the data produced by sequence2sparse script? Also I found a bug that seqDirectory script fails quitely at zero document sizes... Once i removed teh zero documents it worked. Someone should fix this or add as a command line option....
On Fri, Oct 22, 2010 at 7:20 AM, Robin Anil <[email protected]> wrote: > This could be because New lucene analyzer cannot be instantiated withzero > params in constructor via reflection. Drop the analyzer option. Mahout's > DefaultAnalyzer is a wrapper over StandardAnalyzer > > > On Fri, Oct 22, 2010 at 6:58 PM, Grant Ingersoll <[email protected] > >wrote: > > > Do you have the full stack trace? > > > > On Oct 22, 2010, at 6:59 AM, Sid wrote: > > > > > Hi Am i supposed to build mahout with lucene somehow i am getting : > > > > > >> > > >> Exception in thread "main" java.lang.InstantiationException: > > >> org.apache.lucene.analysis.standard.StandardAnalyzer > > >> What could be wrong? i am running hadoop 0.20.2 and mahout I checked > out > > >> from SVN a few days ago. So it is new. > > >> > > >> Thanks > > >> Sid > > >> > > > > > > > > > > > > -- > > > Sidharth Gupta > > > > > > 1249 E Spence Avenue > > > Tempe Az, 85281 > > > 480-307-5994 > > > > -------------------------- > > Grant Ingersoll > > http://www.lucidimagination.com > > > > > -- Sidharth Gupta 1249 E Spence Avenue Tempe Az, 85281 480-307-5994
