Re: Indexing nouns only with UIMA works - performance issue?

2013-02-05 Thread Tommaso Teofili
l.com] > Sent: Monday, February 04, 2013 2:47 PM > To: solr-user@lucene.apache.org > Subject: Re: Indexing nouns only with UIMA works - performance issue? > > see an example at > > http://svn.apache.org/viewvc/lucene/dev/branches/branch_4x/solr/contrib/uima/src/test-files/ui

RE: Indexing nouns only with UIMA works - performance issue?

2013-02-05 Thread Kai Gülzau
exing nouns only with UIMA works - performance issue? see an example at http://svn.apache.org/viewvc/lucene/dev/branches/branch_4x/solr/contrib/uima/src/test-files/uima/uima-tokenizers-schema.xml?view=diff&r1=1442116&r2=1442117&pathrev=1442117where the 'ngramsize' paramete

Re: Indexing nouns only with UIMA works - performance issue?

2013-02-04 Thread Tommaso Teofili
see an example at http://svn.apache.org/viewvc/lucene/dev/branches/branch_4x/solr/contrib/uima/src/test-files/uima/uima-tokenizers-schema.xml?view=diff&r1=1442116&r2=1442117&pathrev=1442117where the 'ngramsize' parameter is set, that's defined in AggregateSentenceAE.xml descriptor and is then set w

Re: Indexing nouns only with UIMA works - performance issue?

2013-02-04 Thread Tommaso Teofili
Regarding configuration parameters have a look at https://issues.apache.org/jira/browse/LUCENE-4749 Regards, Tommaso 2013/2/4 Tommaso Teofili > Thanks Kai for your feedback, I'll look into it and let you know. > Regards, > Tommaso > > > 2013/2/1 Kai Gülzau > >> I now use the "stupid" way to use

Re: Indexing nouns only with UIMA works - performance issue?

2013-02-04 Thread Tommaso Teofili
Thanks Kai for your feedback, I'll look into it and let you know. Regards, Tommaso 2013/2/1 Kai Gülzau > I now use the "stupid" way to use the german corpus for UIMA: copy + paste > :-) > > I modified the Tagger-2.3.1.jar/HmmTagger.xml to use the german corpus > ... > > file:german/TuebaMode