Dear Toke and Davidphilip, Hi, The fieldtype text_fa has some custom language specific normalizer and charfilter, here is the schema.xml value related for this field: <fieldType name="text_fa" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <charFilter class="com.ictcert.lucene.analysis.fa.FarsiCharFilterFactory"/> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="com.ictcert.lucene.analysis.fa.FarsiNormalizationFilterFactory"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_fa.txt" /> </analyzer> <analyzer type="query"> <charFilter class="com.ictcert.lucene.analysis.fa.FarsiCharFilterFactory"/> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="com.ictcert.lucene.analysis.fa.FarsiNormalizationFilterFactory"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_fa.txt" /> </analyzer> </fieldType>
I did try the facet.method=enum and it works fine. Did you mean that actually applying facet on analyzed field is wrong? Best regards. On Mon, Jul 20, 2015 at 8:07 PM, Toke Eskildsen <t...@statsbiblioteket.dk> wrote: > Ali Nazemian <alinazem...@gmail.com> wrote: > > I have a collection of 1.6m documents in Solr 5.2.1. > > [...] > > Caused by: java.lang.IllegalStateException: Too many values for > > UnInvertedField faceting on field content > > [...] > > <field name="content" type="text_fa" stored="true" indexed="true" > > default="noval" termVectors="true" termPositions="true" > > termOffsets="true"/> > > You are hitting an internal limit in Solr. As davidphilip tells you, the > solution is docValues, but they cannot be enabled for text fields. You need > String fields, but the name of your field suggests that you need > analyzation & tokenization, which cannot be done on String fields. > > > Would you please help me to solve this problem? > > With the information we have, it does not seem to be easy to solve: It > seems like you want to facet on all terms in your index. As they need to be > String (to use docValues), you would have to do all the splitting on white > space, normalization etc. outside of Solr. > > - Toke Eskildsen > -- A.Nazemian