Thanks, Shawn. Here is the text_general type definition. We would like to bring the storage requirement for those 500KB content documents down to a minimum. We just need basic full-text search.
Thanks!!! :)

<fieldType name="text_general" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true"/>
    <!-- in this example, we will only use synonyms at query time
    <filter class="solr.SynonymFilterFactory" synonyms="index_synonyms.txt" ignoreCase="true" expand="false"/>
    -->
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true"/>
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

On Tue, Jul 2, 2013 at 11:35 AM, Shawn Heisey <s...@elyograg.org> wrote:

> On 7/2/2013 12:22 PM, Ali, Saqib wrote:
> > Newbie question:
> >
> > We have the following fields defined in the schema:
> >
> > <field name="content" type="text_general" indexed="true" stored="false"/>
> > <field name="teaser" type="text_general" indexed="false" stored="true"/>
> > <copyField source="content" dest="teaser" maxChars="80"/>
> >
> > The content field is about 500KB of data.
> >
> > My question is whether Solr stores the entire contents of that 500KB
> > content field?
> >
> > We want to minimize the stored data in the Solr index, that is why we
> > added the copyField teaser.
>
> With that config, the entire 500KB will not be _stored_ .. but it will
> affect the index size because you are indexing it. Exactly what degree
> that will be depends on the definition of the text_general type.
>
> Thanks,
> Shawn