You're probably hitting the default limit on a field. This is set in solrconfig.xml, in the <maxFieldLength> element. Anything past that limit is not indexed at all, which would explain why text from later in the document can't be found. The first thing I'd try is raising that to, say, 10000000, then reindexing to see if that fixes your problem. Note that this is the number of *tokens*, not characters. Roughly the number of words...
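
To illustrate why a token limit would produce this symptom, here is a rough Python sketch. It is not Solr code: it assumes plain whitespace tokenization, and it assumes a limit of 10000 tokens (which I believe is what the example solrconfig.xml ships with, but check your own). The helper name index_terms is made up for the illustration.

```python
def index_terms(text, max_field_length=10000):
    """Return the set of terms that survive truncation.

    Hypothetical helper: splits on whitespace and keeps only the
    first max_field_length tokens, mimicking a per-field token limit.
    """
    tokens = text.lower().split()
    return set(tokens[:max_field_length])


# Stand-in document: 12,000 filler tokens, then one rare word at the end.
document = " ".join(["that"] * 12000 + ["handymen"])

truncated = index_terms(document)            # assumed default limit
raised = index_terms(document, 10_000_000)   # limit suggested above

print("handymen" in truncated)  # False: the word lies past the cutoff
print("handymen" in raised)     # True: the whole document is indexed
```

The point is only that a word sitting past the cutoff never makes it into the index, so no query can match it until the limit is raised and the document is reindexed.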
Searching for the common word is probably a complete red herring.

Best
Erick

On Wed, Dec 21, 2011 at 4:36 PM, Bent Jensen <bentjen...@yahoo.com> wrote:
> Being new to xml/xslt/solr, I am hoping someone can explain/help me with the
> following:
>
> Using Apache-Solr 3.4.0. I have a php page for submitting the search and
> displaying the result in html. I indexed a 1.5MB pdf document (400
> pages). Using the admin interface with a *:* query, everything is returned.
>
> I then tried using 'highlighting' in the query, and modified the xsl file to
> return the highlighting.
>
> It works fine for the text in the beginning of the document. I can also
> query with a phrase between " " and it returns the exact match.
>
> When searching content approx. beyond the first 100 pages, I see this
> behavior:
>
> I must include common words in a phrase to get a result returned. For
> example, if I search using the word handymen, which only appears in one place
> towards the end of the document, nothing is returned, but if I add a common
> word that appears in the sentence where handymen is, e.g. 'handymen that',
> then both are returned in the highlighting, including many other occurrences
> of 'that'. If I query with "handymen that", nothing is returned.
>
> thanks
>
> Ben