Edwin,
The spellcheck is a thing, the Suggester is another.

If you need to provide auto suggestion to your users, the suggester is the
right thing to use.
But I really doubt to be useful to select as a suggester field the entire
content.
it is going to be quite expensive.

In the case I would again really suggest you to take a look to the article
I quoted and Solr generic documentation.

It is possible to suggest part of the field.
You can use the FreeText suggester with a proper analysis selected.

Cheers

2015-06-17 6:14 GMT+01:00 Zheng Lin Edwin Yeo <edwinye...@gmail.com>:

> Yes I've looked at that before, but I was told that the newer version of
> Solr has its own suggester, and does not need to use spellchecker anymore?
>
> So it's not necessary to use the spellechecker inside suggester anymore?
>
> Regards,
> Edwin
>
>
> On 17 June 2015 at 11:56, Erick Erickson <erickerick...@gmail.com> wrote:
>
> > Have you looked at spellchecker? Because that sound much more like
> > what you're asking about than suggester.
> >
> > Spell checking is more what you're asking for, have you even looked at
> that
> > after it was suggested?
> >
> > bq: Also, when I do a search, it shouldn't be returning whole fields,
> > but just to return a portion of the sentence
> >
> > This is what highlighting is built for.
> >
> > Really, I recommend you take the time to do some familiarization with the
> > whole search space and Solr. The excellent book here:
> >
> >
> >
> http://www.amazon.com/Solr-Action-Trey-Grainger/dp/1617291021/ref=sr_1_1?ie=UTF8&qid=1434513284&sr=8-1&keywords=apache+solr&pebp=1434513287267&perid=0YRK508J0HJ1N3BAX20E
> >
> > will give you the grounding you need to get the most out of Solr.
> >
> > Best,
> > Erick
> >
> > On Tue, Jun 16, 2015 at 8:27 PM, Zheng Lin Edwin Yeo
> > <edwinye...@gmail.com> wrote:
> > > The long content is from when I tried to index PDF files. As some PDF
> > files
> > > has alot of words in the content, it will lead to the *UTF8 encoding is
> > > longer than the max length 32766 error.*
> > >
> > > I think the problem is the content size of the PDF file exceed 32766
> > > characters?
> > >
> > > I'm trying to accomplish to be able to index documents that can be of
> any
> > > size (even those with very large contents), and build the suggester
> from
> > > there. Also, when I do a search, it shouldn't be returning whole
> fields,
> > > but just to return a portion of the sentence.
> > >
> > >
> > >
> > > Regards,
> > > Edwin
> > >
> > >
> > > On 16 June 2015 at 23:02, Erick Erickson <erickerick...@gmail.com>
> > wrote:
> > >
> > >> The suggesters are built to return whole fields. You _might_
> > >> be able to add multiple fragments to a multiValued
> > >> entry and get fragments, I haven't tried that though
> > >> and I suspect that actually you'd get the same thing..
> > >>
> > >> This is an XY problem IMO. Please describe exactly what
> > >> you're trying to accomplish, with examples rather than
> > >> continue to pursue this path. It sounds like you want
> > >> spellcheck or similar. The _point_ behind the
> > >> suggesters is that they handle multiple-word suggestions
> > >> by returning he whole field. So putting long text fields
> > >> into them is not going to work.
> > >>
> > >> Best,
> > >> Erick
> > >>
> > >> On Tue, Jun 16, 2015 at 1:46 AM, Alessandro Benedetti
> > >> <benedetti.ale...@gmail.com> wrote:
> > >> > in line :
> > >> >
> > >> > 2015-06-16 4:43 GMT+01:00 Zheng Lin Edwin Yeo <edwinye...@gmail.com
> >:
> > >> >
> > >> >> Thanks Benedetti,
> > >> >>
> > >> >> I've change to the AnalyzingInfixLookup approach, and it is able to
> > >> start
> > >> >> searching from the middle of the field.
> > >> >>
> > >> >> However, is it possible to make the suggester to show only part of
> > the
> > >> >> content of the field (like 2 or 3 fields after), instead of the
> > entire
> > >> >> content/sentence, which can be quite long?
> > >> >>
> > >> >
> > >> > I assume you use "fields" in the place of tokens.
> > >> > The answer is yes, I already said that in my previous mail, I invite
> > you
> > >> to
> > >> > read carefully the answers and the documentation linked !
> > >> >
> > >> > Related the excessive dimensions of tokens. This is weird, what are
> > you
> > >> > trying to autocomplete ?
> > >> > I really doubt would be useful for a user to see super long auto
> > >> completed
> > >> > terms.
> > >> >
> > >> > Cheers
> > >> >
> > >> >>
> > >> >>
> > >> >> Regards,
> > >> >> Edwin
> > >> >>
> > >> >>
> > >> >>
> > >> >> On 15 June 2015 at 17:33, Alessandro Benedetti <
> > >> benedetti.ale...@gmail.com
> > >> >> >
> > >> >> wrote:
> > >> >>
> > >> >> > ehehe Edwin, I think you should read again the document I linked
> > time
> > >> >> ago :
> > >> >> >
> > >> >> > http://lucidworks.com/blog/solr-suggester/
> > >> >> >
> > >> >> > The suggester you used is not meant to provide infix suggestions.
> > >> >> > The fuzzy suggester is working on a fuzzy basis , with the
> > *starting*
> > >> >> terms
> > >> >> > of a field content.
> > >> >> >
> > >> >> > What you are looking for is actually one of the Infix Suggesters.
> > >> >> > For example the AnalyzingInfixLookup approach.
> > >> >> >
> > >> >> > When working with Suggesters is important first to make a
> > distinction
> > >> :
> > >> >> >
> > >> >> > 1) Returning the full content of the field ( analysisInfix or
> > Fuzzy)
> > >> >> >
> > >> >> > 2) Returning token(s) ( Free Text Suggester)
> > >> >> >
> > >> >> > Then the second difference is :
> > >> >> >
> > >> >> > 1) Infix suggestions ( from the "middle" of the field content)
> > >> >> > 2) Classic suggester ( from the beginning of the field content)
> > >> >> >
> > >> >> > Clarified that, will be quite simple to work with suggesters.
> > >> >> >
> > >> >> > Cheers
> > >> >> >
> > >> >> > 2015-06-15 9:28 GMT+01:00 Zheng Lin Edwin Yeo <
> > edwinye...@gmail.com>:
> > >> >> >
> > >> >> > > I've indexed a rich-text documents with the following content:
> > >> >> > >
> > >> >> > > This is a testing rich text documents to test the uploading of
> > >> files to
> > >> >> > > Solr
> > >> >> > >
> > >> >> > >
> > >> >> > > When I tried to use the suggestion, it return me the entire
> > field in
> > >> >> the
> > >> >> > > content once I enter suggest?q=t. However, when I tried to
> search
> > >> for
> > >> >> > > q='rich', I don't get any results returned.
> > >> >> > >
> > >> >> > > This is my current configuration for the suggester:
> > >> >> > > <searchComponent name="suggest" class="solr.SuggestComponent">
> > >> >> > >   <lst name="suggester">
> > >> >> > > <str name="name">mySuggester</str>
> > >> >> > > <str name="lookupImpl">FuzzyLookupFactory</str>
> > >> >> > > <str name="dictionaryImpl">DocumentDictionaryFactory</str>
> > >> >> > > <str name="field">Suggestion</str>
> > >> >> > > <str name="suggestAnalyzerFieldType">suggestType</str>
> > >> >> > > <str name="buildOnStartup">true</str>
> > >> >> > > <str name="buildOnCommit">false</str>
> > >> >> > >   </lst>
> > >> >> > > </searchComponent>
> > >> >> > >
> > >> >> > > <requestHandler name="/suggest" class="solr.SearchHandler"
> > >> >> > startup="lazy" >
> > >> >> > >   <lst name="defaults">
> > >> >> > >     <str name="wt">json</str>
> > >> >> > >         <str name="indent">true</str>
> > >> >> > >
> > >> >> > > <str name="suggest">true</str>
> > >> >> > > <str name="suggest.count">10</str>
> > >> >> > > <str name="suggest.dictionary">mySuggester</str>
> > >> >> > >   </lst>
> > >> >> > >   <arr name="components">
> > >> >> > > <str>suggest</str>
> > >> >> > >   </arr>
> > >> >> > > </requestHandler>
> > >> >> > >
> > >> >> > > Is it possible to allow the suggester to return something even
> > from
> > >> the
> > >> >> > > middle of the sentence, and also not to return the entire
> > sentence
> > >> if
> > >> >> the
> > >> >> > > sentence. Perhaps it should just suggest the next 2 or 3
> fields,
> > >> and to
> > >> >> > > return more fields as the users type.
> > >> >> > >
> > >> >> > > For example,
> > >> >> > > When user type 'this', it should return 'This is a testing'
> > >> >> > > When user type 'this is a testing', it should return 'This is a
> > >> testing
> > >> >> > > rich text documents'.
> > >> >> > >
> > >> >> > >
> > >> >> > > Regards,
> > >> >> > > Edwin
> > >> >> > >
> > >> >> >
> > >> >> >
> > >> >> >
> > >> >> > --
> > >> >> > --------------------------
> > >> >> >
> > >> >> > Benedetti Alessandro
> > >> >> > Visiting card : http://about.me/alessandro_benedetti
> > >> >> >
> > >> >> > "Tyger, tyger burning bright
> > >> >> > In the forests of the night,
> > >> >> > What immortal hand or eye
> > >> >> > Could frame thy fearful symmetry?"
> > >> >> >
> > >> >> > William Blake - Songs of Experience -1794 England
> > >> >> >
> > >> >>
> > >> >
> > >> >
> > >> >
> > >> > --
> > >> > --------------------------
> > >> >
> > >> > Benedetti Alessandro
> > >> > Visiting card : http://about.me/alessandro_benedetti
> > >> >
> > >> > "Tyger, tyger burning bright
> > >> > In the forests of the night,
> > >> > What immortal hand or eye
> > >> > Could frame thy fearful symmetry?"
> > >> >
> > >> > William Blake - Songs of Experience -1794 England
> > >>
> >
>



-- 
--------------------------

Benedetti Alessandro
Visiting card : http://about.me/alessandro_benedetti

"Tyger, tyger burning bright
In the forests of the night,
What immortal hand or eye
Could frame thy fearful symmetry?"

William Blake - Songs of Experience -1794 England

Reply via email to