Re: Suggester not working for digit starting terms

2012-04-12 Thread jmlucjav
Well now I am really lost...

1. yes I want to suggest whole sentences too, I want the tokenizer to be
taken into account, and apparently it is working for me in 3.5.0?? I get
suggestions that are like foo bar abc.  Maybe what you mention is only for
file based dictionaries? I am using the field itself.

2. but for the digit issue, in that case nothing is suggested, not even the
term 500 that is there cause I can find it with this query
http://localhost:8983/solr/select/?q={!prefix f=a_suggest}500 

I tried to set threshold to 0 in case the term was being removed, and is not
that.

Moving to 3.6.0 is not a problem (I had already downloaded the rc actually)
but I still see weird things here.

xab

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Suggester-not-working-for-digit-starting-terms-tp3893433p3906303.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Suggester not working for digit starting terms

2012-04-12 Thread Robert Muir
On Thu, Apr 12, 2012 at 3:52 PM, jmlucjav jmluc...@gmail.com wrote:
 Well now I am really lost...

 1. yes I want to suggest whole sentences too, I want the tokenizer to be
 taken into account, and apparently it is working for me in 3.5.0?? I get
 suggestions that are like foo bar abc.  Maybe what you mention is only for
 file based dictionaries? I am using the field itself.

it doesnt use *JUST* your tokenizer. It splits and applies identifier
rules. Such identifier rules include things like, 'cannot start with a
digit'.

That's why i recommend you configure a SuggestQueryConverter so you
have complete control of what is going on rather than dealing with the
spellchecking one.


 Moving to 3.6.0 is not a problem (I had already downloaded the rc actually)
 but I still see weird things here.


installing 3.6 isnt going to do anything magical: as mentioned above
you have to configure the SuggestQueryConverter like the example in
the link if you want to have total control on how the input is treated
before going to the suggester.

-- 
lucidimagination.com


Re: Suggester not working for digit starting terms

2012-04-11 Thread Erick Erickson
Hmmm, I can't pursue this right now, anyone want to jump in?

Erick

On Tue, Apr 10, 2012 at 2:41 PM, jmlucjav jmluc...@gmail.com wrote:
 I have double checked and still get the same behaviour. My field is:
                fieldType name=simpletext class=solr.TextField
 positionIncrementGap=100
                        analyzer
                                charFilter 
 class=solr.MappingCharFilterFactory
 mapping=mapping-ISOLatin1Accent.txt/
                                tokenizer 
 class=solr.KeywordTokenizerFactory/
                                filter class=solr.LowerCaseFilterFactory/
                                filter class=solr.TrimFilterFactory /
                        /analyzer
                /fieldType

 Analisys shows numbers are there, for '500 $' I get as last step both in
 indexquery:

 org.apache.solr.analysis.TrimFilterFactory {luceneMatchVersion=LUCENE_35}
 position        1
 term text       500 $
 startOffset     0
 endOffset       5

 So I still see something going wrong here
 xab

 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/Suggester-not-working-for-digit-starting-terms-tp3893433p3900783.html
 Sent from the Solr - User mailing list archive at Nabble.com.


Re: Suggester not working for digit starting terms

2012-04-11 Thread jmlucjav
Just to be sure, reproduced this with example config from 3.5.

1. add to schema.xml
fieldType name=simpletext class=solr.TextField
positionIncrementGap=100
analyzer
charFilter 
class=solr.MappingCharFilterFactory
mapping=mapping-ISOLatin1Accent.txt/
tokenizer 
class=solr.KeywordTokenizerFactory/
filter class=solr.LowerCaseFilterFactory/
filter class=solr.TrimFilterFactory /
/analyzer
/fieldType
field name=a_suggest type=simpletext stored=true omitNorms=true
multiValued=true/
copyField source=* dest=a_suggest/

2 1. add to solrconfig.xml
searchComponent name=suggest class=solr.SpellCheckComponent
lst name=spellchecker
str name=namea_suggest/str
str 
name=classnameorg.apache.solr.spelling.suggest.Suggester/str
str
name=lookupImplorg.apache.solr.spelling.suggest.fst.FSTLookup/str
str name=fielda_suggest/str

str name=buildOnCommittrue/str
int name=weightBuckets100/int
/lst
/searchComponent
requestHandler name=/suggest class=solr.SearchHandler
lst name=defaults
str name=spellchecktrue/str
str name=spellcheck.dictionarya_suggest/str
str name=spellcheck.onlyMorePopulartrue/str
str name=spellcheck.count5/str
str name=spellcheck.collatetrue/str
/lst
arr name=components
strsuggest/str
/arr
/requestHandler
3. wipe data and undex sample docs
4. 
http://localhost:8983/solr/suggest?q=720debugQuery=true   --- 0 result
http://localhost:8983/solr/select/?q={!prefix%20f=a_suggest}720 ---
1 result


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Suggester-not-working-for-digit-starting-terms-tp3893433p3903790.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Suggester not working for digit starting terms

2012-04-11 Thread Robert Muir
On Wed, Apr 11, 2012 at 4:37 PM, jmlucjav jmluc...@gmail.com wrote:
 Just to be sure, reproduced this with example config from 3.5.


Regardless of your tokenizer, be aware that with this version of solr
its going to split up terms based on 'identifier rules' (including
splitting on whitespace).
This is because suggesters go thru the ordinary spellchecker framework.

If you are trying to autosuggest actual phrases, have a look at
http://wiki.apache.org/solr/Suggester#Tips_and_tricks
which describes how to set this up along with example configurations.
More information is available in
https://issues.apache.org/jira/browse/SOLR-3143

Essentially this provides a QueryConverter thats hopefully more
suitable for autosuggesters, it just passes the whole entire input to
your query analyzer,
and its your responsibility to do whatever you need there to extract
the 'meat' of the query for autosuggest. The example configuration
linked from the wiki page is just that and uses some regexps to try to
imitate what google's does (discarding operators like +/- but still
keeping the whole thing as a phrase).

You will need Solr 3.6 for this..., but its on its way out.

-- 
lucidimagination.com


Re: Suggester not working for digit starting terms

2012-04-10 Thread jmlucjav
I have double checked and still get the same behaviour. My field is:
fieldType name=simpletext class=solr.TextField
positionIncrementGap=100
analyzer
charFilter 
class=solr.MappingCharFilterFactory
mapping=mapping-ISOLatin1Accent.txt/
tokenizer 
class=solr.KeywordTokenizerFactory/
filter class=solr.LowerCaseFilterFactory/
filter class=solr.TrimFilterFactory /
/analyzer
/fieldType

Analisys shows numbers are there, for '500 $' I get as last step both in
indexquery:

org.apache.solr.analysis.TrimFilterFactory {luceneMatchVersion=LUCENE_35}
position1
term text   500 $
startOffset 0
endOffset   5

So I still see something going wrong here
xab

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Suggester-not-working-for-digit-starting-terms-tp3893433p3900783.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Suggester not working for digit starting terms

2012-04-09 Thread Erick Erickson
Is it possible that your fieldType definition for a_suggest is
stripping out the digits? Consider using TermsComponent
http://wiki.apache.org/solr/TermsComponent or the admin
page or Luke to examine the terms actually _in_ your
index. Or look at the admin/analysis page and give it some
sample input to determine what the results of the analysis
chain is

Best
Erick

On Sat, Apr 7, 2012 at 3:24 PM, jmlucjav jmluc...@gmail.com wrote:
 Hi,

 I am using Suggester component, as advised in Solr3 book (using solr3.5):
        searchComponent name=suggest class=solr.SpellCheckComponent
                lst name=spellchecker
                        str name=namea_suggest/str
                        str 
 name=classnameorg.apache.solr.spelling.suggest.Suggester/str
                        str
 name=lookupImplorg.apache.solr.spelling.suggest.fst.FSTLookup/str
                        str name=fielda_suggest/str
                        str name=buildOnCommittrue/str
                        int name=weightBuckets100/int
                /lst
        /searchComponent
        requestHandler name=/suggest class=solr.SearchHandler
                lst name=defaults
                        str name=spellchecktrue/str
                        str name=spellcheck.dictionarya_suggest/str
                        str name=spellcheck.onlyMorePopulartrue/str
                        str name=spellcheck.count5/str
                        str name=spellcheck.collatetrue/str
                /lst
                arr name=components
                        strsuggest/str
                /arr
        /requestHandler

 But, even if it works fine with words, it seems it does not work for terms
 starting with diggits. For example:
 http://localhost:8983/solr/suggest?q=500
 gets 0 results, but I know '500 $' is in the a_suggest field, as I can find
 many hits by:
 http://localhost:8983/solr/select/?q={!prefix f=a_suggest}500

 Am I missing something? I have been trying to play with
 spellcheck.onlyMorePopular and spellcheck.accuracy but I get the same
 results.

 thansk
 xab

 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/Suggester-not-working-for-digit-starting-terms-tp3893433p3893433.html
 Sent from the Solr - User mailing list archive at Nabble.com.


Suggester not working for digit starting terms

2012-04-07 Thread jmlucjav
Hi,

I am using Suggester component, as advised in Solr3 book (using solr3.5):
searchComponent name=suggest class=solr.SpellCheckComponent
lst name=spellchecker
str name=namea_suggest/str
str 
name=classnameorg.apache.solr.spelling.suggest.Suggester/str
str
name=lookupImplorg.apache.solr.spelling.suggest.fst.FSTLookup/str
str name=fielda_suggest/str
str name=buildOnCommittrue/str
int name=weightBuckets100/int
/lst
/searchComponent
requestHandler name=/suggest class=solr.SearchHandler
lst name=defaults
str name=spellchecktrue/str
str name=spellcheck.dictionarya_suggest/str
str name=spellcheck.onlyMorePopulartrue/str
str name=spellcheck.count5/str
str name=spellcheck.collatetrue/str
/lst
arr name=components
strsuggest/str
/arr
/requestHandler

But, even if it works fine with words, it seems it does not work for terms
starting with diggits. For example:
http://localhost:8983/solr/suggest?q=500
gets 0 results, but I know '500 $' is in the a_suggest field, as I can find
many hits by:
http://localhost:8983/solr/select/?q={!prefix f=a_suggest}500

Am I missing something? I have been trying to play with
spellcheck.onlyMorePopular and spellcheck.accuracy but I get the same
results.

thansk
xab

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Suggester-not-working-for-digit-starting-terms-tp3893433p3893433.html
Sent from the Solr - User mailing list archive at Nabble.com.