Re: Auto Complete

2014-08-06 Thread Michael Della Bitta
You'd still need to modify that schema to use the ASCII folding filter.

Alternatively, if you want something off the shelf, you might check out
Sematext's autocomplete product:
http://www.sematext.com/products/autocomplete/index.html

Michael Della Bitta

Applications Developer

o: +1 646 532 3062

appinions inc.

“The Science of Influence Marketing”

18 East 41st Street

New York, NY 10017

t: @appinions  | g+:
plus.google.com/appinions

w: appinions.com 


On Wed, Aug 6, 2014 at 10:56 AM, benjelloun  wrote:

> Hello thanks for the tutorial i test all schema but its not what i need.
> what i need is to auto complete with an autocorrection like i said before:
> q="gene" -->autocomplete "genève" with accent
>
>
> 2014-08-05 18:03 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
> ml-node+s472066n4151261...@n3.nabble.com>:
>
> > In this case, I recommend using the approach that this tutorial uses:
> >
> >
> http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/
> >
> > Basically the idea is you index the data a few different ways and then
> use
> > edismax to query them all with different boosts. You'd use the stored
> > version of you field for display, so your accented characters would not
> > get
> > stripped.
> >
> > Michael Della Bitta
> >
> > Applications Developer
> >
> > o: +1 646 532 3062
> >
> > appinions inc.
> >
> > “The Science of Influence Marketing”
> >
> > 18 East 41st Street
> >
> > New York, NY 10017
> >
> > t: @appinions  | g+:
> > plus.google.com/appinions
> > <
> >
> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
> >
> >
> > w: appinions.com 
> >
> >
> > On Tue, Aug 5, 2014 at 9:32 AM, benjelloun <[hidden email]
> > > wrote:
> >
> > > yeah thats true i creat this index just for auto complete
> > > here is my schema:
> > >
> > >  > > required="false" multiValued="true"/>
> > >  > > required="false" multiValued="true"/>
> > >  > > required="false" multiValued="true"/>
> > >
> > > 
> > > 
> > > 
> > >
> > > the i use "suggestField" for autocomplet like i mentioned above
> > > do you have any other configuration which can do what i need ?
> > >
> > >
> > >
> > > 2014-08-05 15:19 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
> > > [hidden email]  >>:
> > >
> > > > Unless I'm mistaken, it seems like you've created this index
> > specifically
> > > > for autocomplete? Or is this index used for general search also?
> > > >
> > > > The easy way to understand this question: Is there one entry in your
> > > index
> > > > for each term you want to autocomplete? Or are there multiple entries
> > > that
> > > > might contain the same term?
> > > >
> > > > Michael Della Bitta
> > > >
> > > > Applications Developer
> > > >
> > > > o: +1 646 532 3062
> > > >
> > > > appinions inc.
> > > >
> > > > “The Science of Influence Marketing”
> > > >
> > > > 18 East 41st Street
> > > >
> > > > New York, NY 10017
> > > >
> > > > t: @appinions  | g+:
> > > > plus.google.com/appinions
> > > > <
> > > >
> > >
> >
> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
> > > >
> > > >
> > > > w: appinions.com 
> > > >
> > > >
> > > > On Tue, Aug 5, 2014 at 9:10 AM, benjelloun <[hidden email]
> > > > > wrote:
> > > >
> > > > > hello,
> > > > >
> > > > > did you find any solution to this problem ?
> > > > >
> > > > > regards
> > > > >
> > > > >
> > > > > 2014-08-04 16:16 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
> > > > > [hidden email] <
> http://user/SendEmail.jtp?type=node&node=4151216&i=1
> >
> > > >>:
> > > > >
> > > > > > How are you implementing autosuggest? I'm assuming you're
> querying
> > an
> > > > > > indexed field and getting a stored value back. But there are a
> > wide
> > > > > > variety
> > > > > > of ways of doing it.
> > > > > >
> > > > > > Michael Della Bitta
> > > > > >
> > > > > > Applications Developer
> > > > > >
> > > > > > o: +1 646 532 3062
> > > > > >
> > > > > > appinions inc.
> > > > > >
> > > > > > “The Science of Influence Marketing”
> > > > > >
> > > > > > 18 East 41st Street
> > > > > >
> > > > > > New York, NY 10017
> > > > > >
> > > > > > t: @appinions  | g+:
> > > > > > plus.google.com/appinions
> > > > > > <
> > > > > >
> > > > >
> > > >
> > >
> >
> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
> > > > > >
> > > > > >
> > > > > > w: appinions.com 
> > > > > >
> > > > > >
> > > > > > On Mon, Aug 4, 2014 at 10:10 AM, benjelloun <[hidden email]
> > > > > > 

Re: Auto Complete

2014-08-06 Thread benjelloun
Hello thanks for the tutorial i test all schema but its not what i need.
what i need is to auto complete with an autocorrection like i said before:
q="gene" -->autocomplete "genève" with accent


2014-08-05 18:03 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
ml-node+s472066n4151261...@n3.nabble.com>:

> In this case, I recommend using the approach that this tutorial uses:
>
> http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/
>
> Basically the idea is you index the data a few different ways and then use
> edismax to query them all with different boosts. You'd use the stored
> version of you field for display, so your accented characters would not
> get
> stripped.
>
> Michael Della Bitta
>
> Applications Developer
>
> o: +1 646 532 3062
>
> appinions inc.
>
> “The Science of Influence Marketing”
>
> 18 East 41st Street
>
> New York, NY 10017
>
> t: @appinions  | g+:
> plus.google.com/appinions
> <
> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts>
>
> w: appinions.com 
>
>
> On Tue, Aug 5, 2014 at 9:32 AM, benjelloun <[hidden email]
> > wrote:
>
> > yeah thats true i creat this index just for auto complete
> > here is my schema:
> >
> >  > required="false" multiValued="true"/>
> >  > required="false" multiValued="true"/>
> >  > required="false" multiValued="true"/>
> >
> > 
> > 
> > 
> >
> > the i use "suggestField" for autocomplet like i mentioned above
> > do you have any other configuration which can do what i need ?
> >
> >
> >
> > 2014-08-05 15:19 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
> > [hidden email] >:
> >
> > > Unless I'm mistaken, it seems like you've created this index
> specifically
> > > for autocomplete? Or is this index used for general search also?
> > >
> > > The easy way to understand this question: Is there one entry in your
> > index
> > > for each term you want to autocomplete? Or are there multiple entries
> > that
> > > might contain the same term?
> > >
> > > Michael Della Bitta
> > >
> > > Applications Developer
> > >
> > > o: +1 646 532 3062
> > >
> > > appinions inc.
> > >
> > > “The Science of Influence Marketing”
> > >
> > > 18 East 41st Street
> > >
> > > New York, NY 10017
> > >
> > > t: @appinions  | g+:
> > > plus.google.com/appinions
> > > <
> > >
> >
> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
> > >
> > >
> > > w: appinions.com 
> > >
> > >
> > > On Tue, Aug 5, 2014 at 9:10 AM, benjelloun <[hidden email]
> > > > wrote:
> > >
> > > > hello,
> > > >
> > > > did you find any solution to this problem ?
> > > >
> > > > regards
> > > >
> > > >
> > > > 2014-08-04 16:16 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
> > > > [hidden email] 
> > >>:
> > > >
> > > > > How are you implementing autosuggest? I'm assuming you're querying
> an
> > > > > indexed field and getting a stored value back. But there are a
> wide
> > > > > variety
> > > > > of ways of doing it.
> > > > >
> > > > > Michael Della Bitta
> > > > >
> > > > > Applications Developer
> > > > >
> > > > > o: +1 646 532 3062
> > > > >
> > > > > appinions inc.
> > > > >
> > > > > “The Science of Influence Marketing”
> > > > >
> > > > > 18 East 41st Street
> > > > >
> > > > > New York, NY 10017
> > > > >
> > > > > t: @appinions  | g+:
> > > > > plus.google.com/appinions
> > > > > <
> > > > >
> > > >
> > >
> >
> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
> > > > >
> > > > >
> > > > > w: appinions.com 
> > > > >
> > > > >
> > > > > On Mon, Aug 4, 2014 at 10:10 AM, benjelloun <[hidden email]
> > > > > > wrote:
> > > > >
> > > > > > hello you didnt enderstand well my probleme,
> > > > > >
> > > > > > i give exemple: i have document contain "genève" with accent
> > > > > > when i do q="gene" --> autoSuggest "geneve" because of
> > > > > > ASCIIFoldingFilterFactory preserveOriginal="true"
> > > > > > when i do q="genè" --> autoSuggest "genève"
> > > > > > but what i need to is:
> > > > > > q="gene" without accent and get this result: "genève" with
> accent
> > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > View this message in context:
> > > > > >
> > > >
> > http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150989.html
> > > > >
> > > > > > Sent from the Solr - User mailing list archive at Nabble.com.
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > >  If you reply to this email, your message will be added to the
> > > discussion
> > > > > below:
> > > > >
> > >
> http://lucene.472066.n3.nabble.com/Au

Re: Auto Complete

2014-08-05 Thread Michael Della Bitta
In this case, I recommend using the approach that this tutorial uses:

http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/

Basically the idea is you index the data a few different ways and then use
edismax to query them all with different boosts. You'd use the stored
version of you field for display, so your accented characters would not get
stripped.

Michael Della Bitta

Applications Developer

o: +1 646 532 3062

appinions inc.

“The Science of Influence Marketing”

18 East 41st Street

New York, NY 10017

t: @appinions  | g+:
plus.google.com/appinions

w: appinions.com 


On Tue, Aug 5, 2014 at 9:32 AM, benjelloun  wrote:

> yeah thats true i creat this index just for auto complete
> here is my schema:
>
>  required="false" multiValued="true"/>
>  required="false" multiValued="true"/>
>  required="false" multiValued="true"/>
>
> 
> 
> 
>
> the i use "suggestField" for autocomplet like i mentioned above
> do you have any other configuration which can do what i need ?
>
>
>
> 2014-08-05 15:19 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
> ml-node+s472066n4151216...@n3.nabble.com>:
>
> > Unless I'm mistaken, it seems like you've created this index specifically
> > for autocomplete? Or is this index used for general search also?
> >
> > The easy way to understand this question: Is there one entry in your
> index
> > for each term you want to autocomplete? Or are there multiple entries
> that
> > might contain the same term?
> >
> > Michael Della Bitta
> >
> > Applications Developer
> >
> > o: +1 646 532 3062
> >
> > appinions inc.
> >
> > “The Science of Influence Marketing”
> >
> > 18 East 41st Street
> >
> > New York, NY 10017
> >
> > t: @appinions  | g+:
> > plus.google.com/appinions
> > <
> >
> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
> >
> >
> > w: appinions.com 
> >
> >
> > On Tue, Aug 5, 2014 at 9:10 AM, benjelloun <[hidden email]
> > > wrote:
> >
> > > hello,
> > >
> > > did you find any solution to this problem ?
> > >
> > > regards
> > >
> > >
> > > 2014-08-04 16:16 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
> > > [hidden email]  >>:
> > >
> > > > How are you implementing autosuggest? I'm assuming you're querying an
> > > > indexed field and getting a stored value back. But there are a wide
> > > > variety
> > > > of ways of doing it.
> > > >
> > > > Michael Della Bitta
> > > >
> > > > Applications Developer
> > > >
> > > > o: +1 646 532 3062
> > > >
> > > > appinions inc.
> > > >
> > > > “The Science of Influence Marketing”
> > > >
> > > > 18 East 41st Street
> > > >
> > > > New York, NY 10017
> > > >
> > > > t: @appinions  | g+:
> > > > plus.google.com/appinions
> > > > <
> > > >
> > >
> >
> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
> > > >
> > > >
> > > > w: appinions.com 
> > > >
> > > >
> > > > On Mon, Aug 4, 2014 at 10:10 AM, benjelloun <[hidden email]
> > > > > wrote:
> > > >
> > > > > hello you didnt enderstand well my probleme,
> > > > >
> > > > > i give exemple: i have document contain "genève" with accent
> > > > > when i do q="gene" --> autoSuggest "geneve" because of
> > > > > ASCIIFoldingFilterFactory preserveOriginal="true"
> > > > > when i do q="genè" --> autoSuggest "genève"
> > > > > but what i need to is:
> > > > > q="gene" without accent and get this result: "genève" with accent
> > > > >
> > > > >
> > > > >
> > > > > --
> > > > > View this message in context:
> > > > >
> > >
> http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150989.html
> > > >
> > > > > Sent from the Solr - User mailing list archive at Nabble.com.
> > > > >
> > > >
> > > >
> > > > --
> > > >  If you reply to this email, your message will be added to the
> > discussion
> > > > below:
> > > >
> > http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150990.html
> > > >  To unsubscribe from Auto Complete, click here
> > > > <
> > >
> > >
> > > > .
> > > > NAML
> > > > <
> > >
> >
> http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml
> >
> > > >
> > > >
> > >
> > >
> > >
> > >
> > > --
> > > View this message in context:
> > >
> http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4151211.html
> >
> > > Sent from the Solr - User maili

Re: Auto Complete

2014-08-05 Thread benjelloun
i found this solution but when i test it nothing in suggestion



  fuzzySuggest
  org.apache.solr.spelling.suggest.Suggester
  org.apache.solr.spelling.suggest.fst.FuzzyLookupFactory
  suggestField
  suggestFolders
  true
  true
  texts
  false
  2
  suggestFolders/fuzzysuggest.txt
  
phrase_suggest
  

  

  fuzzySuggest
  true
  fuzzySuggest
  true
  10
  true
  10
  true


  fuzzySuggest

  


2014-08-05 15:32 GMT+02:00 anass benjelloun :

> yeah thats true i creat this index just for auto complete
> here is my schema:
>
>  required="false" multiValued="true"/>
>  required="false" multiValued="true"/>
>  required="false" multiValued="true"/>
>
> 
> 
> 
>
> the i use "suggestField" for autocomplet like i mentioned above
> do you have any other configuration which can do what i need ?
>
>
>
> 2014-08-05 15:19 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
> ml-node+s472066n4151216...@n3.nabble.com>:
>
>  Unless I'm mistaken, it seems like you've created this index specifically
>> for autocomplete? Or is this index used for general search also?
>>
>> The easy way to understand this question: Is there one entry in your
>> index
>> for each term you want to autocomplete? Or are there multiple entries
>> that
>> might contain the same term?
>>
>> Michael Della Bitta
>>
>> Applications Developer
>>
>> o: +1 646 532 3062
>>
>> appinions inc.
>>
>> “The Science of Influence Marketing”
>>
>> 18 East 41st Street
>>
>> New York, NY 10017
>>
>> t: @appinions  | g+:
>> plus.google.com/appinions
>> <
>> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts>
>>
>> w: appinions.com 
>>
>>
>> On Tue, Aug 5, 2014 at 9:10 AM, benjelloun <[hidden email]
>> > wrote:
>>
>> > hello,
>> >
>> > did you find any solution to this problem ?
>> >
>> > regards
>> >
>> >
>> > 2014-08-04 16:16 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
>> > [hidden email] >:
>>
>> >
>> > > How are you implementing autosuggest? I'm assuming you're querying an
>> > > indexed field and getting a stored value back. But there are a wide
>> > > variety
>> > > of ways of doing it.
>> > >
>> > > Michael Della Bitta
>> > >
>> > > Applications Developer
>> > >
>> > > o: +1 646 532 3062
>> > >
>> > > appinions inc.
>> > >
>> > > “The Science of Influence Marketing”
>> > >
>> > > 18 East 41st Street
>> > >
>> > > New York, NY 10017
>> > >
>> > > t: @appinions  | g+:
>> > > plus.google.com/appinions
>> > > <
>> > >
>> >
>> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
>> > >
>> > >
>> > > w: appinions.com 
>> > >
>> > >
>> > > On Mon, Aug 4, 2014 at 10:10 AM, benjelloun <[hidden email]
>> > > > wrote:
>> > >
>> > > > hello you didnt enderstand well my probleme,
>> > > >
>> > > > i give exemple: i have document contain "genève" with accent
>> > > > when i do q="gene" --> autoSuggest "geneve" because of
>> > > > ASCIIFoldingFilterFactory preserveOriginal="true"
>> > > > when i do q="genè" --> autoSuggest "genève"
>> > > > but what i need to is:
>> > > > q="gene" without accent and get this result: "genève" with accent
>> > > >
>> > > >
>> > > >
>> > > > --
>> > > > View this message in context:
>> > > >
>> > http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150989.html
>> > >
>> > > > Sent from the Solr - User mailing list archive at Nabble.com.
>> > > >
>> > >
>> > >
>> > > --
>> > >  If you reply to this email, your message will be added to the
>> discussion
>> > > below:
>> > >
>> http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150990.html
>> > >  To unsubscribe from Auto Complete, click here
>> > > <
>> >
>> >
>> > > .
>> > > NAML
>> > > <
>> >
>> http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml
>>
>> > >
>> > >
>> >
>> >
>> >
>> >
>> > --
>> > View this message in context:
>> > http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4151211.html
>>
>> > Sent from the Solr - User mailing list archive at Nabble.com.
>> >
>>
>>
>> --
>>  If you reply to this email, your message will be added to the
>> discussion below:
>> http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4151216.html
>>  To unsubscribe from Auto Complete, click here
>> 

Re: Auto Complete

2014-08-05 Thread benjelloun
yeah thats true i creat this index just for auto complete
here is my schema:









the i use "suggestField" for autocomplet like i mentioned above
do you have any other configuration which can do what i need ?



2014-08-05 15:19 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
ml-node+s472066n4151216...@n3.nabble.com>:

> Unless I'm mistaken, it seems like you've created this index specifically
> for autocomplete? Or is this index used for general search also?
>
> The easy way to understand this question: Is there one entry in your index
> for each term you want to autocomplete? Or are there multiple entries that
> might contain the same term?
>
> Michael Della Bitta
>
> Applications Developer
>
> o: +1 646 532 3062
>
> appinions inc.
>
> “The Science of Influence Marketing”
>
> 18 East 41st Street
>
> New York, NY 10017
>
> t: @appinions  | g+:
> plus.google.com/appinions
> <
> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts>
>
> w: appinions.com 
>
>
> On Tue, Aug 5, 2014 at 9:10 AM, benjelloun <[hidden email]
> > wrote:
>
> > hello,
> >
> > did you find any solution to this problem ?
> >
> > regards
> >
> >
> > 2014-08-04 16:16 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
> > [hidden email] >:
> >
> > > How are you implementing autosuggest? I'm assuming you're querying an
> > > indexed field and getting a stored value back. But there are a wide
> > > variety
> > > of ways of doing it.
> > >
> > > Michael Della Bitta
> > >
> > > Applications Developer
> > >
> > > o: +1 646 532 3062
> > >
> > > appinions inc.
> > >
> > > “The Science of Influence Marketing”
> > >
> > > 18 East 41st Street
> > >
> > > New York, NY 10017
> > >
> > > t: @appinions  | g+:
> > > plus.google.com/appinions
> > > <
> > >
> >
> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
> > >
> > >
> > > w: appinions.com 
> > >
> > >
> > > On Mon, Aug 4, 2014 at 10:10 AM, benjelloun <[hidden email]
> > > > wrote:
> > >
> > > > hello you didnt enderstand well my probleme,
> > > >
> > > > i give exemple: i have document contain "genève" with accent
> > > > when i do q="gene" --> autoSuggest "geneve" because of
> > > > ASCIIFoldingFilterFactory preserveOriginal="true"
> > > > when i do q="genè" --> autoSuggest "genève"
> > > > but what i need to is:
> > > > q="gene" without accent and get this result: "genève" with accent
> > > >
> > > >
> > > >
> > > > --
> > > > View this message in context:
> > > >
> > http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150989.html
> > >
> > > > Sent from the Solr - User mailing list archive at Nabble.com.
> > > >
> > >
> > >
> > > --
> > >  If you reply to this email, your message will be added to the
> discussion
> > > below:
> > >
> http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150990.html
> > >  To unsubscribe from Auto Complete, click here
> > > <
> >
> >
> > > .
> > > NAML
> > > <
> >
> http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml
>
> > >
> > >
> >
> >
> >
> >
> > --
> > View this message in context:
> > http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4151211.html
>
> > Sent from the Solr - User mailing list archive at Nabble.com.
> >
>
>
> --
>  If you reply to this email, your message will be added to the discussion
> below:
> http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4151216.html
>  To unsubscribe from Auto Complete, click here
> 
> .
> NAML
> 
>




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4151222.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Auto Complete

2014-08-05 Thread Michael Della Bitta
Unless I'm mistaken, it seems like you've created this index specifically
for autocomplete? Or is this index used for general search also?

The easy way to understand this question: Is there one entry in your index
for each term you want to autocomplete? Or are there multiple entries that
might contain the same term?

Michael Della Bitta

Applications Developer

o: +1 646 532 3062

appinions inc.

“The Science of Influence Marketing”

18 East 41st Street

New York, NY 10017

t: @appinions  | g+:
plus.google.com/appinions

w: appinions.com 


On Tue, Aug 5, 2014 at 9:10 AM, benjelloun  wrote:

> hello,
>
> did you find any solution to this problem ?
>
> regards
>
>
> 2014-08-04 16:16 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
> ml-node+s472066n4150990...@n3.nabble.com>:
>
> > How are you implementing autosuggest? I'm assuming you're querying an
> > indexed field and getting a stored value back. But there are a wide
> > variety
> > of ways of doing it.
> >
> > Michael Della Bitta
> >
> > Applications Developer
> >
> > o: +1 646 532 3062
> >
> > appinions inc.
> >
> > “The Science of Influence Marketing”
> >
> > 18 East 41st Street
> >
> > New York, NY 10017
> >
> > t: @appinions  | g+:
> > plus.google.com/appinions
> > <
> >
> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts
> >
> >
> > w: appinions.com 
> >
> >
> > On Mon, Aug 4, 2014 at 10:10 AM, benjelloun <[hidden email]
> > > wrote:
> >
> > > hello you didnt enderstand well my probleme,
> > >
> > > i give exemple: i have document contain "genève" with accent
> > > when i do q="gene" --> autoSuggest "geneve" because of
> > > ASCIIFoldingFilterFactory preserveOriginal="true"
> > > when i do q="genè" --> autoSuggest "genève"
> > > but what i need to is:
> > > q="gene" without accent and get this result: "genève" with accent
> > >
> > >
> > >
> > > --
> > > View this message in context:
> > >
> http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150989.html
> >
> > > Sent from the Solr - User mailing list archive at Nabble.com.
> > >
> >
> >
> > --
> >  If you reply to this email, your message will be added to the discussion
> > below:
> > http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150990.html
> >  To unsubscribe from Auto Complete, click here
> > <
> http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=unsubscribe_by_code&node=4150987&code=YW5hc3MuYm5qQGdtYWlsLmNvbXw0MTUwOTg3fC0xMDQyNjMzMDgx
> >
> > .
> > NAML
> > <
> http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml
> >
> >
>
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4151211.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: Auto Complete

2014-08-05 Thread benjelloun
hello,

did you find any solution to this problem ?

regards


2014-08-04 16:16 GMT+02:00 Michael Della Bitta-2 [via Lucene] <
ml-node+s472066n4150990...@n3.nabble.com>:

> How are you implementing autosuggest? I'm assuming you're querying an
> indexed field and getting a stored value back. But there are a wide
> variety
> of ways of doing it.
>
> Michael Della Bitta
>
> Applications Developer
>
> o: +1 646 532 3062
>
> appinions inc.
>
> “The Science of Influence Marketing”
>
> 18 East 41st Street
>
> New York, NY 10017
>
> t: @appinions  | g+:
> plus.google.com/appinions
> <
> https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts>
>
> w: appinions.com 
>
>
> On Mon, Aug 4, 2014 at 10:10 AM, benjelloun <[hidden email]
> > wrote:
>
> > hello you didnt enderstand well my probleme,
> >
> > i give exemple: i have document contain "genève" with accent
> > when i do q="gene" --> autoSuggest "geneve" because of
> > ASCIIFoldingFilterFactory preserveOriginal="true"
> > when i do q="genè" --> autoSuggest "genève"
> > but what i need to is:
> > q="gene" without accent and get this result: "genève" with accent
> >
> >
> >
> > --
> > View this message in context:
> > http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150989.html
>
> > Sent from the Solr - User mailing list archive at Nabble.com.
> >
>
>
> --
>  If you reply to this email, your message will be added to the discussion
> below:
> http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150990.html
>  To unsubscribe from Auto Complete, click here
> 
> .
> NAML
> 
>




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4151211.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Auto Complete

2014-08-04 Thread benjelloun
if you have another configuration to can solve this problem please share it,
thanks




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4151002.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Auto Complete

2014-08-04 Thread benjelloun
here is my configuration:


  suggestDic
  org.apache.solr.spelling.suggest.Suggester
  org.apache.solr.spelling.suggest.fst.WFSTLookupFactory
  
  suggestFolder
  suggestField  
  true
  true
   suggestFolder/emptyDic.txt
 
  

 
textSuggest
  
  
  

  suggests
  true
  
  suggestDic
  true
  6   
  true
  6 
  true  


  suggests

  



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150992.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Auto Complete

2014-08-04 Thread Michael Della Bitta
How are you implementing autosuggest? I'm assuming you're querying an
indexed field and getting a stored value back. But there are a wide variety
of ways of doing it.

Michael Della Bitta

Applications Developer

o: +1 646 532 3062

appinions inc.

“The Science of Influence Marketing”

18 East 41st Street

New York, NY 10017

t: @appinions  | g+:
plus.google.com/appinions

w: appinions.com 


On Mon, Aug 4, 2014 at 10:10 AM, benjelloun  wrote:

> hello you didnt enderstand well my probleme,
>
> i give exemple: i have document contain "genève" with accent
> when i do q="gene" --> autoSuggest "geneve" because of
> ASCIIFoldingFilterFactory preserveOriginal="true"
> when i do q="genè" --> autoSuggest "genève"
> but what i need to is:
> q="gene" without accent and get this result: "genève" with accent
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150989.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: Auto Complete

2014-08-04 Thread benjelloun
hello you didnt enderstand well my probleme,

i give exemple: i have document contain "genève" with accent
when i do q="gene" --> autoSuggest "geneve" because of
ASCIIFoldingFilterFactory preserveOriginal="true"
when i do q="genè" --> autoSuggest "genève" 
but what i need to is:
q="gene" without accent and get this result: "genève" with accent



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987p4150989.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Auto Complete

2014-08-04 Thread Michael Della Bitta
You need to use this filter in your analysis chain:

http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ASCIIFoldingFilterFactory

Michael Della Bitta

Applications Developer

o: +1 646 532 3062

appinions inc.

“The Science of Influence Marketing”

18 East 41st Street

New York, NY 10017

t: @appinions  | g+:
plus.google.com/appinions

w: appinions.com 


On Mon, Aug 4, 2014 at 9:59 AM, benjelloun  wrote:

> Hello,
>
> I have an index which contain "genève"
> I need to do this query q="gene" and get in auto complete this result :
> "genève"  (e -> è)
> I'm using StandardTokenizerFactory for field and SpellCheckComponent for
> searchCompenent.
> All solutions are welcome,
>
> Thanks,
> Best regards,
> Anass BENJELLOUN
>
>
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Auto-Complete-tp4150987.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


RE: Auto complete with 50TB of data - Need your inputs?

2014-06-05 Thread Toke Eskildsen
bbi123 [bbar...@gmail.com] wrote:
> We have a requirement to for large data set like Billing data for example.
> The Business wants to do sorting and type ahead functions for it.  For
> example, when I start typing “8164…” they want to list ALL the unique number
> and the associated attributes displayed (name, description, etc).

So either a search for prefix or a lookup with TermsComponent? I do not like 
the "ALL" in the requirements though. What if the prefix matches 5M documents?

> We have about 50TB of files that needs to be indexed. I haven't indexed this
> much data before hence thought of getting your valuable inputs. I am
> thinking of using SOLR cloud and use SSD for faster IO. I might need your
> inputs on hardware requirements too.

The index size it next to impossible to predict without more knowledge. Try and 
acquire just a few GB of content and experiment, so that you can get an idea of 
the final index size. The estimated number of documents and unique values in 
your lookup field are also very valuable to know.

As for storage, the question these days should be "Are there any reasons not to 
use SSDs for index storage"? The amount of RAM needed will have to be 
determined experimentally: Type-ahead does require very low latency and might 
need more caching than normally.

- Toke Eskildsen, State and University Library, Denmark


Re: Auto complete with 50TB of data - Need your inputs?

2014-06-05 Thread Shawn Heisey
On 6/5/2014 10:55 AM, bbi123 wrote:
> We have a requirement to for large data set like Billing data for example. 
> The Business wants to do sorting and type ahead functions for it.  For
> example, when I start typing “8164…” they want to list ALL the unique number
> and the associated attributes displayed (name, description, etc). 
>  
> We have about 50TB of files that needs to be indexed. I haven't indexed this
> much data before hence thought of getting your valuable inputs. I am
> thinking of using SOLR cloud and use SSD for faster IO. I might need your
> inputs on hardware requirements too.

It's nearly impossible to give you a hardware requirement projection. 
There are simply too many variables.  One variable is that we cannot
know how much of that 50TB of data will actually end up in the Solr
index.  The archive for my data is getting close to 300TB, but because
that is mostly photos and video, the total size of the resulting Solr
index is about 100GB.  My actual data source is a MySQL database that's
probably about 250GB.

http://searchhub.org/2012/07/23/sizing-hardware-in-the-abstract-why-we-dont-have-a-definitive-answer/

The one thing that I can say is that RAM is king with Solr.  Once you
know how big the Solr index contained on each server will actually be,
you'll have some idea of how much RAM you might need.  Add up the Solr
heap size and the total index size on disk for each server.  That is the
ideal total memory size for each server.  You might not actually need
that much RAM, but if you have it, we can *almost guarantee* good
performance.

http://wiki.apache.org/solr/SolrPerformanceProblems

SSD will help performance, but it is not a complete substitute for RAM. 
If you have the ideal RAM size, SSD is not required, because all the
important data will be in RAM, which is much faster than SSD.

> I assume there is no limitations in terms of the maximum number of documents
> that can be indexed in latest version of SOLR (4.8). Am I right?

Each shard has a limit of just over two billion documents.  The actual
number is 2147483647, the maximum number a 32bit java integer can hold. 
This includes deleted documents, so we recommend not going over 1
billion.  SolrCloud has no limits, because the collection can have many
shards.

Thanks,
Shawn



Re: auto-complete with typo fuzzy suggests

2013-02-13 Thread Jack Krupansky

Try the spellchecker rather than the suggester/auto-complete:

http://wiki.apache.org/solr/SpellCheckComponent

-- Jack Krupansky

-Original Message- 
From: ALEX PKB 
Sent: Wednesday, February 13, 2013 2:34 PM 
To: solr-user@lucene.apache.org 
Subject: auto-complete with typo fuzzy suggests 


Hi,
I tried to implement auto-complete with some fuzzy matches, I've
tried phonetic, ngram, the results are "too fuzzy", Is there any analyzer
to handle "typo".
Thanks!


Re: Auto-complete phrase

2012-03-28 Thread Rémy Loubradou
Thanks Otis but that's not an option for me. "Should" be pretty easy to do
this with Solr, I will still continue to work on it.

Great William I will give a try with this method, thanks.

On 28 March 2012 06:11, William Bell  wrote:

> I am also very confused at the use case for the Suggester component.
> With collate on, it will try to combine random words together not the
> actual phrases that are there.
>
> I get better mileage out of EDGE grams and tokenize on whitespace...
> Left to right... Since that is how most people think.
>
> However, I would like Suggester to work as follows:
>
> Index:
> Chris Smith
> Tony Dawson
> Chris Leaf
> Daddy Golucky
>
> Query:
> 1. "Chris" it returns "Chris Leaf" but not both Chris Smith and Chris Leaf.
> 2. I seem to get collated (take first work and combine with second
> word). SO I would see things like "Smith Leaf" Very strange and
> not what we expect. These are formal names.
>
> When I use Ngrams I can index:
>
> C
> Ch
> Chr
> Chri
> Chris
> S
> Sm
> Smi
> Smit
> Smith
>
> Thus if I search on "Smi" it will match Chris Smith and also Chris
> Leaf. Exactly what I want.
>
>
>
>
> On Tue, Mar 27, 2012 at 11:05 AM, Rémy Loubradou  wrote:
> > Hello, I am working on creating a auto-complete functionality for my
> field
> > merchant_name present all over my documents. I am using the version 3.4
> of
> > Solr and I am trying to take advantage of the Suggester functionality.
> > Unfortunately so far I didn't figure out how to make it works as  I
> > expected.
> >
> > If my list of merchants present in my documents is:(my real list is
> bigger
> > than the following list, that's the reason why I don't use dictionnary
> and
> > also because it will change often.)
> > Redoute
> > Suisse Trois
> > Conforama
> > But
> > Cult Beauty
> > Brother Trois
> >
> > I expect from the Suggester component to match words or part of them and
> > return phrases where words or part of them have been matched.
> > for example with /suggest?q=tro, I would like to get this:
> >
> > 
> > 
> > 0
> > 0
> > 
> > 
> > 
> > 
> > 2
> > 0
> > x
> > 
> > Bother Trois
> > Suisse Trois
> > 
> > 
> > 
> > 
> > 
> >
> > I experimented suggestion on a field configured with the tokenizer
> > "solr.KeywordTokenizerFactory" or "solr.WhitespaceTokenizerFactory".
> > In my mind I have to find a way to handle 3 cases:
> > /suggest?q=bo ->(should return) bother trois
> > /suggest?q=tro ->(should return) bother trois, suisse trois
> > /suggest?q=bo%20tro ->(should return) bother trois
> >
> > With the "solr.KeywordTokenizerFactory" I get:
> > /suggest?q=bo -> bother trois
> > /suggest?q=tro -> "nothing"
> > /suggest?q=bo%20tro -> "nothing"
> >
> > With the "solr.WhitespaceTokenizerFactory" I get:
> > /suggest?q=bo -> bother
> > /suggest?q=troi -> trois
> > /suggest?q=bo%20tro -> bother, trois
> >
> > Not exactly what I want ... :(
> >
> > My configuration in the file solrconfig.xml for the suggester component:
> >
> > 
> >
> >  suggestMerchant
> >   name="classname">org.apache.solr.spelling.suggest.Suggester
> >   > name="lookupImpl">org.apache.solr.spelling.suggest.fst.FSTLookup
> >  
> >  merchant_name_autocomplete  
> >  0.0
> >  true
> > 
> >
> >  
> >   > name="/suggest/merchant">
> >
> >  true
> >  suggestMerchant
> >  true
> >  10
> >  true
> >  10
> >
> >
> >  suggestMerchant
> >
> >  
> >
> > How can I implement autocomplete with the Suggester component to get
> what I
> > expect? Thanks for your help, I really appreciate.
>
>
>
> --
> Bill Bell
> billnb...@gmail.com
> cell 720-256-8076
>


Re: Auto-complete phrase

2012-03-27 Thread William Bell
I am also very confused at the use case for the Suggester component.
With collate on, it will try to combine random words together not the
actual phrases that are there.

I get better mileage out of EDGE grams and tokenize on whitespace...
Left to right... Since that is how most people think.

However, I would like Suggester to work as follows:

Index:
Chris Smith
Tony Dawson
Chris Leaf
Daddy Golucky

Query:
1. "Chris" it returns "Chris Leaf" but not both Chris Smith and Chris Leaf.
2. I seem to get collated (take first work and combine with second
word). SO I would see things like "Smith Leaf" Very strange and
not what we expect. These are formal names.

When I use Ngrams I can index:

C
Ch
Chr
Chri
Chris
S
Sm
Smi
Smit
Smith

Thus if I search on "Smi" it will match Chris Smith and also Chris
Leaf. Exactly what I want.




On Tue, Mar 27, 2012 at 11:05 AM, Rémy Loubradou  wrote:
> Hello, I am working on creating a auto-complete functionality for my field
> merchant_name present all over my documents. I am using the version 3.4 of
> Solr and I am trying to take advantage of the Suggester functionality.
> Unfortunately so far I didn't figure out how to make it works as  I
> expected.
>
> If my list of merchants present in my documents is:(my real list is bigger
> than the following list, that's the reason why I don't use dictionnary and
> also because it will change often.)
> Redoute
> Suisse Trois
> Conforama
> But
> Cult Beauty
> Brother Trois
>
> I expect from the Suggester component to match words or part of them and
> return phrases where words or part of them have been matched.
> for example with /suggest?q=tro, I would like to get this:
>
> 
> 
> 0
> 0
> 
> 
> 
> 
> 2
> 0
> x
> 
> Bother Trois
> Suisse Trois
> 
> 
> 
> 
> 
>
> I experimented suggestion on a field configured with the tokenizer
> "solr.KeywordTokenizerFactory" or "solr.WhitespaceTokenizerFactory".
> In my mind I have to find a way to handle 3 cases:
> /suggest?q=bo ->(should return) bother trois
> /suggest?q=tro ->(should return) bother trois, suisse trois
> /suggest?q=bo%20tro ->(should return) bother trois
>
> With the "solr.KeywordTokenizerFactory" I get:
> /suggest?q=bo -> bother trois
> /suggest?q=tro -> "nothing"
> /suggest?q=bo%20tro -> "nothing"
>
> With the "solr.WhitespaceTokenizerFactory" I get:
> /suggest?q=bo -> bother
> /suggest?q=troi -> trois
> /suggest?q=bo%20tro -> bother, trois
>
> Not exactly what I want ... :(
>
> My configuration in the file solrconfig.xml for the suggester component:
>
> 
>    
>      suggestMerchant
>      org.apache.solr.spelling.suggest.Suggester
>       name="lookupImpl">org.apache.solr.spelling.suggest.fst.FSTLookup
>      
>      merchant_name_autocomplete  
>      0.0
>      true
> 
>    
>  
>   name="/suggest/merchant">
>    
>      true
>      suggestMerchant
>      true
>      10
>      true
>      10
>    
>    
>      suggestMerchant
>    
>  
>
> How can I implement autocomplete with the Suggester component to get what I
> expect? Thanks for your help, I really appreciate.



-- 
Bill Bell
billnb...@gmail.com
cell 720-256-8076


RE: Auto complete

2008-07-10 Thread sundar shankar
Daniel,
  I have tested your config of autocomplete. That works perfectly. 
THANKS A LOT FOR THAT. Truly appreciate your help.
 
All,
  I am not able to wiki for a lot of the 1.3 filters and analysers. Is 
there somewhere that I can get documentation on the same. 
 
-S



> Date: Tue, 8 Jul 2008 23:13:57 +0530> From: [EMAIL PROTECTED]> To: 
> solr-user@lucene.apache.org> Subject: Re: Auto complete> > He must be using a 
> nightly build of Solr 1.3 -- I think you can consider> using it as it is 
> quite stable and close to release.> > On Tue, Jul 8, 2008 at 10:38 PM, sundar 
> shankar <[EMAIL PROTECTED]>> wrote:> > > Hi Daniel,> > Thanks for the code. I 
> just did observe that you have> > EdgeNGramFilterFactory. I didnt find it in 
> the 1.2 Solr version. Which> > version are you using for this. 1.3 isnt out 
> yet rite. Is there any other> > production version of Solr available that I 
> can use?> >> > Regards> > Sundar> >> >> >> > > Subject: Re: Auto complete> 
> From: [EMAIL PROTECTED]> To:> > solr-user@lucene.apache.org> Date: Tue, 8 Jul 
> 2008 11:30:31 +0100> > Hi,>> > > This is how we implement our autocomplete 
> feature, excerpt from>> > schema.xml> > -First accept the input as is without 
> alteration> -Lowercase> > the input, and eliminate all non a-z0-9 chars to 
> normalize> the input>> > -split into multiple tokens with 
> EdgeNGramFilterFactory upto a max of> 100> > chars, all starting from the 
> beginning of the input, e.g. hello> becomes> > h,he,hel,hell,hello. > -For 
> queries we accept the first 20 chars.> > Hope> > this helps.> > >  name="autocomplete" class="solr.TextField">>> > > 
> >> >  class="solr.LowerCaseFilterFactory" />>  > 
> class="solr.PatternReplaceFilterFactory"> pattern="([^a-z0-9])"> > 
> replacement="" replace="all" />>  > 
> class="solr.EdgeNGramFilterFactory"> maxGramSize="100" minGramSize="1" />>> > 
> > >  > 
> class="solr.KeywordTokenizerFactory"/>>  > 
> class="solr.LowerCaseFilterFactory" />>  > 
> class="solr.PatternReplaceFilterFactory"> pattern="([^a-z0-9])"> > 
> replacement="" replace="all" />>  > 
> class="solr.PatternReplaceFilterFactory"> pattern="^(.{20})(.*)?"> > 
> replacement="$1" replace="all" />> > > ...>  > 
> name="ac" type="autocomplete" indexed="true" stored="true"> required="false"> 
> > />> > Regards,> Dan> > > > > On Mon, 2008-07-07 at 17:12 +, sundar> > 
> shankar wrote:> > Hi All,> > I am using Solr for some time and am having> > 
> trouble with an auto complete feature that I have been trying to> > 
> incorporate. I am indexing solr as a database column to solr field mapping.> 
> > I have tried various configs that were mentioned in the solr user 
> community> > suggestions and have tried a few option of my own too. Each of 
> them seem to> > either not bring me the exact data I want or seems to get 
> excess data.> > >> > > I have tried.> > text_ws,> > text,> > string> > 
> EdgeNGramTokenizerFactory>> > > the subword example> > textTight> > and 
> juggling arnd some of the filters> > and analysers togther.> > > > Couldnt 
> get dismax to work as somehow it wasnt> > able to connect my field defined in 
> the schema to the qf param that I was> > passing in the request.> > > > Text 
> tight was the best results I had but the> > problem there was it was 
> searching for whole words and not part words.> >> > example> > > > if my 
> query String was field1:Word1 word2* I was getting back> > results but if my 
> query string was field1: Word1 wor* I didnt get a result> > back.> > > > I am 
> little perplexed on how to implement this. I dont know> > what has to be 
> done.> > > > The schema> > > > > >  type="text_ws" indexed="true" stored="true"> > termVectors="true"/>> > 
> > > > > 
>  > indexed="true" 
> stored="false" multiValued="true"/>> > > >  > type="text"

RE: Auto complete

2008-07-08 Thread sundar shankar
Thanks Shalin.

> Date: Tue, 8 Jul 2008 23:13:57 +0530> From: [EMAIL PROTECTED]> To: 
> solr-user@lucene.apache.org> Subject: Re: Auto complete> > He must be using a 
> nightly build of Solr 1.3 -- I think you can consider> using it as it is 
> quite stable and close to release.> > On Tue, Jul 8, 2008 at 10:38 PM, sundar 
> shankar <[EMAIL PROTECTED]>> wrote:> > > Hi Daniel,> > Thanks for the code. I 
> just did observe that you have> > EdgeNGramFilterFactory. I didnt find it in 
> the 1.2 Solr version. Which> > version are you using for this. 1.3 isnt out 
> yet rite. Is there any other> > production version of Solr available that I 
> can use?> >> > Regards> > Sundar> >> >> >> > > Subject: Re: Auto complete> 
> From: [EMAIL PROTECTED]> To:> > solr-user@lucene.apache.org> Date: Tue, 8 Jul 
> 2008 11:30:31 +0100> > Hi,>> > > This is how we implement our autocomplete 
> feature, excerpt from>> > schema.xml> > -First accept the input as is without 
> alteration> -Lowercase> > the input, and eliminate all non a-z0-9 chars to 
> normalize> the input>> > -split into multiple tokens with 
> EdgeNGramFilterFactory upto a max of> 100> > chars, all starting from the 
> beginning of the input, e.g. hello> becomes> > h,he,hel,hell,hello. > -For 
> queries we accept the first 20 chars.> > Hope> > this helps.> > >  name="autocomplete" class="solr.TextField">>> > > 
> >> >  class="solr.LowerCaseFilterFactory" />>  > 
> class="solr.PatternReplaceFilterFactory"> pattern="([^a-z0-9])"> > 
> replacement="" replace="all" />>  > 
> class="solr.EdgeNGramFilterFactory"> maxGramSize="100" minGramSize="1" />>> > 
> > >  > 
> class="solr.KeywordTokenizerFactory"/>>  > 
> class="solr.LowerCaseFilterFactory" />>  > 
> class="solr.PatternReplaceFilterFactory"> pattern="([^a-z0-9])"> > 
> replacement="" replace="all" />>  > 
> class="solr.PatternReplaceFilterFactory"> pattern="^(.{20})(.*)?"> > 
> replacement="$1" replace="all" />> > > ...>  > 
> name="ac" type="autocomplete" indexed="true" stored="true"> required="false"> 
> > />> > Regards,> Dan> > > > > On Mon, 2008-07-07 at 17:12 +, sundar> > 
> shankar wrote:> > Hi All,> > I am using Solr for some time and am having> > 
> trouble with an auto complete feature that I have been trying to> > 
> incorporate. I am indexing solr as a database column to solr field mapping.> 
> > I have tried various configs that were mentioned in the solr user 
> community> > suggestions and have tried a few option of my own too. Each of 
> them seem to> > either not bring me the exact data I want or seems to get 
> excess data.> > >> > > I have tried.> > text_ws,> > text,> > string> > 
> EdgeNGramTokenizerFactory>> > > the subword example> > textTight> > and 
> juggling arnd some of the filters> > and analysers togther.> > > > Couldnt 
> get dismax to work as somehow it wasnt> > able to connect my field defined in 
> the schema to the qf param that I was> > passing in the request.> > > > Text 
> tight was the best results I had but the> > problem there was it was 
> searching for whole words and not part words.> >> > example> > > > if my 
> query String was field1:Word1 word2* I was getting back> > results but if my 
> query string was field1: Word1 wor* I didnt get a result> > back.> > > > I am 
> little perplexed on how to implement this. I dont know> > what has to be 
> done.> > > > The schema> > > > > >  type="text_ws" indexed="true" stored="true"> > termVectors="true"/>> > 
> > > > > 
>  > indexed="true" 
> stored="false" multiValued="true"/>> > > >  > type="text" 
> indexed="true" stored="true" termVectors="true"> > multiValued="true"/>> > 
>  > indexed="true" stored="true" 
> termVectors="true" multiValued="true"/>> > > &

Re: Auto complete

2008-07-08 Thread Shalin Shekhar Mangar
He must be using a nightly build of Solr 1.3 -- I think you can consider
using it as it is quite stable and close to release.

On Tue, Jul 8, 2008 at 10:38 PM, sundar shankar <[EMAIL PROTECTED]>
wrote:

> Hi Daniel,
> Thanks for the code. I just did observe that you have
> EdgeNGramFilterFactory. I didnt find it in the 1.2 Solr version. Which
> version are you using for this. 1.3 isnt out yet rite. Is there any other
> production version of Solr available that I can use?
>
> Regards
> Sundar
>
>
>
> > Subject: Re: Auto complete> From: [EMAIL PROTECTED]> To:
> solr-user@lucene.apache.org> Date: Tue, 8 Jul 2008 11:30:31 +0100> > Hi,>
> > This is how we implement our autocomplete feature, excerpt from>
> schema.xml> > -First accept the input as is without alteration> -Lowercase
> the input, and eliminate all non a-z0-9 chars to normalize> the input>
> -split into multiple tokens with EdgeNGramFilterFactory upto a max of> 100
> chars, all starting from the beginning of the input, e.g. hello> becomes
> h,he,hel,hell,hello. > -For queries we accept the first 20 chars.> > Hope
> this helps.> > > >
> > >
> >  class="solr.PatternReplaceFilterFactory"> pattern="([^a-z0-9])"
> replacement="" replace="all" />>  class="solr.EdgeNGramFilterFactory"> maxGramSize="100" minGramSize="1" />>
> > >  class="solr.KeywordTokenizerFactory"/>>  class="solr.LowerCaseFilterFactory" />>  class="solr.PatternReplaceFilterFactory"> pattern="([^a-z0-9])"
> replacement="" replace="all" />>  class="solr.PatternReplaceFilterFactory"> pattern="^(.{20})(.*)?"
> replacement="$1" replace="all" />> > > ...>  name="ac" type="autocomplete" indexed="true" stored="true"> required="false"
> />> > Regards,> Dan> > > > > On Mon, 2008-07-07 at 17:12 +, sundar
> shankar wrote:> > Hi All,> > I am using Solr for some time and am having
> trouble with an auto complete feature that I have been trying to
> incorporate. I am indexing solr as a database column to solr field mapping.
> I have tried various configs that were mentioned in the solr user community
> suggestions and have tried a few option of my own too. Each of them seem to
> either not bring me the exact data I want or seems to get excess data.> > >
> > I have tried.> > text_ws,> > text,> > string> > EdgeNGramTokenizerFactory>
> > the subword example> > textTight> > and juggling arnd some of the filters
> and analysers togther.> > > > Couldnt get dismax to work as somehow it wasnt
> able to connect my field defined in the schema to the qf param that I was
> passing in the request.> > > > Text tight was the best results I had but the
> problem there was it was searching for whole words and not part words.> >
> example> > > > if my query String was field1:Word1 word2* I was getting back
> results but if my query string was field1: Word1 wor* I didnt get a result
> back.> > > > I am little perplexed on how to implement this. I dont know
> what has to be done.> > > > The schema> > > > > >  termVectors="true"/>> > > > > >  indexed="true" stored="false" multiValued="true"/>> > > >  type="text" indexed="true" stored="true" termVectors="true"
> multiValued="true"/>> >  indexed="true" stored="true" termVectors="true" multiValued="true"/>> > > >
>  multiValued="true" termVectors="true"/>> >  type="textTight" indexed="true" stored="true" multiValued="true"
> termVectors="true"/>> > > > > > > > I Index institution.name only, the
> rest are copy fields of the same.> > > > > > Any help is appreciated.> > > >
> Thanks> > Sundar> > > >
> _> > Chose
> your Life Partner? Join MSN Matrimony> >
> http://www.shaadi.com/msn/matrimony.php > > > > < scanned for virus and spam content>>> Daniel Rosher> Developer>
> www.thehotonlinenetwork.com> d: 0207 3489 912> > t: 0845 4680 568> > f:
> 0845 4680 868> > m: > > Beaumont House, Kensington Village, Avonmore Road,
> London, W14 8TS> > > > - - - - - - - - - - - - - - - - - - - - - - - - - - -
> - - - - - - - - - - - - - - - - - - - - - - - - - - - -> > This message is
> sent in confidence for the addressee only. It may contain privileged> >
> information. The contents are not to be disclosed to anyone other than the
> addressee.> > Unauthorised recipients are requested to preserve this
> confidentiality and to advise> > us of any errors in transmission. Thank
> you.> > hotonline ltd is registered in England & Wales. Registered office:
> One Canada Square,> > Canary Wharf, London E14 5AP. Registered No: 1904765.
> _
> Wish to Marry Now? Join Shaadi.com FREE!
> http://www.shaadi.com/registration/user/index.php?ptnr=mhottag




-- 
Regards,
Shalin Shekhar Mangar.


RE: Auto complete

2008-07-08 Thread sundar shankar
Hi Daniel,
 Thanks for the code. I just did observe that you have 
EdgeNGramFilterFactory. I didnt find it in the 1.2 Solr version. Which version 
are you using for this. 1.3 isnt out yet rite. Is there any other production 
version of Solr available that I can use?
 
Regards
Sundar



> Subject: Re: Auto complete> From: [EMAIL PROTECTED]> To: 
> solr-user@lucene.apache.org> Date: Tue, 8 Jul 2008 11:30:31 +0100> > Hi,> > 
> This is how we implement our autocomplete feature, excerpt from> schema.xml> 
> > -First accept the input as is without alteration> -Lowercase the input, and 
> eliminate all non a-z0-9 chars to normalize> the input> -split into multiple 
> tokens with EdgeNGramFilterFactory upto a max of> 100 chars, all starting 
> from the beginning of the input, e.g. hello> becomes h,he,hel,hell,hello. > 
> -For queries we accept the first 20 chars.> > Hope this helps.> > > 
> >  type="index">> >  class="solr.LowerCaseFilterFactory" />>  class="solr.PatternReplaceFilterFactory"> pattern="([^a-z0-9])" 
> replacement="" replace="all" />>  
> maxGramSize="100" minGramSize="1" />> > > 
> >  class="solr.LowerCaseFilterFactory" />>  class="solr.PatternReplaceFilterFactory"> pattern="([^a-z0-9])" 
> replacement="" replace="all" />>  class="solr.PatternReplaceFilterFactory"> pattern="^(.{20})(.*)?" 
> replacement="$1" replace="all" />> > > ...>  name="ac" type="autocomplete" indexed="true" stored="true"> required="false" 
> />> > Regards,> Dan> > > > > On Mon, 2008-07-07 at 17:12 +, sundar 
> shankar wrote:> > Hi All,> > I am using Solr for some time and am having 
> trouble with an auto complete feature that I have been trying to incorporate. 
> I am indexing solr as a database column to solr field mapping. I have tried 
> various configs that were mentioned in the solr user community suggestions 
> and have tried a few option of my own too. Each of them seem to either not 
> bring me the exact data I want or seems to get excess data.> > > > I have 
> tried.> > text_ws,> > text,> > string> > EdgeNGramTokenizerFactory> > the 
> subword example> > textTight> > and juggling arnd some of the filters and 
> analysers togther.> > > > Couldnt get dismax to work as somehow it wasnt able 
> to connect my field defined in the schema to the qf param that I was passing 
> in the request.> > > > Text tight was the best results I had but the problem 
> there was it was searching for whole words and not part words.> > example> > 
> > > if my query String was field1:Word1 word2* I was getting back results but 
> if my query string was field1: Word1 wor* I didnt get a result back.> > > > I 
> am little perplexed on how to implement this. I dont know what has to be 
> done.> > > > The schema> > > > > >  type="text_ws" indexed="true" stored="true" termVectors="true"/>> > 
> > > > > 
>  stored="false" multiValued="true"/>> > > >  indexed="true" stored="true" termVectors="true" multiValued="true"/>> > 
>  termVectors="true" multiValued="true"/>> > > >  type="subword" indexed="true" stored="true" multiValued="true" 
> termVectors="true"/>> >  indexed="true" stored="true" multiValued="true" termVectors="true"/>> > > > > 
> > > > I Index institution.name only, the rest are copy fields of the same.> > 
> > > > > Any help is appreciated.> > > > Thanks> > Sundar> > > > 
> _> > Chose 
> your Life Partner? Join MSN Matrimony> > 
> http://www.shaadi.com/msn/matrimony.php > > > > < for virus and spam content>>> Daniel Rosher> Developer> 
> www.thehotonlinenetwork.com> d: 0207 3489 912> > t: 0845 4680 568> > f: 0845 
> 4680 868> > m: > > Beaumont House, Kensington Village, Avonmore Road, London, 
> W14 8TS> > > > - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
> - - - - - - - - - - - - - - - - - - - - - - - -> > This message is sent in 
> confidence for the addressee only. It may contain privileged> > information. 
> The contents are not to be disclosed to anyone other than the addressee.> > 
> Unauthorised recipients are requested to preserve this confidentiality and to 
> advise> > us of any errors in transmission. Thank you.> > hotonline ltd is 
> registered in England & Wales. Registered office: One Canada Square,> > 
> Canary Wharf, London E14 5AP. Registered No: 1904765.
_
Wish to Marry Now? Join Shaadi.com FREE! 
http://www.shaadi.com/registration/user/index.php?ptnr=mhottag

Re: Auto complete

2008-07-08 Thread daniel rosher
Hi,

This is how we implement our autocomplete feature, excerpt from
schema.xml

-First accept the input as is without alteration
-Lowercase the input, and eliminate all non a-z0-9 chars to normalize
the input
-split into multiple tokens with EdgeNGramFilterFactory upto a max of
100 chars, all starting from the beginning of the input, e.g. hello
becomes h,he,hel,hell,hello. 
-For queries we accept the first 20 chars.

Hope this helps.
















...


Regards,
Dan




On Mon, 2008-07-07 at 17:12 +, sundar shankar wrote:
> Hi All,
>I am using Solr for some time and am having trouble with an auto 
> complete feature that I have been trying to incorporate. I am indexing solr 
> as a database column to solr field mapping. I have tried various configs that 
> were mentioned in the solr user community suggestions and have tried a few 
> option of my own too. Each of them seem to either not bring me the exact data 
> I want or seems to get excess data.
> 
> I have tried.
> text_ws,
> text,
> string
> EdgeNGramTokenizerFactory
> the subword example
> textTight
> and juggling arnd some of the filters and analysers togther.
> 
> Couldnt get dismax to work as somehow it wasnt able to connect my field 
> defined in the schema to the qf param that I was passing in the request.
> 
> Text tight was the best results I had but the problem there was it was 
> searching for whole words and not part words.
> example
> 
> if my query String was field1:Word1 word2* I was getting back results but if 
> my query string was field1: Word1 wor* I didnt get a result back.
> 
> I am little perplexed on how to implement this. I dont know what has to be 
> done.
> 
> The schema
> 
> 
> termVectors="true"/>
>
> 
> stored="false" multiValued="true"/>
>
> termVectors="true" multiValued="true"/>
> termVectors="true" multiValued="true"/>
> 
> multiValued="true"  termVectors="true"/>
> multiValued="true"  termVectors="true"/>
> 
> 
> 
> I Index institution.name only, the rest are copy fields of the same.
> 
> 
> Any help is appreciated.
> 
> Thanks
> Sundar
> 
> _
> Chose your Life Partner? Join MSN Matrimony
> http://www.shaadi.com/msn/matrimony.php 
> 
> <>
Daniel Rosher
Developer
www.thehotonlinenetwork.com
d: 0207 3489 912

t: 0845 4680 568

f: 0845 4680 868

m: 

Beaumont House, Kensington Village, Avonmore Road, London, W14 
8TS



- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
- - - - - - - - - - - - - - - - -

This message is sent in confidence for the addressee only. It may contain 
privileged

information. The contents are not to be disclosed to anyone other than the 
addressee.

Unauthorised recipients are requested to preserve this confidentiality and 
to advise

us of any errors in transmission. Thank you.

hotonline ltd is registered in England & Wales. Registered office: One 
Canada Square,

Canary Wharf, London E14 5AP. Registered No: 1904765.