Using Solr to build a product matcher, with learning to rank

2018-03-28 Thread Xavier Schepler
tor? Kind regards, Xavier Schepler

Re: Tomcat6 and Log4j

2011-02-10 Thread Xavier SCHEPLER
Yes thanks. This works fine : log4j.rootLogger=INFO, SOLR log4j.appender.SOLR=org.apache.log4j.DailyRollingFileAppender log4j.appender.SOLR.file=/home/quetelet_bdq/logs/bdq.log log4j.appender.SOLR.datePattern='.'-MM-dd log4j.appender.SOLR.layout=org.apache.log4j.PatternLayout log4j.appender.SO

Re: Tomcat6 and Log4j

2011-02-10 Thread Xavier SCHEPLER
I added it to /etc/default/tomcat6. What happened is that the same error message appeared twice in /var/log/tomcat6/catalina.out. Like the same file was loaded twice. -- Tous les courriers électroniques émis depuis la messag

Re: Tomcat6 and Log4j

2011-02-10 Thread Xavier SCHEPLER
A_HOME/webapps/solr/WEB-INF/classes/log4j.properties > > -- > Jan Høydahl, search solution architect > Cominvent AS - www.cominvent.com > > On 10. feb. 2011, at 09.41, Xavier Schepler wrote: > > > Hi, > > > > I added “slf4j-log4j12-1.5.5.jar” and “log4j-1.2.

Tomcat6 and Log4j

2011-02-10 Thread Xavier Schepler
Hi, I added “slf4j-log4j12-1.5.5.jar” and “log4j-1.2.15.jar” to $CATALINA_HOME/webapps/solr/WEB-INF/lib , then deleted the library “slf4j-jdk14-1.5.5.jar” from $CATALINA_HOME/webapps/solr/WEB-INF/lib, then created a directory $CATALINA_HOME/webapps/solr/WEB-INF/classes. and created $CATALINA_H

Re: Local param tag voodoo ?

2011-01-20 Thread Xavier SCHEPLER
Since it seems to be no voodoo available I did it on the client side. I send a first request to get the facets and a second to get the documents and their highlighting. It works well but requires more processing. > > From: Xavier SCHEPLER > Sent

Re: Local param tag voodoo ?

2011-01-20 Thread Xavier SCHEPLER
ur examples! If you explain what you want to do, someone may > be able to tell you how. From the hints in your last message, I suspect > nested queries _might_ be helpful to you. > > On 1/19/2011 3:46 AM, Xavier SCHEPLER wrote: > > Ok I was already at this point. > >

Re: Local param tag voodoo ?

2011-01-19 Thread Xavier SCHEPLER
Ok I was already at this point. My facetting system use exactly what is described in this page. I read it from the Solr 1.4 book. Otherwise I would'nt ask. The problem is that the filter queries doesn't affect the relevance score of the results so I want the terms in the main query. > -

Re: Local param tag voodoo ?

2011-01-19 Thread Xavier SCHEPLER
You're right the second query didn't result in an error but neither gave the expected result. I'm gone to have a look at the link you gave me. Thanks ! > > From: Markus Jelsma > Sent: Tue Jan 18 21:31:52 CET 2011 > To: > Subject: Re: Local param tag vood

Local param tag voodoo ?

2011-01-18 Thread Xavier Schepler
Hey, here are my needs : - a query that has tagged and untagged contents - facets that ignore the tagged contents I tryed : q=({!tag=toExclude} ignored) taken into account q={tag=toExclude v='ignored'} take into account Both resulted in a error. Is this possible or do I have to try another

Re: Solr boolean operators

2011-01-13 Thread Xavier SCHEPLER
> query ... it's the same. In terms of the score of the results no, > since, if you are using the first query, the documents that will match both > the "a" and the "b" terms, will match higher then the ones matching just the > "a" term. > > O

Solr boolean operators

2011-01-13 Thread Xavier Schepler
Hi, with the Lucene query syntax, is : a AND (a OR b) equivalent to : a (absorption) ?

Re: No response from Solr on complex request after several days

2010-10-29 Thread Xavier Schepler
need to do it, but you can automatically restart it once a day. On Thu, Oct 28, 2010 at 2:20 AM, Xavier Schepler wrote: Hi, We are in a beta testing phase, with several users a day. After several days of waiting, the solr server didn't respond to requests that require a lot of proce

No response from Solr on complex request (real issue explained)

2010-10-28 Thread Xavier Schepler
Hi, We are in a beta testing phase, with several users a day. After several days of running well, the solr server stopped responding to requests that require a lot of processing time, like this one : wt=json&omitHeader=true&q=qiAndMSwFR%3A%28transport%29&q.op=AND&start=0&rows=5&fl=id,domainId

No response from Solr on complex request after several days

2010-10-28 Thread Xavier Schepler
Hi, We are in a beta testing phase, with several users a day. After several days of waiting, the solr server didn't respond to requests that require a lot of processing time. I'm using Solr inside Tomcat. This is the request that had no response from the server : wt=json&omitHeader=true&q=q

Re: More like this and terms positions

2010-10-04 Thread Xavier Schepler
On 04/10/2010 16:40, Robert Muir wrote: On Mon, Oct 4, 2010 at 10:16 AM, Xavier Schepler< xavier.schep...@sciences-po.fr> wrote: Hi, does the more like this search uses terms positions information in the score formula ? no, it would be nice if it did use them though (base

More like this and terms positions

2010-10-04 Thread Xavier Schepler
Hi, does the more like this search uses terms positions information in the score formula ?

Re: Phrase search + multi-word index time expanded synonym

2010-09-08 Thread Xavier Schepler
On 08/09/2010 12:21, Grijesh.singh wrote: see the analysis.jsp with debug verbose and see what happens at index time and search time during analysis with your data Also u can use debugQuery=on for seeing what actually parsed query is. - Grijesh I've found a first solution by myself, usi

Phrase search + multi-word index time expanded synonym

2010-09-08 Thread Xavier Schepler
t file : ... PS,Parti socialiste ... and here's the query : "PS et". It returns no result, whereas "Parti socialiste et" returns the results. How can I have both queries working ? I'm thinking about different configurations but I didn't found any solution at the moment. Thx for reading, Xavier Schepler

Re: spellcheck distance measure algorithms error ?

2010-09-03 Thread Xavier Schepler
On 03/09/2010 15:31, Grant Ingersoll wrote: On Sep 3, 2010, at 9:14 AM, Xavier Schepler wrote: On 03/09/2010 14:47, Grant Ingersoll wrote: On Sep 3, 2010, at 6:02 AM, Xavier Schepler wrote: no, jopsin isn't in the index. I tryed this with other words and I had the

Re: spellcheck distance measure algorithms error ?

2010-09-03 Thread Xavier Schepler
On 03/09/2010 14:47, Grant Ingersoll wrote: On Sep 3, 2010, at 6:02 AM, Xavier Schepler wrote: Hi, When I take the two letters from the middle of a word and put the first in place of the second and the second in place of the first, ex : jospin => jopsin, I don't get any suggest

spellcheck distance measure algorithms error ?

2010-09-03 Thread Xavier Schepler
Hi, When I take the two letters from the middle of a word and put the first in place of the second and the second in place of the first, ex : jospin => jopsin, I don't get any suggestion from the spellchecker component. I tryed the default algorithm and the Jaro Winkler Distance, with a coef

Re: Proximity search + Highlighting

2010-09-01 Thread Xavier Schepler
On 01/09/2010 13:54, Xavier Schepler wrote: On 01/09/2010 12:38, Markus Jelsma wrote: I think you need to enable usePhraseHighlighter in order to use the highlightMultiTerm parameter. On Wednesday 01 September 2010 12:12:11 Xavier Schepler wrote: Hi, can the highlighting component

Re: Proximity search + Highlighting

2010-09-01 Thread Xavier Schepler
On 01/09/2010 12:38, Markus Jelsma wrote: I think you need to enable usePhraseHighlighter in order to use the highlightMultiTerm parameter. On Wednesday 01 September 2010 12:12:11 Xavier Schepler wrote: Hi, can the highlighting component highlight terms only if the distance between them

Proximity search + Highlighting

2010-09-01 Thread Xavier Schepler
Hi, can the highlighting component highlight terms only if the distance between them matches the query ? I use those parameters : hl=on&hl.fl=qFR,iFR,mFR,vlFR&hl.usePhraseHighlighter=false&hl.highlightMultiTerm=true&hl.simple.pre=&hl.simple.post=<%2Fb>&hl.mergeContiguous=false

Re: Highlighting, return the matched terms only

2010-08-31 Thread Xavier Schepler
Chris Hostetter wrote: : how could I have the highlighting component return only the terms that were : matched, without any surrounding text ? I'm not a Highlighter expert, but this is somethign that certainly *sounds* like it should be easy. I took a shot at it and this is hte best i could c

Expanded Synonyms + phrase search

2010-08-30 Thread Xavier Schepler
Hi, several documents from my index contain the phrase : "PS et". However, PS is expanded to "parti socialiste" and a phrase search for "PS et" fails. A phrase search for "parti socialiste et" succeeds. Can I have both queries working ? Here's the field type :

Highlighting, return the matched terms only

2010-08-03 Thread Xavier Schepler
Hi, how could I have the highlighting component return only the terms that were matched, without any surrounding text ?

Multi word synonyms + highlighting

2010-06-04 Thread Xavier Schepler
Hi, Here's a field type using synonyms : synonyms="french-synonyms.txt" ignoreCase="true" expand="true"/> mapping="mapping-ISOLatin1Accent.txt"/> mapping="mapping-ISOLatin1Accent.txt"/> Here are the contents of 'french-synonyms.txt' that I used for testing : PC,parti co

Re: Targeting two fields with the same query or one field gathering contents from both ?

2010-05-17 Thread Xavier Schepler
7; in A and C Marco Martínez Bautista http://www.paradigmatecnologico.com Avenida de Europa, 26. Ática 5. 3ª Planta 28224 Pozuelo de Alarcón Tel.: 91 352 59 42 2010/5/17 Xavier Schepler Hey, let's say I have : - a field named A with specific contents - a field named B with specific co

Re: Targeting two fields with the same query or one field gathering contents from both ?

2010-05-17 Thread Xavier Schepler
Le 17/05/2010 16:57, Xavier Schepler a écrit : Hey, let's say I have : - a field named A with specific contents - a field named B with specific contents - a field named C witch contents only from A and B added with copyField. Are those queries equivalents in terms of performance :

Targeting two fields with the same query or one field gathering contents from both ?

2010-05-17 Thread Xavier Schepler
Hey, let's say I have : - a field named A with specific contents - a field named B with specific contents - a field named C witch contents only from A and B added with copyField. Are those queries equivalents in terms of performance : - A: (the lazy fox) AND B: (the lazy fox) - C: (the lazy

Re: What hardware do I need ?

2010-04-23 Thread Xavier Schepler
From: Xavier Schepler To: solr-user@lucene.apache.org Sent: Fri, April 23, 2010 11:01:24 AM Subject: What hardware do I need ? Hi, I'm working with Solr 1.4. My schema has about 50 fields. I'm using full text search in short strings (~ 30-100 terms) and facet

What hardware do I need ?

2010-04-23 Thread Xavier Schepler
Hi, I'm working with Solr 1.4. My schema has about 50 fields. I'm using full text search in short strings (~ 30-100 terms) and facetted search. My index will have 100 000 documents. The number of requests per second will be low. Let's say between 0 and 1000 because of auto-complete. Is a st

More like this - setting a minimum number of terms used to build queries

2010-03-29 Thread Xavier Schepler
Hey, Is there a way to make the "more like this" feature build its queries from a minimum number of interesting terms ? It looks like this component fires query with only 1 term in them. I got a lot of results that aren't similar at all with the parsed document fields. My parameters : &mlt.

Highlighting inside a field with HTML contents

2010-02-22 Thread Xavier Schepler
Hello, this field would not be searched, but it would be used to display results. A query could be : q=table&hl=true&hl.fl=htmlfield&hl.fragsize=0 It would be tokenized with the HTMLStripStandardTokenizerFactory, then analyzed the same way as the searcheable fields. Could this result in hig

Re: Need feedback on solr security

2010-02-17 Thread Xavier Schepler
Xavier Schepler wrote: Vijayant Kumar wrote: Hi Xavier, Thanks for your feedback the firewall rule for the trusted IP is not fessiable for us because the application is open for public so we can not work through IP banning. Vijayant Kumar wrote: Hi Group, I need some feedback on solr

Re: Need feedback on solr security

2010-02-17 Thread Xavier Schepler
Vijayant Kumar wrote: Hi Xavier, Thanks for your feedback the firewall rule for the trusted IP is not fessiable for us because the application is open for public so we can not work through IP banning. Vijayant Kumar wrote: Hi Group, I need some feedback on solr security. For Making

Re: Need feedback on solr security

2010-02-17 Thread Xavier Schepler
Vijayant Kumar wrote: Hi Group, I need some feedback on solr security. For Making by solr admin password protected, I had used the Path Based Authentication form http://wiki.apache.org/solr/SolrSecurity. In this way my admin area,search,delete,add to index is protected.But Now when I make s

Phrase similarity - "more like this" feature for small set of terms

2010-02-15 Thread Xavier Schepler
Hi, there is an indexed field in my Solr's schema, in which one phrase is stored per document. I have to implement a feature that will allow users to have "more like this" results, based on the contents of this field. I think that the Solr's built in "more like this" feature requires too many

Re: Dynamic fields with more than 100 fields inside

2010-02-09 Thread Xavier Schepler
Shalin Shekhar Mangar a écrit : On Tue, Feb 9, 2010 at 2:43 PM, Xavier Schepler < xavier.schep...@sciences-po.fr> wrote: Shalin Shekhar Mangar a écrit : On Mon, Feb 8, 2010 at 9:47 PM, Xavier Schepler < xavier.schep...@sciences-po.fr> wrote: Hey, I'm thi

Re: Dynamic fields with more than 100 fields inside

2010-02-09 Thread Xavier Schepler
Shalin Shekhar Mangar a écrit : On Mon, Feb 8, 2010 at 9:47 PM, Xavier Schepler < xavier.schep...@sciences-po.fr> wrote: Hey, I'm thinking about using dynamic fields. I need one or more user specific field in my schema, for example, "concept_user_*", and I will hav

Dynamic fields with more than 100 fields inside

2010-02-08 Thread Xavier Schepler
Hey, I'm thinking about using dynamic fields. I need one or more user specific field in my schema, for example, "concept_user_*", and I will have maybe more than 200 users using this feature. One user will send and retrieve values from its field. It will then be used to filter result. How w

Need help with highlighting (detailed problem with code samples)

2010-01-14 Thread Xavier Schepler
Hey, I have highlighting working with this field : when I query the solr server this way : /select/?q=election&version=2.2&start=0&rows=10&indent=on&hl=on&hl.fl=questionsLabelsFr&hl.fragsize=0 I got results like this : Au second tour des élections législatives dimanche prochain, avez

Re: Field highlighting

2010-01-07 Thread Xavier Schepler
10 at 7:47 AM, Xavier Schepler < xavier.schep...@sciences-po.fr> wrote: Hi, I'm trying to highlight short text values. The field they came from has a type shared with other fields. I have highlighting working on other fields but not on this one. Why ? Thanks for your re

Field highlighting

2010-01-07 Thread Xavier Schepler
Hi, I'm trying to highlight short text values. The field they came from has a type shared with other fields. I have highlighting working on other fields but not on this one. Why ?