Scoring with wild cars
}, expand:{ time:0.0}, debug:{ time:1.0} Romain PIGEYRE Centre de service de Lyon [Sopra] Sopra Parc du Puy d'Or 72 Allée des Noisetiers - CS 10137 69578 - LIMONEST France Phone : +33 (0)4 37 26 43 33 romain.pige...@sopra.commailto:romain.pige...@sopra.com - www.sopra.comhttp://www.sopra.com [cid:image004.png@01CFD833.DFE6CB90]http://www.linkedin.com/company/sopra [cid:image006.png@01CFD833.DFE6CB90] https://www.youtube.com/user/SopraChannel [cid:image008.png@01CFD833.DFE6CB90] https://www.facebook.com/sopragroup [cid:image010.png@01CFD833.DFE6CB90] https://twitter.com/soprarh [cid:image012.png@01CFD833.DFE6CB90] http://fr.viadeo.com/fr/company/sopra Ce message peut contenir des informations confidentielles dont la divulgation est à ce titre rigoureusement interdite en l'absence d'autorisation explicite de l'émetteur. Dans l'hypothèse où vous auriez reçu par erreur ce message, merci de le renvoyer à l'émetteur et de détruire toute copie. P Pensez à l'environnement avant d'imprimer.
Using def function in fl criteria,
Hi I'm trying to use a query with fl=name_UK,name_FRA,itemDesc:def(name_UK,name_FRA) As you can see, the itemDesc field (builded by solr) is truncated : { name_UK: MEN S SUIT\n, name_FRA: 24 RELAX 2 BTS ST GERMAIN TOILE FLAMMEE LIN ET SOIE, itemDesc: suit } Do you have any idea to change it? Thanks. Regards, Romain
Re: Using def function in fl criteria,
I want to return : -the field name_UK (if it exists) -Otherwise the name_FRA field ... into an alias field (itemDesc, created at query time). There is no schema definition for itemDesc because, it is only a virtual field declared in fl= criteria. I don't understand while filter is applying to this field. On Tue, Sep 9, 2014 at 17:44 AM, Erick Erickson erickerick...@gmail.commailto:erickerick...@gmail.com wrote: I'm really confused about what you're trying to do here. What do you intend the syntax itemDesc:def(name_UK,name_FRA) to do? It's also really difficult to say much of anything unless we see the schema definition for itemDesc and sample input. Likely you're somehow applying an analysis chain that is truncating the input. Or it's also possible that you aren't indexing quite what you think you are. Best, Erick On Tue, Sep 9, 2014 at 4:36 AM, Pigeyre Romain romain.pige...@sopra.commailto:romain.pige...@sopra.com wrote: Hi I'm trying to use a query with fl=name_UK,name_FRA,itemDesc:def(name_UK,name_FRA) As you can see, the itemDesc field (builded by solr) is truncated : { name_UK: MEN S SUIT\n, name_FRA: 24 RELAX 2 BTS ST GERMAIN TOILE FLAMMEE LIN ET SOIE, itemDesc: suit } Do you have any idea to change it? Thanks. Regards, Romain
Using wild characters in query doesn't work with my configuraiton
Hi, I have a little mistake using Solr : I can query this : lastName:HK+IE The result contains the next record : { customerId: 0003500226598, countryLibelle: HONG KONG, firstName1: lC /o, countryCode: HK, address1: 1F0/, address2: 11-35, storeId: 100, lastName1: HK IE, city: HONG KONG, _version_: 1477612965227135000 } NB : lastName contains the lastName1 field. When I'm adding * on the same query : lastName:*HK*+*IE*, there is no result. I hoped that the * character replace 0 to n character. Here is my configuration : field name=lastName type=text_general indexed=true stored=false multiValued=true/ copyField source=lastName1 dest=lastName/ copyField source=lastName2 dest=lastName/ fieldType name=text_general class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.StopFilterFactory ignoreCase=true words=stopwords.txt / !-- in this example, we will only use synonyms at query time filter class=solr.SynonymFilterFactory synonyms=index_synonyms.txt ignoreCase=true expand=false/ -- filter class=solr.LowerCaseFilterFactory/ /analyzer analyzer type=query tokenizer class=solr.StandardTokenizerFactory/ filter class=solr.StopFilterFactory ignoreCase=true words=stopwords.txt / filter class=solr.SynonymFilterFactory synonyms=synonyms.txt ignoreCase=true expand=true/ filter class=solr.LowerCaseFilterFactory/ /analyzer /fieldType I'm using a WhitespaceTokenizerFactory at indexing time in order to keep specials characters : /?... After this configuration, I restarted Solr and re-indexed data. Is Somebody have any idea to resolve this issue? Thanks a lot -- *-Romain PIGEYRE*
Re: Histogram facet?
This looks nice! The only missing piece for more interactivity would be to be able to map multiple field values into the same bucket. e.g. http://localhost:8983/solr/query? q=*:* facet=true facet.field=*round(date, '15MINUTES')* facet.stat=sum(retweetCount) This is a bit similar to SOLR-4772https://issues.apache.org/jira/browse/SOLR-4772for the rounding. Then we could zoom out just by changing the size of the bucket, without any index change, e.g.: http://localhost:8983/solr/query? q=*:* facet=true facet.field=*round(date, '1HOURS')* facet.stat=sum(retweetCount) Romain On Tue, May 6, 2014 at 10:09 AM, Yonik Seeley yo...@heliosearch.com wrote: On Mon, May 5, 2014 at 6:18 PM, Romain romain@gmail.com wrote: Hi, I am trying to plot a non date field by time in order to draw an histogram showing its evolution during the week. For example, if I have a tweet index: Tweet: date retweetCount 3 tweets indexed: Tweet | Date | Retweet A01/01 100 B01/01 100 C01/02 100 If I want to plot the number of tweets by day: easy with a date range facet: Day 1: 2 Day 2: 1 But now counting the number of retweet by day is not possible natively: Day 1: 200 Day 2: 100 Check out facet functions in Heliosearch (an experimental fork of Solr): http://heliosearch.org/solr-facet-functions/ All you would need to do is add: facet.stat=sum(retweetCount) -Yonik http://heliosearch.org - solve Solr GC pauses with off-heap filters and fieldcache
Re: Histogram facet?
This is super nice, I tried (even without subfacets) and it works! Thanks a lot! Romain facet=truefacet.range=pricefacet.range.start=0facet.range.end=1000facet.range.gap=100facet.stat=avg(popularity) facets: { price: { buckets: [ { val: 0.0, avg(popularity): 3.5714285714285716 }, { val: 100.0, avg(popularity): 5.5 }, { val: 200.0, avg(popularity): 6 }, { val: 300.0, avg(popularity): 7.667 }, { val: 400.0, avg(popularity): 7 }, { val: 500.0, avg(popularity): NaN }, { val: 600.0, avg(popularity): 7}, { val: 700.0, avg(popularity): NaN }, { val: 800.0, avg(popularity): NaN }, { val: 900.0, avg(popularity): NaN } ], gap: 100, start: 0, end: 1000 } On Tue, May 6, 2014 at 3:15 PM, Yonik Seeley yo...@heliosearch.com wrote: On Tue, May 6, 2014 at 5:30 PM, Romain Rigaux rom...@cloudera.com wrote: This looks nice! The only missing piece for more interactivity would be to be able to map multiple field values into the same bucket. e.g. http://localhost:8983/solr/query? q=*:* facet=true facet.field=*round(date, '15MINUTES')* facet.stat=sum(retweetCount) This is a bit similar to SOLR-4772https://issues.apache.org/jira/browse/SOLR-4772for the rounding. Then we could zoom out just by changing the size of the bucket, without any index change, e.g.: http://localhost:8983/solr/query? q=*:* facet=true facet.field=*round(date, '1HOURS')* facet.stat=sum(retweetCount) For this specific example, I think map multiple field values into the same bucket equates to a range facet? facet.range=mydatefield facet.range.start=... facet.range.end=... facet.range.gap=+1HOURS facet.stat=sum(retweetCount) And then if you need additional breakouts by time range, you can use subfacets: subfacet.mydatefield.field=mycategoryfield That will provide retweet counts broken out by mycategoryfield for every bucket produced by the range query. See http://heliosearch.org/solr-subfacets/ -Yonik http://heliosearch.org - facet functions, subfacets, off-heap filtersfieldcache
Histogram facet?
Hi, I am trying to plot a non date field by time in order to draw an histogram showing its evolution during the week. For example, if I have a tweet index: Tweet: date retweetCount 3 tweets indexed: Tweet | Date | Retweet A01/01 100 B01/01 100 C01/02 100 If I want to plot the number of tweets by day: easy with a date range facet: Day 1: 2 Day 2: 1 But now counting the number of retweet by day is not possible natively: Day 1: 200 Day 2: 100 On current workaround would be to do a date rage facet to get the date slots and ask only for the retweet field and compute the sums in the client. We could compute other stats like average, etc... too The closest I could see was https://issues.apache.org/jira/browse/SOLR-4772but it seems to be slightly different. Basically I am trying to do something very similar to the Date Histogram Facethttp://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-facets-date-histogram-facet.html#search-facets-date-histogram-facetin ES. Is there a way to move the counting logic to the Solr server? Thanks! Romain
Re: Histogram facet?
The dates won't match unless you truncate all of them to day. But then if you want to have slots of 15minutes it won't work as you would need to truncate the dates every 15minutes in the index. In ES, they have 1 field to make the slots and 1 field to insert into the bucket, e.g.: { query : { match_all : {} }, facets : { histo1 : { date_histogram : { key_field : timestamp, value_field : price, interval : day } } } } Romain On Mon, May 5, 2014 at 9:05 PM, Erick Erickson erickerick...@gmail.comwrote: Hmmm, I _think_ pivot faceting works here. One dimension would be day and the other retweet count. The response will have the number of retweets per day, you'd have to sum them up I suppose. Best, Erick On Mon, May 5, 2014 at 3:18 PM, Romain romain@gmail.com wrote: Hi, I am trying to plot a non date field by time in order to draw an histogram showing its evolution during the week. For example, if I have a tweet index: Tweet: date retweetCount 3 tweets indexed: Tweet | Date | Retweet A01/01 100 B01/01 100 C01/02 100 If I want to plot the number of tweets by day: easy with a date range facet: Day 1: 2 Day 2: 1 But now counting the number of retweet by day is not possible natively: Day 1: 200 Day 2: 100 On current workaround would be to do a date rage facet to get the date slots and ask only for the retweet field and compute the sums in the client. We could compute other stats like average, etc... too The closest I could see was https://issues.apache.org/jira/browse/SOLR-4772but it seems to be slightly different. Basically I am trying to do something very similar to the Date Histogram Facet http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-facets-date-histogram-facet.html#search-facets-date-histogram-facet in ES. Is there a way to move the counting logic to the Solr server? Thanks! Romain
Re: Query with star returns double type values equal 0
Hi iorixxx, I am using lucene On Monday, October 17, 2011 5:58:31 PM, iorixxx [via Lucene] wrote: I am experiencing an unexpected behavior using solr 3.4.0. if my query includes a star, all the properties of type 'long' or 'LatLon' have 0 as value (ex: select/?start=0q=way*rows=10version=2) Though the same request without stars returns correct values (ex: select/?start=0q=wayrows=10version=2) Does anyone have an idea? Please keep in mind that wildcard queries are not analyzed. What query parser are you using? lucene, dismax, edismax? If you reply to this email, your message will be added to the discussion below: http://lucene.472066.n3.nabble.com/Query-with-star-returns-double-type-values-equal-0-tp3428721p3429578.html To unsubscribe from Query with star returns double type values equal 0, click here http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=unsubscribe_by_codenode=3428721code=cm9tYWluLmR1cGFzQGdtYWlsLmNvbXwzNDI4NzIxfDE3MzgwNjIyOTA=. -- View this message in context: http://lucene.472066.n3.nabble.com/Query-with-star-returns-double-type-values-equal-0-tp3428721p3432312.html Sent from the Solr - User mailing list archive at Nabble.com.
Query with star returns double type values equal 0
Hello, I am experiencing an unexpected behavior using solr 3.4.0. if my query includes a star, all the properties of type 'long' or 'LatLon' have 0 as value (ex: select/?start=0q=way*rows=10version=2) Though the same request without stars returns correct values (ex: select/?start=0q=wayrows=10version=2) Does anyone have an idea? Romain. -- View this message in context: http://lucene.472066.n3.nabble.com/Query-with-star-returns-double-type-values-equal-0-tp3428721p3428721.html Sent from the Solr - User mailing list archive at Nabble.com.