Scoring with wild cars

2014-09-24 Thread Pigeyre Romain
},

expand:{

  time:0.0},

debug:{

  time:1.0}





Romain PIGEYRE
Centre de service de Lyon

[Sopra]

Sopra
Parc du Puy d'Or
72 Allée des Noisetiers - CS 10137
69578 - LIMONEST
France
Phone : +33 (0)4 37 26 43 33
romain.pige...@sopra.commailto:romain.pige...@sopra.com - 
www.sopra.comhttp://www.sopra.com


[cid:image004.png@01CFD833.DFE6CB90]http://www.linkedin.com/company/sopra 
[cid:image006.png@01CFD833.DFE6CB90] 
https://www.youtube.com/user/SopraChannel  
[cid:image008.png@01CFD833.DFE6CB90] https://www.facebook.com/sopragroup  
[cid:image010.png@01CFD833.DFE6CB90] https://twitter.com/soprarh  
[cid:image012.png@01CFD833.DFE6CB90] http://fr.viadeo.com/fr/company/sopra
Ce message peut contenir des informations confidentielles dont la divulgation 
est à ce titre rigoureusement interdite en l'absence d'autorisation explicite 
de l'émetteur. Dans l'hypothèse où vous auriez reçu par erreur ce message, 
merci de le renvoyer à l'émetteur et de détruire toute copie.

P Pensez à l'environnement avant d'imprimer.



Using def function in fl criteria,

2014-09-09 Thread Pigeyre Romain
Hi

I'm trying to use a query with 
fl=name_UK,name_FRA,itemDesc:def(name_UK,name_FRA)
As you can see, the itemDesc field (builded by solr) is truncated :

{
name_UK: MEN S SUIT\n,
name_FRA: 24 RELAX 2 BTS ST GERMAIN TOILE FLAMMEE LIN ET SOIE,
itemDesc: suit
  }

Do you have any idea to change it?

Thanks.

Regards,

Romain


Re: Using def function in fl criteria,

2014-09-09 Thread Pigeyre Romain
I want to return :

-the field name_UK (if it exists)

-Otherwise the name_FRA field
... into an alias field (itemDesc, created at query time).

There is no schema definition for itemDesc because, it is only a virtual field 
declared in fl= criteria. I don't understand while filter is applying to this 
field.

On Tue, Sep 9, 2014 at 17:44 AM, Erick Erickson 
erickerick...@gmail.commailto:erickerick...@gmail.com wrote:

 I'm really confused about what you're trying to do here. What do you
 intend the syntax
 itemDesc:def(name_UK,name_FRA)
 to do?

 It's also really difficult to say much of anything unless we see the
 schema definition for itemDesc and sample input.

 Likely you're somehow applying an analysis chain that is truncating
 the input. Or it's also possible that you aren't indexing quite what
 you think you are.

 Best,
 Erick

 On Tue, Sep 9, 2014 at 4:36 AM, Pigeyre Romain 
 romain.pige...@sopra.commailto:romain.pige...@sopra.com wrote:
  Hi
 
  I'm trying to use a query with 
  fl=name_UK,name_FRA,itemDesc:def(name_UK,name_FRA)
  As you can see, the itemDesc field (builded by solr) is truncated :
 
  {
  name_UK: MEN S SUIT\n,
  name_FRA: 24 RELAX 2 BTS ST GERMAIN TOILE FLAMMEE LIN ET SOIE,
  itemDesc: suit
}
 
  Do you have any idea to change it?
 
  Thanks.
 
  Regards,
 
  Romain




Using wild characters in query doesn't work with my configuraiton

2014-08-27 Thread Romain Pigeyre
Hi,

I have a little mistake using Solr :

I can query this : lastName:HK+IE
The result contains the next record :
{ customerId: 0003500226598, countryLibelle: HONG KONG, firstName1:
lC /o, countryCode: HK, address1:  1F0/, address2: 11-35, 
storeId: 100, lastName1: HK IE, city: HONG KONG, _version_:
1477612965227135000 }
NB : lastName contains the lastName1 field.

When I'm adding * on the same query : lastName:*HK*+*IE*, there is no
result. I hoped that the * character replace 0 to n character.

Here is my configuration :
field name=lastName type=text_general indexed=true stored=false
multiValued=true/

copyField source=lastName1 dest=lastName/
copyField source=lastName2 dest=lastName/

fieldType name=text_general class=solr.TextField
positionIncrementGap=100
  analyzer type=index
tokenizer class=solr.WhitespaceTokenizerFactory/
filter class=solr.StopFilterFactory ignoreCase=true
words=stopwords.txt /
!-- in this example, we will only use synonyms at query time
filter class=solr.SynonymFilterFactory
synonyms=index_synonyms.txt ignoreCase=true expand=false/
--
filter class=solr.LowerCaseFilterFactory/
  /analyzer
  analyzer type=query
tokenizer class=solr.StandardTokenizerFactory/
filter class=solr.StopFilterFactory ignoreCase=true
words=stopwords.txt /
filter class=solr.SynonymFilterFactory synonyms=synonyms.txt
ignoreCase=true expand=true/
filter class=solr.LowerCaseFilterFactory/
  /analyzer
/fieldType

I'm using a WhitespaceTokenizerFactory at indexing time in order to keep
specials characters : /?...
After this configuration, I restarted Solr and re-indexed data.

Is Somebody have any idea to resolve this issue?

Thanks a lot

-- 

*-Romain PIGEYRE*


Re: Histogram facet?

2014-05-06 Thread Romain Rigaux
This looks nice!

The only missing piece for more interactivity would be to be able to map
multiple field values into the same bucket.

e.g.

http://localhost:8983/solr/query?
   q=*:*
   facet=true
   facet.field=*round(date, '15MINUTES')*
   facet.stat=sum(retweetCount)

This is a bit similar to
SOLR-4772https://issues.apache.org/jira/browse/SOLR-4772for the
rounding.

Then we could zoom out just by changing the size of the bucket, without any
index change, e.g.:
http://localhost:8983/solr/query?
   q=*:*
   facet=true
   facet.field=*round(date, '1HOURS')*
   facet.stat=sum(retweetCount)

Romain

On Tue, May 6, 2014 at 10:09 AM, Yonik Seeley yo...@heliosearch.com wrote:

 On Mon, May 5, 2014 at 6:18 PM, Romain romain@gmail.com wrote:
  Hi,
 
  I am trying to plot a non date field by time in order to draw an
 histogram
  showing its evolution during the week.
 
  For example, if I have a tweet index:
 
  Tweet:
date
retweetCount
 
  3 tweets indexed:
  Tweet | Date | Retweet
  A01/01   100
  B01/01   100
  C01/02   100
 
  If I want to plot the number of tweets by day: easy with a date range
 facet:
  Day 1: 2
  Day 2: 1
 
  But now counting the number of retweet by day is not possible natively:
  Day 1: 200
  Day 2: 100

 Check out facet functions in Heliosearch (an experimental fork of Solr):
 http://heliosearch.org/solr-facet-functions/

 All you would need to do is add:
 facet.stat=sum(retweetCount)

 -Yonik
 http://heliosearch.org - solve Solr GC pauses with off-heap filters
 and fieldcache



Re: Histogram facet?

2014-05-06 Thread Romain Rigaux
This is super nice, I tried (even without subfacets) and it works! Thanks a
lot!

Romain

facet=truefacet.range=pricefacet.range.start=0facet.range.end=1000facet.range.gap=100facet.stat=avg(popularity)


facets: { price: { buckets: [ { val: 0.0, avg(popularity):
3.5714285714285716 }, { val: 100.0, avg(popularity): 5.5 }, { val:
200.0, avg(popularity): 6 }, { val: 300.0, avg(popularity):
7.667 }, { val: 400.0, avg(popularity): 7 }, { val:
500.0, avg(popularity): NaN }, { val: 600.0, avg(popularity): 7},
{ val: 700.0, avg(popularity): NaN }, { val: 800.0, 
avg(popularity): NaN }, { val: 900.0, avg(popularity): NaN } ], 
gap: 100, start: 0, end: 1000 }


On Tue, May 6, 2014 at 3:15 PM, Yonik Seeley yo...@heliosearch.com wrote:

 On Tue, May 6, 2014 at 5:30 PM, Romain Rigaux rom...@cloudera.com wrote:
  This looks nice!
 
  The only missing piece for more interactivity would be to be able to map
  multiple field values into the same bucket.
 
  e.g.
 
  http://localhost:8983/solr/query?
 q=*:*
 facet=true
 facet.field=*round(date, '15MINUTES')*
 facet.stat=sum(retweetCount)
 
  This is a bit similar to
  SOLR-4772https://issues.apache.org/jira/browse/SOLR-4772for the
  rounding.
 
  Then we could zoom out just by changing the size of the bucket, without
 any
  index change, e.g.:
  http://localhost:8983/solr/query?
 q=*:*
 facet=true
 facet.field=*round(date, '1HOURS')*
 facet.stat=sum(retweetCount)

 For this specific example, I think map multiple field values into the
 same bucket equates to a range facet?

 facet.range=mydatefield
 facet.range.start=...
 facet.range.end=...
 facet.range.gap=+1HOURS
 facet.stat=sum(retweetCount)

 And then if you need additional breakouts by time range, you can use
 subfacets:

 subfacet.mydatefield.field=mycategoryfield

 That will provide retweet counts broken out by mycategoryfield for
 every bucket produced by the range query.

 See http://heliosearch.org/solr-subfacets/

 -Yonik
 http://heliosearch.org - facet functions, subfacets, off-heap
 filtersfieldcache



Histogram facet?

2014-05-05 Thread Romain
Hi,

I am trying to plot a non date field by time in order to draw an histogram
showing its evolution during the week.

For example, if I have a tweet index:

Tweet:
  date
  retweetCount

3 tweets indexed:
Tweet | Date | Retweet
A01/01   100
B01/01   100
C01/02   100

If I want to plot the number of tweets by day: easy with a date range facet:
Day 1: 2
Day 2: 1

But now counting the number of retweet by day is not possible natively:
Day 1: 200
Day 2: 100

On current workaround would be to do a date rage facet to get the date
slots and ask only for the retweet field and compute the sums in the
client. We could compute other stats like average, etc... too

The closest I could see was
https://issues.apache.org/jira/browse/SOLR-4772but it seems to be
slightly different.

Basically I am trying to do something very similar to the Date Histogram
Facethttp://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-facets-date-histogram-facet.html#search-facets-date-histogram-facetin
ES.

Is there a way to move the counting logic to the Solr server?

Thanks!

Romain


Re: Histogram facet?

2014-05-05 Thread Romain Rigaux
The dates won't match unless you truncate all of them to day. But then if
you want to have slots of 15minutes it won't work as you would need to
truncate the dates every 15minutes in the index.

In ES, they have 1 field to make the slots and 1 field to insert into the
bucket, e.g.:

{
query : {


match_all : {}


},
facets : {


histo1 : {


date_histogram : {


key_field : timestamp,


value_field : price,


interval : day


}
}


}
}

Romain


On Mon, May 5, 2014 at 9:05 PM, Erick Erickson erickerick...@gmail.comwrote:

 Hmmm, I _think_ pivot faceting works here. One dimension would be day
 and the other retweet count. The response will have the number of
 retweets per day, you'd have to sum them up I suppose.

 Best,
 Erick

 On Mon, May 5, 2014 at 3:18 PM, Romain romain@gmail.com wrote:
  Hi,
 
  I am trying to plot a non date field by time in order to draw an
 histogram
  showing its evolution during the week.
 
  For example, if I have a tweet index:
 
  Tweet:
date
retweetCount
 
  3 tweets indexed:
  Tweet | Date | Retweet
  A01/01   100
  B01/01   100
  C01/02   100
 
  If I want to plot the number of tweets by day: easy with a date range
 facet:
  Day 1: 2
  Day 2: 1
 
  But now counting the number of retweet by day is not possible natively:
  Day 1: 200
  Day 2: 100
 
  On current workaround would be to do a date rage facet to get the date
  slots and ask only for the retweet field and compute the sums in the
  client. We could compute other stats like average, etc... too
 
  The closest I could see was
  https://issues.apache.org/jira/browse/SOLR-4772but it seems to be
  slightly different.
 
  Basically I am trying to do something very similar to the Date Histogram
  Facet
 http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-facets-date-histogram-facet.html#search-facets-date-histogram-facet
 in
  ES.
 
  Is there a way to move the counting logic to the Solr server?
 
  Thanks!
 
  Romain



Re: Query with star returns double type values equal 0

2011-10-18 Thread romain
Hi iorixxx,

I am using lucene

On Monday, October 17, 2011 5:58:31 PM, iorixxx [via Lucene] wrote:
  I am experiencing an unexpected behavior using solr 3.4.0.
 
  if my query includes a star, all the properties of type
  'long' or 'LatLon'
  have 0 as value
  (ex: select/?start=0q=way*rows=10version=2)
 
  Though the same request without stars returns correct
  values
  (ex: select/?start=0q=wayrows=10version=2)
 
  Does anyone have an idea?

 Please keep in mind that wildcard queries are not analyzed.

 What query parser are you using? lucene, dismax, edismax?




 
 If you reply to this email, your message will be added to the 
 discussion below:
 http://lucene.472066.n3.nabble.com/Query-with-star-returns-double-type-values-equal-0-tp3428721p3429578.html
  

 To unsubscribe from Query with star returns double type values equal 
 0, click here 
 http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=unsubscribe_by_codenode=3428721code=cm9tYWluLmR1cGFzQGdtYWlsLmNvbXwzNDI4NzIxfDE3MzgwNjIyOTA=.
  



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Query-with-star-returns-double-type-values-equal-0-tp3428721p3432312.html
Sent from the Solr - User mailing list archive at Nabble.com.

Query with star returns double type values equal 0

2011-10-17 Thread romain
Hello,

I am experiencing an unexpected behavior using solr 3.4.0.

if my query includes a star, all the properties of type 'long' or 'LatLon'
have 0 as value
(ex: select/?start=0q=way*rows=10version=2)

Though the same request without stars returns correct values
(ex: select/?start=0q=wayrows=10version=2)

Does anyone have an idea?

Romain.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Query-with-star-returns-double-type-values-equal-0-tp3428721p3428721.html
Sent from the Solr - User mailing list archive at Nabble.com.