term frequencies in sharded environment

2011-09-20 Thread Massimo Schiavon
Seems that when I submit a query in a sharded environment the idf 
component of the scoring formula takes into consideration the local 
terms frequencies (local to the single shard index). The effect of that 
is that the calculation is correct only if the distribution terms in the 
shards is balanced.


Are there any way to avoid that? Perhaps by using cumulative frequencies 
in the calculation? Anything else?


Regards

Massimo


Re: Delete documents with empty fields

2011-09-15 Thread Massimo Schiavon

On 15/09/2011 13:01, Ahmet Arslan wrote:

+*:* -title:[* TO *]


Worked fine.
Thanks a lot!


Massimo


Delete documents with empty fields

2011-09-15 Thread Massimo Schiavon

I want to delete all documents with empty title field.
If i run the query -title:[* TO *] I obtain the correct list of 
documents but when I submit to solr the delete command:


curl http://localhost:8080/solr/web/update\?commit=true -H 
'Content-Type: text/xml' --data-binary \

'-title:[* TO *]'

none of the documents were deleted.

After a bit of debugging I have noted that the query was internally 
rewritten by org.apache.lucene.search.Searcher.createNormalizedWeight to 
an empty query.


It is a bug or there is another way to do this operation? (or there is 
no way?)



Regards

Massimo


Re: Possible bug in FastVectorHighlighter

2011-08-10 Thread Massimo Schiavon

Worked fine. Thanks a lot!

Massimo

On 09/08/2011 11:58, Jayendra Patil wrote:

Try using -

  
  

Regards,
Jayendra


On Tue, Aug 9, 2011 at 4:46 AM, Massimo Schiavon  wrote:

In my Solr (3.3) configuration I specified these two params:




when I do a simple search I obtain correctly highlighted results where
matches areenclosed with correct tag.
If I do the same request with hl.useFastVectorHighlighter=true in the http
query string (or specifying the same parameter in the config file) the
metches are enclosed with  tag (the default value).

Anyone has encountered the same




Possible bug in FastVectorHighlighter

2011-08-09 Thread Massimo Schiavon

In my Solr (3.3) configuration I specified these two params:




when I do a simple search I obtain correctly highlighted results where 
matches areenclosed with correct tag.
If I do the same request with hl.useFastVectorHighlighter=true in the 
http query string (or specifying the same parameter in the config file) 
the metches are enclosed with  tag (the default value).


Anyone has encountered the same issue?


Encoding of alternate fields in highlighting

2011-06-16 Thread Massimo Schiavon
I have an index with various fields and I want to highlight query 
matchings on "title" and "content" fields.
These fields could contain html tags so I've configured HtmlFormatter 
for highlighting. The problem is that if the query doesn't match the 
text of the field, solr returns the value of configured alternate field 
without encoding it.
Is there any way to get encoded value also for alternate fields? And in 
general there is a way to do html escaping on values returned from a 
response writer?


I'm using solr 3.1 and here is an excerpt from requestHandler configuration

[...]
json
true
title,content


1024
title
512
1
content
512
2
[...]

and from highlighting configuration

[...]

default="true">


default="true" />
class="org.apache.solr.highlight.ScoreOrderFragmentsBuilder"

default="true" />

[...]

Thanks
Massimo

--
DISCLAIMER: This e-mail and any attachment is for authorised use by
the intended recipient(s) only. It may contain proprietary material,
confidential information and/or be subject to legal privilege. It
should not be copied, disclosed to, retained or used by, any other
party. If you are not an intended recipient then please promptly
delete this e-mail and any attachment and all copies and inform
the sender. Thank you.