Re: Facet count mismatch between solr simple facet and Json facet API.
I am facing the same issue. Thanks for letting me know about the JIRA. I think this is a very big issue especially for those looking at Solr as a NoSQL analytics engine. Ritesh -- View this message in context: http://lucene.472066.n3.nabble.com/Facet-count-mismatch-between-solr-simple-facet-and-Json-facet-API-tp4242461p4242650.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Search for All CAPS words
Took me a while, but I tried it and it works perfectly. Thanks a lot! Ritesh -- View this message in context: http://lucene.472066.n3.nabble.com/Search-for-All-CAPS-words-tp4219893p4221597.html
Search for All CAPS words
Hi, I need the capability to search for /GATE/ separately from /gate/. I cannot remove the LowerCaseFilterFactory from either my index or my query analysis chain, since that would break many other search scenarios. Is there a way to attach a payload to (or otherwise mark) an ALL CAPS word in the index analyzer chain before it gets lowercased by the LowerCaseFilterFactory, so that I can search for it with some custom grammar and logic in my query parser? Say I want Field:_gate to match /GATE/ only, and Field:gate to match both /GATE/ and /gate/. Any pointers would be helpful. Thanks, Ritesh -- View this message in context: http://lucene.472066.n3.nabble.com/Search-for-All-CAPS-words-tp4219893.html
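A minimal sketch of the marking idea in the question above, in plain Python rather than as a Solr filter. It assumes that an ALL CAPS word is indexed twice: once lowercased, and once lowercased with a marker prefix. The function names and the "_" marker are hypothetical and only mirror the Field:_gate syntax proposed in the post; in Solr this would presumably need a custom token filter placed before the lowercase filter.

```python
def analyze(text, marker="_"):
    """Return index tokens; ALL CAPS words also emit a marker-prefixed token."""
    tokens = []
    for word in text.split():
        lowered = word.lower()
        tokens.append(lowered)
        # str.isupper() is True only when every cased character is uppercase.
        # Single letters such as "A" or "I" are skipped to avoid noise.
        if word.isupper() and len(word) > 1:
            tokens.append(marker + lowered)
    return tokens


def matches(query_term, text):
    """True if the (already lowercased or marked) query term is in the index tokens."""
    return query_term in analyze(text)


if __name__ == "__main__":
    print(analyze("The GATE is open"))
    print(matches("_gate", "the gate is open"))
```

With this scheme, a query for gate still matches both spellings because the plain lowercased token is always present, while _gate matches only documents where the word was written in ALL CAPS.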
Re: Search for All CAPS words
Thanks, and I did think of the copy field option. So what you are suggesting is that I have a copy field whose index and query analysis chains do not include the lowercase filter. I am afraid that would not help if my search query is complex with many words (say, a boolean query with proximity operators), because the full search string would have to go against the copy field (which has no lowercasing). The words other than /GATE/ would then not match properly. Ritesh -- View this message in context: http://lucene.472066.n3.nabble.com/Search-for-All-CAPS-words-tp4219893p4219959.html
Question on Solr Relevancy using Okapi BM25F
I am trying to get Okapi BM25F working over some press release articles I am indexing. The data has text portions spread across 3 fields: Title, Summary and Full Article. I would like to influence the standard BM25 score by giving more weight to words in the title and summary of the article than to words in the full article. The order of importance has to be Title > Summary > Full Article. I am unable to find schema examples online that can help me with this. Can someone guide me to a possible schema for this (or a link to an article/blog that explains it)? Thanks for your help. -Ritesh -- View this message in context: http://lucene.472066.n3.nabble.com/Question-on-Solr-Relevancy-using-Okapi-BM25F-tp4153866.html
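For reference, a minimal sketch of how BM25F-style field weighting is usually described, written in plain Python and not tied to any Solr similarity class. It assumes the common formulation in which per-field term frequencies are length-normalized, multiplied by a field weight, summed into one pseudo-frequency, and only then saturated. The function name, the weights (3, 2, 1) and the parameter defaults are illustrative assumptions, not values from the post.

```python
import math


def bm25f_score(query_terms, doc, weights, avg_len, doc_freq, n_docs,
                k1=1.2, b=0.75):
    """Score one document; doc maps field name -> list of tokens."""
    score = 0.0
    for term in query_terms:
        pseudo_tf = 0.0
        for field, weight in weights.items():
            tokens = doc.get(field, [])
            if not tokens:
                continue
            tf = tokens.count(term)
            # Per-field length normalization, as in BM25.
            norm = 1.0 - b + b * len(tokens) / avg_len[field]
            pseudo_tf += weight * tf / norm
        if pseudo_tf == 0.0:
            continue
        df = doc_freq.get(term, 0)
        idf = math.log(1.0 + (n_docs - df + 0.5) / (df + 0.5))
        # Saturation is applied once, on the combined pseudo-frequency.
        score += idf * pseudo_tf / (k1 + pseudo_tf)
    return score


if __name__ == "__main__":
    weights = {"title": 3.0, "summary": 2.0, "full": 1.0}  # assumed weights
    avg_len = {"title": 2.0, "summary": 2.0, "full": 4.0}
    doc = {"title": ["metal", "prices"], "summary": ["quarterly", "report"],
           "full": ["the", "report", "covers", "markets"]}
    print(bm25f_score(["metal"], doc, weights, avg_len, {"metal": 2}, 10))
```

A query-time approximation that is often used instead is to boost fields in the dismax/edismax qf parameter (for example Title^3 Summary^2), but that combines separately saturated per-field scores and is therefore not the same thing as BM25F.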
Re: Group.Facet issue in Sharded Solr Setup
Quick follow-up on my question below: is anyone using group.facet in a sharded Solr setup? Based on further testing, the group.facet counts do not seem reliable at all for the less popular items in the facet list. -- View this message in context: http://lucene.472066.n3.nabble.com/Group-Facet-issue-in-Sharded-Solr-Setup-tp4116077p4116635.html
Group.Facet issue in Sharded Solr Setup
I am facing an issue with counts when using group.facet in my sharded Solr setup. (Groups do not overlap across shards, and for various reasons I cannot use group.truncate.) The problem is that for items ranking lower in the facet list sorted by count, the group facet counts come out *higher* than the actual values. Searching online, I came across details of sharded faceting at this link: http://lucene.472066.n3.nabble.com/At-a-high-level-how-does-faceting-in-SolrCloud-work-td4009897.html From that link it appears there is a *third corrective step* in which the coordinator node, after getting the individual results and building a final list, asks each shard to compute its exact count for the selected constraints. I wanted to ask whether the group.facet implementation in 4.x accounts for this step, that is, whether the coordinator node asks for grouped facet counts instead of ungrouped facet counts during the corrective step. I ask because the counts are correct for the more popular 50% of the items but are incorrect (and always higher) for the less popular ones. Also, has anyone else faced this? Ritesh -- View this message in context: http://lucene.472066.n3.nabble.com/Group-Facet-issue-in-Sharded-Solr-Setup-tp4116077.html
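The suspicion described above can be illustrated with a small simulation in plain Python. This is only a sketch of the hypothesis, not Solr's actual implementation: it assumes each shard returns its top grouped counts, and that the coordinator then asks shards for exact counts of the values missing from their top list. All function names are hypothetical. If that refinement step answered with ungrouped (per-document) counts, the merged totals would be inflated exactly for values that were not in every shard's top list.

```python
from collections import Counter


def grouped_counts(shard):
    """Count each facet value once per distinct group on this shard."""
    seen = {(group, value) for group, value in shard}
    return Counter(value for _, value in seen)


def ungrouped_counts(shard):
    """Count each facet value once per document on this shard."""
    return Counter(value for _, value in shard)


def distributed_facet(shards, limit, refine_with):
    """Merge per-shard top-N grouped counts, refining the missing values."""
    first_pass = [dict(grouped_counts(s).most_common(limit)) for s in shards]
    candidates = set().union(*first_pass)
    totals = Counter()
    for shard, top in zip(shards, first_pass):
        exact = refine_with(shard)  # the "corrective step" for this shard
        for value in candidates:
            totals[value] += top[value] if value in top else exact[value]
    return dict(totals)


if __name__ == "__main__":
    # Each document is (group_id, facet_value); groups do not span shards.
    shard1 = [("g1", "red"), ("g1", "red"), ("g2", "red"),
              ("g3", "blue"), ("g3", "blue"), ("g3", "blue")]
    shard2 = [("g4", "blue"), ("g5", "blue"),
              ("g6", "red"), ("g6", "red"), ("g6", "red")]
    print(distributed_facet([shard1, shard2], 1, grouped_counts))
    print(distributed_facet([shard1, shard2], 1, ungrouped_counts))
```

In this toy data the grouped refinement yields the true totals, while the ungrouped refinement yields larger ones, which matches the symptom of counts that are correct for popular values and too high for the rest.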
Can this type of sorting/boosting be done by solr
Hi, I have a journal article citation schema like this:
{
AT - article_title
AID - article_id (unique id)
AREFS - article_references_list (list of article ids referred to/cited in this article; multi-valued)
AA - article abstract
--- other_article_stuff ...
}
So, for example, in order to search for all those articles that refer to (cite) article id 51643, I simply need to search for AREFS:51643 and it will give me the list of articles that have 51643 listed in AREFS. Now, I want to be able to search in the text of articles and sort the results by the most referred-to articles. How can I do this? Say my search query is q=AT:metal and it gives me 1700 results. How can I sort those 1700 results by the number of citations each has received from other articles? I have been researching function queries to solve this but have been unable to do so. Thanks in advance. Ritesh -- View this message in context: http://lucene.472066.n3.nabble.com/Can-this-type-of-sorting-boosting-be-done-by-solr-tp3769315p3769315.html
Re: Can this type of sorting/boosting be done by solr
Dear Chantal, Thanks for your reply, but that's not what I was asking. Let me explain. The size of the list in AREFS would give me how many records are *referred by* an article and NOT how many records *refer to* an article. Say article id 51463 was published in 2002 and refers to 10 articles dating from 1990-2002. Then the count of AREFS would be 10, which is static once the journal has been published. However, if the same article is being *referred to* by 20 articles published from 2003-2012, then it is this count of 20 that I am talking about. This count is dynamic: as we keep adding records to the index, more articles will refer to article 51463 in their AREFS field in the future. /(Obviously, when we are adding article 51463 to the index we have no clue who will be referring to it in the future, so we cannot have another field in it for this, nor can we update 51463 every time someone refers to it.)/ So today, if I want to know which articles refer to 51463, I search for this id in the AREFS field. The query is as simple as q=AREFS:51463, and it will give the list of articles from 2003 to 2012 with a result count of 20. So back to the question: say my search query is q=AT:metal and it gives me 1700 results. How can I sort those 1700 results by the number of citations each has received (to date) from others, i.e., by the number of results I would get if I individually searched for their ids in the AREFS field? Hope this makes it clear. I feel this is a candidate for sorting/boosting by a function query, but I am not able to figure it out. Thanks, Ritesh -- View this message in context: http://lucene.472066.n3.nabble.com/Can-this-type-of-sorting-boosting-be-done-by-solr-tp3769315p3769475.html
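To make the desired ordering concrete, here is a minimal sketch in plain Python of the computation being asked for, using the AT, AID and AREFS fields from the schema in this thread. It is not a Solr query; the function names are hypothetical, and it assumes the inbound citation count can be derived by scanning every article's AREFS list.

```python
from collections import Counter


def citation_counts(articles):
    """Map each article id to the number of articles that cite it."""
    counts = Counter()
    for article in articles:
        # set() guards against an id listed twice in one AREFS list.
        for cited_id in set(article["AREFS"]):
            counts[cited_id] += 1
    return counts


def search_sorted_by_citations(articles, title_term):
    """Return articles whose title contains the term, most cited first."""
    counts = citation_counts(articles)
    hits = [a for a in articles if title_term in a["AT"].lower().split()]
    return sorted(hits, key=lambda a: counts[a["AID"]], reverse=True)


if __name__ == "__main__":
    articles = [
        {"AID": 1, "AT": "Metal fatigue", "AREFS": []},
        {"AID": 2, "AT": "Heavy metal alloys", "AREFS": [1]},
        {"AID": 3, "AT": "Polymer chains", "AREFS": [1, 2]},
        {"AID": 4, "AT": "Metal casting", "AREFS": [1, 2]},
    ]
    for hit in search_sorted_by_citations(articles, "metal"):
        print(hit["AID"], hit["AT"])
```

Because the count changes whenever a new citing article is indexed, one option (assuming periodic recomputation is acceptable) is to compute these counts outside the index and expose them through a field that can be refreshed without reindexing the documents, such as Solr's ExternalFileField.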
Re: Can this type of sorting/boosting be done by solr
Hi Chantal, Yes, I have thought about the docfreq(field_name,'search_text') function, but somehow I will have to dereference the article ids (AID) from the query results in the sort. The query below does not work: q=AT:metal&sort=docfreq(AREFS,$q.AID) Is there a mistake in the query that I am missing, or is dereferencing not supported in relevance functions? Thanks, Ritesh -- View this message in context: http://lucene.472066.n3.nabble.com/Can-this-type-of-sorting-boosting-be-done-by-solr-tp3769315p3769779.html