DocSet getting cached in filterCache for facet request with {!cache=false}

2014-11-11 Thread Mohsin Beg Beg
Hello, It seems Solr is caching when facting even with fq={!cache=false}*:* specified. This is what I am doing on Solr 4.10.0 on jre 1.7.0_51. Query 1) No cache in filterCache as expected http://localhost:8983/solr/collection1/select?q=*:*rows=0fq={!cache=false}*:*

How to specify a property for all cores

2014-11-11 Thread Andreas Hubold
Hi, I'm using Solr 4.10.1 with the new solr.xml format (auto-discovered cores). I'm trying to set a property that I can reference in solrconfig.xml files of all cores. I know I can use JVM system properties or add the property to each core's core.properties file. Is there another

Removing Common Web Page Header and Footer from content

2014-11-11 Thread Moumita Dhar01
Hi, I am using Nutch 1.9 and Solr 4.6 to index a web application with approximately 100 distinct URL and contents. Nutch is used to fetch the urls, links and the crawl the entire web application to extract all the content for all pages, and send the content to Solr. The problem that I have

Re: Proper way to backup solr.

2014-11-11 Thread elmerfudd
First, I want to thank you for your response! can you provide more information about the suggested hardlink solution? What are the advantages and disadvantages using it? can you provide an example please? meanwhile try to read about it and test it myself asap. thanks! -- View this

Re: create new core based on named config set using the admin page

2014-11-11 Thread Andreas Hubold
Okay, I've created https://issues.apache.org/jira/browse/SOLR-6728 Erick Erickson wrote on 11/06/2014 08:00 PM: Yeah, please create a JIRA. There are a couple of umbrella JIRAs that you might want to link it to I'm not sure it quite fits in either, if not just let it hang out there bear:

I want to translate solr wiki to Korean.

2014-11-11 Thread Jeon Woosung
In Korea, only few people can read English well. Thus, it is difficult to use solr. But I want solr to spread out . So I would like to translate solr wiki to Korean. Is there any good ways to translate it?

Can I select dummy field(for count) from solr?

2014-11-11 Thread suhyunjeon
I want to show cumulative graph from banana framework(SiLK). https://docs.lucidworks.com/display/SiLK/Banana There is no cumulative graph so I want to select count(*) from solr collection like dummy field. So then I am going to sum the field(count(*)) and show histogram graph. Do you have idea?

Parent query yields document which is not matched by parents filter

2014-11-11 Thread ku3ia
Hi, folks! We are using parent/child architecture in our project and sometimes when using child transformer ([child]) there are an exception: Parent query yields document which is not matched by parents filter, docID=... Examples of query are:

Re: Suggester not suggesting anything using DictionaryCompoundWordTokenFilterFactory

2014-11-11 Thread Thomas Michael Engelke
I think I found the problem. The definition of the suggester component has a field option which references the field that the suggester uses to generate suggestions. Changing this to the field using the DictionaryCompundWordTokenFilterFactory also suggests word parts. Am 11.11.2014 08:52 schrieb

How to suggest from multiple fields?

2014-11-11 Thread Thomas Michael Engelke
Like in this article (http://www.andornot.com/blog/post/Advanced-autocomplete-with-Solr-Ngrams-and-Twitters-typeaheadjs.aspx), I am using multiple fields to generate different options for an autosuggest functionality: - First, the whole field (top priority) - Then, the whole field as

Re: Removing Common Web Page Header and Footer from content

2014-11-11 Thread Ahmet Arslan
Hi Moumita, Once, I used https://code.google.com/p/boilerpipe/ to remove common header/footers etc. Ahmet On Tuesday, November 11, 2014 10:41 AM, Moumita Dhar01 moumita_dha...@infosys.com wrote: Hi, I am using Nutch 1.9 and Solr 4.6 to index a web application with approximately 100

Re: Analytics result for each Result Group

2014-11-11 Thread Talat Uyarer
Hi Anurag, How can I find median function ? I use a lot that. 2014-11-09 20:39 GMT+02:00 Anurag Sharma anura...@gmail.com: Can a function query(http://wiki.apache.org/solr/FunctionQuery) serves your use case On Wed, Nov 5, 2014 at 3:36 PM, Talat Uyarer ta...@uyarer.com wrote: I searched

Re: Proper way to backup solr.

2014-11-11 Thread Shawn Heisey
On 11/11/2014 1:45 AM, elmerfudd wrote: First, I want to thank you for your response! can you provide more information about the suggested hardlink solution? What are the advantages and disadvantages using it? can you provide an example please? meanwhile try to read about it and test

Re: I want to translate solr wiki to Korean.

2014-11-11 Thread Steve Rowe
Hi Jeon Woosung, The Solr community wiki is no longer the official Solr documentation location. The Solr Reference Guide is where Solr documentation is now maintained: https://cwiki.apache.org/confluence/display/solr/Apache+Solr+Reference+Guide. I’m not sure what you mean when you ask “Is

Re: DocSet getting cached in filterCache for facet request with {!cache=false}

2014-11-11 Thread Shawn Heisey
On 11/11/2014 1:22 AM, Mohsin Beg Beg wrote: It seems Solr is caching when facting even with fq={!cache=false}*:* specified. This is what I am doing on Solr 4.10.0 on jre 1.7.0_51. Query 1) No cache in filterCache as expected

Re: Lucene to Solrcloud migration

2014-11-11 Thread Michal Krajňanský
Hi Eric, Michael, thank you both for your comments. 2014-11-11 5:05 GMT+01:00 Erick Erickson erickerick...@gmail.com: bq: - the documents are organized in shards according to date (integer) and language (a possibly extensible discrete set) bq: - the indexes are disjunct OK, I'm having a

Re: Lucene to Solrcloud migration

2014-11-11 Thread Michael Della Bitta
Yeah, Erick confused me a bit too, but I think what he's talking about takes for granted that you'd have your various indexes directly set up as individual collections. If instead you're considering one big collection, or a few collections based on aggregations of your individual indexes,

Re: Lucene to Solrcloud migration

2014-11-11 Thread Michal Krajňanský
Hm. So I found that one can update stored fields with atomic update operation, however according to http://stackoverflow.com/questions/19058795/it-is-possible-to-update-uniquekey-in-solr-4 this will not work for uniqueKey. So I guess with compositeId router I am out of luck. I have been also

Re: DocSet getting cached in filterCache for facet request with {!cache=false}

2014-11-11 Thread Erick Erickson
Well, the difference that you're faceting with method=enum, which uses the filterCache (I think, it's been a while). I admit I'm a little surprised that when I tried faceting with the inStock field in the standard distro I got 3 entries when there are only two values but I'm willing to let that

Re: Lucene to Solrcloud migration

2014-11-11 Thread Erick Erickson
bq: So I guess with compositeId router I am out of luck. No, not at all. Atomic updates are exactly about updating a doc and NOT changing the id. A different uniqueKey is a different doc by definition. So you can easily use atomic updates with composite IDs since you are changing a field of an

Re: How to Facet external fields

2014-11-11 Thread bbarani
Thanks for your response.. It's indeed a good idea..I will try that out.. -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-generate-calculate-facet-counts-for-external-fields-tp4168653p4168790.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Does ReRankQuery support reranking the result of a FuzzyQuery?

2014-11-11 Thread Joel Bernstein
This issue should be resolved in https://issues.apache.org/jira/browse/SOLR-6323. This is committed in trunk, 5x, 4x, and 4_10, but this did not make it into 4.10.2. If you take the version in the 4_10 branch you should be good to go. If a version 4.10.3 is cut, this will be included. Joel

Re: DocSet getting cached in filterCache for facet request with {!cache=false}

2014-11-11 Thread Mohsin Beg Beg
Shawn, then how to skip filterCache for facet.method=enum ? Wiki says fq={!cache=false}*:* is ok, no? https://wiki.apache.org/solr/SolrCaching#filterCache -Mohsin - Original Message - From: erickerick...@gmail.com To: solr-user@lucene.apache.org Sent: Tuesday, November 11, 2014

Re: DocSet getting cached in filterCache for facet request with {!cache=false}

2014-11-11 Thread Yonik Seeley
On Tue, Nov 11, 2014 at 1:25 PM, Mohsin Beg Beg mohsin@oracle.com wrote: Wiki says fq={!cache=false}*:* is ok, no? That's for the filtering... not for the faceting. then how to skip filterCache for facet.method=enum ? Specify a high minDF (the min docfreq or number of documents that need

Re: Does ReRankQuery support reranking the result of a FuzzyQuery?

2014-11-11 Thread Joel Bernstein
Just verified that fuzzy queries work in trunk with this test: params = new ModifiableSolrParams(); params.add(rq, {!rerank reRankQuery=$rqq reRankDocs=6}); params.add(q, term_s:~1 AND test_ti:[0 TO 2000]); params.add(rqq, id:1^10 id:2^20 id:3^30 id:4^40 id:5^50 id:6^60);

SOLRJ Atomic updates of String field

2014-11-11 Thread bbarani
I am using the below code to do partial update (in SOLR 4.2) partialUpdate = new HashMapString, Object(); partialUpdate.put(set,Object); doc.setField(description, partialUpdate); server.add(docs); server.commit(); I am seeing the below description value with {set =...}, Any idea why this is

Re: Analytics result for each Result Group

2014-11-11 Thread Anurag Sharma
Probably sum and division can be applied to get the median. If you are using ver above 5, http://svn.apache.org/repos/asf/lucene/dev//trunk/solr/contrib/analytics/src/java/org/apache/solr/analytics/statistics/MedianStatsCollector.java can be used directly On Tue, Nov 11, 2014 at 5:49 PM, Talat

Re: How to suggest from multiple fields?

2014-11-11 Thread Michael Sokolov
The usual approach is to use copyField to copy multiple fields to a single field. I posted a solution using an UpdateRequestProcessor to merge fields, but with different analyzers, here: https://blog.safaribooksonline.com/2014/04/15/search-suggestions-with-solr-2/ My latest approach is

Different ids for the same document in different replicas.

2014-11-11 Thread S.L
Hi All, I am seeing interesting behavior on the replicas , I have a single shard and 6 replicas and on SolrCloud 4.10.1 . I only have a small number of documents ~375 that are replicated across the six replicas . The interesting thing is that the same document has a different id in each one of

Re: Different ids for the same document in different replicas.

2014-11-11 Thread Garth Grimm
“uuid” isn’t an out of the box field type that I’m familiar with. Generally, I’d stick with the out of the box advice of the schema.xml file, which includes things like…. !-- Only remove the id field if you have a very good reason to. While not strictly required, it is highly

Re: Different ids for the same document in different replicas.

2014-11-11 Thread Garth Grimm
Looking a little deeper, I did find this about UUIDField http://lucene.apache.org/solr/4_9_0/solr-core/org/apache/solr/schema/UUIDField.html NOTE: Configuring a UUIDField instance with a default value of NEW is not advisable for most users when using SolrCloud (and not possible if the UUID

Re: DocSet getting cached in filterCache for facet request with {!cache=false}

2014-11-11 Thread Erick Erickson
The first thing I'd try is to stop explicitly _telling_ solr to use the enum method by omitting the facet.method=enum from your URL ;)... I'm guessing that the field in question has very few unique values, so you probably need to do what Yonik suggests Erick On Tue, Nov 11, 2014 at 1:30 PM,

Re: SOLRJ Atomic updates of String field

2014-11-11 Thread Anurag Sharma
Sorry didn't get what you are trying to achieve and the issue. On Wed, Nov 12, 2014 at 12:20 AM, bbarani bbar...@gmail.com wrote: I am using the below code to do partial update (in SOLR 4.2) partialUpdate = new HashMapString, Object(); partialUpdate.put(set,Object);

Re: SOLRJ Atomic updates of String field

2014-11-11 Thread Ahmet Arslan
Hi Bbarani, Partial update solrJ example can be found in : http://find.searchhub.org/document/5b1187abfcfad33f Ahmet On Tuesday, November 11, 2014 8:51 PM, bbarani bbar...@gmail.com wrote: I am using the below code to do partial update (in SOLR 4.2) partialUpdate = new HashMapString,

DIH Blob data

2014-11-11 Thread Rahul
I am trying to index json data present under blob data type in data base. JSON stored in database as {a:1,b:2,c:3}. I want to Search based on fields later like fq= a:1. The fields a,b,c are dynamic and can be anything based on data posted by users. What is the correct way to index data based on