Re: To warm the whole cache of Solr other than the only autowarmcount

2014-07-28 Thread YouPeng Yang
Hi Shawn No affense to your work,I am still confusing about the cache warm processing about your explanation.So I check the warm method of FastLRUCache as [1]. As far as I see,there is no values refresh during the the warm processing. the *regenerator.regenerateItem* just put the old value to

Re: Slow inserts when using Solr Cloud

2014-07-28 Thread Shalin Shekhar Mangar
I'm benchmarking this right now so I'll share some numbers soon. On Mon, Jul 28, 2014 at 12:45 AM, Erick Erickson erickerick...@gmail.com wrote: bq: Whoa! That's awesome! And scary. Ian: Thanks a _lot_ for trying this out and reporting back. Also, let me say that this was a nice

copy EnumField to text field

2014-07-28 Thread Elran Dvir
Hi all, I have an enumField called severity. these are its relevant definitions in schema.xml: field name=severity type=severityType indexed=true stored=true docValues=true default=0/ fieldType name=severityType class=solr.EnumField enumsConfig=enumsConfig.xml enumName=severity/ copyField

Re: copy EnumField to text field

2014-07-28 Thread Alexandre Rafalovitch
On Mon, Jul 28, 2014 at 1:31 PM, Elran Dvir elr...@checkpoint.com wrote: But when no value is sent with severity, and the default of 0 is used, the fix doesn't seem to work. I guess the default in this case is figured out at the query time because there is no empty value as such. So that would

RE: copy EnumField to text field

2014-07-28 Thread Elran Dvir
Are you saying that default values are for query and not for indexing? Thanks. -Original Message- From: Alexandre Rafalovitch [mailto:arafa...@gmail.com] Sent: Monday, July 28, 2014 9:38 AM To: solr-user Subject: Re: copy EnumField to text field On Mon, Jul 28, 2014 at 1:31 PM, Elran

Re: Auto Suggest

2014-07-28 Thread benjelloun
Hello Erick, So in your opinion what is the solution to use autosuggest with sentece :) an exemple will be very helpfull, Thanks, best regards, Anass BENJELLOUN -- View this message in context: http://lucene.472066.n3.nabble.com/Auto-Suggest-tp4149004p4149441.html Sent from the Solr - User

Re: Passivate core in Solr Cloud

2014-07-28 Thread aurelien . mazoyer
Thank you Erick, Ok, I will probably perform some tests. It seems to be a good candidate for a future blog post... Regards, Aurelien On 27.07.2014 20:20, Erick Erickson wrote: Does not play nice really means it was designed to run in a non-distributed mode. There has been no work done to

solr uima and opencalais

2014-07-28 Thread tomcool
Hi. I'm looking into different possibilities of named-entity extraction features offered by Solr uima. The OpenCalais web service would fit my needs, but I can't get it to work right. First question : is the openCalais annotator up to date ? Right now, I can send a request to the openCalais

Bloom filter

2014-07-28 Thread Per Steffensen
Hi Where can I find documentation on how to use Bloom filters in Solr (4.4). http://wiki.apache.org/solr/BloomIndexComponent seems to be outdated - there is no BloomIndexComponent included in 4.4 code. Regards, Per Steffensen

Re: copy EnumField to text field

2014-07-28 Thread Jack Krupansky
There is a distinction between the original source input value for the indexing process and what value is actually indexed. Query searching will see whatever is actually indexed, not the original source input value. An URP could explicitly set the source input value to the default value if it

Re: Perm Gen issues in SolrCloud

2014-07-28 Thread Poornima Jay
Hi Nitin, Not sure of you have tried these steps. 1. Stop the Tomcat Server. 2.Find catalina.bat 3.Assign following line to JAVA_OPTS variable and add it into catalina.bat file. set JAVA_OPTS=-server -Xms512M -Xmx768M -XX:MaxPermSize=256m  4. restart On Saturday, 1 March 2014 6:02 AM,

RE: copy EnumField to text field

2014-07-28 Thread Elran Dvir
So if I have a document without severity, I can't see severity has its default value (0) in the stage of copy fields (in class DocumentBuilder)? Thanks. -Original Message- From: Jack Krupansky [mailto:j...@basetechnology.com] Sent: Monday, July 28, 2014 2:39 PM To:

Re: copy EnumField to text field

2014-07-28 Thread Jack Krupansky
Correct - copy field copies the raw, original, source input value, before the actual field type has had a chance to process it in any way. -- Jack Krupansky -Original Message- From: Elran Dvir Sent: Monday, July 28, 2014 8:08 AM To: solr-user@lucene.apache.org Subject: RE: copy

RE: copy EnumField to text field

2014-07-28 Thread Elran Dvir
Do you think that I that the change I suggested In DocumentBuilder is right or should we leave it as it? The change: Instead of: // Perhaps trim the length of a copy field Object val = v; The code will be: // Perhaps trim the length of a copy field Object val =

Re: To warm the whole cache of Solr other than the only autowarmcount

2014-07-28 Thread Shawn Heisey
On 7/28/2014 12:06 AM, YouPeng Yang wrote: No affense to your work,I am still confusing about the cache warm processing about your explanation.So I check the warm method of FastLRUCache as [1]. As far as I see,there is no values refresh during the the warm processing. the

Re: Bloom filter

2014-07-28 Thread Shalin Shekhar Mangar
I don't think that issue was ever committed. On Mon, Jul 28, 2014 at 4:47 PM, Per Steffensen st...@designware.dk wrote: Hi Where can I find documentation on how to use Bloom filters in Solr (4.4). http://wiki.apache.org/solr/BloomIndexComponent seems to be outdated - there is no

Re: /solr/admin/ping causing exceptions in log?

2014-07-28 Thread Nathan Neulinger
Thing is - I wouldn't expect any of the default options mentioned to change the behavior intermittently. i.e. it's working for 95% of the health check requests, it's just the intermittent ones that seem to be cut off... I'm inquiring with haproxy devs since it appears that at least one other

Re: Bloom filter

2014-07-28 Thread Lukas Drbal
Hi Per, link to jira - https://issues.apache.org/jira/browse/SOLR-1375 Unresolved ;-) L. On Mon, Jul 28, 2014 at 1:17 PM, Per Steffensen st...@designware.dk wrote: Hi Where can I find documentation on how to use Bloom filters in Solr (4.4). http://wiki.apache.org/solr/BloomIndexComponent

Re: java.lang.OutOfMemoryError: Requested array size exceeds VM limit

2014-07-28 Thread Harald Kirsch
Hi, the stack trace points to tika, which is likely in the process of extracting indexable plain text from some document. Tika's job is one of the dirtiest you can think of in the whole indexing business. We throw all kinds of more or less

Re: Bloom filter

2014-07-28 Thread Per Steffensen
Yes I found that one, along with SOLR-3950. Well at least it seems like the support is there in Lucene. I will figure out myself how to make it work via Solr, the way I need it to work. My use-case is not as specified in SOLR-1375, but the solution might be the same. Any input is of course

Query about vacuum full

2014-07-28 Thread Ameya Aware
Hi, I am seeing considerable decrease in speed of indexing of documents. I am using PostgreSQL. So is this a right time to do vacuum on PostgreSQL because i am using this since a week. Also, to invoke vacuum full do i just need to go to PostgreSQL command prompt and invoke VACUUM FULL

Re: Understanding the Debug explanations for Query Result Scoring/Ranking

2014-07-28 Thread O. Olson
Thank you very much Chris. I was not aware of debug.explain.structured. It seems to be what I was looking for. Thanks also to Jack Krupansky. Yes, delving into those numbers would be my next step, but I will get to that later. O. O. Chris Hostetter-3 wrote Just to be clear, regardless of

Re: Query about vacuum full

2014-07-28 Thread Rafał Kuć
Hello! Please refer to PostgreSQL mailing list with this question. This question is purely about that database and this mailing list is about Solr. -- Regards, Rafał Kuć Performance Monitoring * Log Analytics * Search Analytics Solr Elasticsearch Support * http://sematext.com/ Hi, I am

Re: Query about vacuum full

2014-07-28 Thread Jack Krupansky
Or are you using ManifoldCF? -- Jack Krupansky -Original Message- From: Rafał Kuć Sent: Monday, July 28, 2014 11:00 AM To: solr-user@lucene.apache.org Subject: Re: Query about vacuum full Hello! Please refer to PostgreSQL mailing list with this question. This question is purely

Re: Query about vacuum full

2014-07-28 Thread Ameya Aware
yes.. i intended to post this query there. By mistake, i put it here. Apologizing Ameya On Mon, Jul 28, 2014 at 11:07 AM, Jack Krupansky j...@basetechnology.com wrote: Or are you using ManifoldCF? -- Jack Krupansky -Original Message- From: Rafał Kuć Sent: Monday, July 28, 2014

Re: To warm the whole cache of Solr other than the only autowarmcount

2014-07-28 Thread Erick Erickson
bq: autowarmcount=1024... That's the point, this is quite a high number in my experience. I've rarely seen numbers above 128 show much of any improvement. I've seen a large number of installations use much smaller autowarm numbers, as in the 16-32 range and be quite content. I _really_

Re: how to extract stats component with solrj 4.9.0

2014-07-28 Thread Edith Au
I tried getFieldStatsInfo(). I got a stats of the stats with this method (ie. sum (sum(count)) of all the group'ed results. But it is not what I want. I want a list of stats (ie. sum(count), group by block num). With a debugger, I could see the information I want in this private object

Re: how to extract stats component with solrj 4.9.0

2014-07-28 Thread Shawn Heisey
On 7/28/2014 10:08 AM, Edith Au wrote: I tried getFieldStatsInfo(). I got a stats of the stats with this method (ie. sum (sum(count)) of all the group'ed results. But it is not what I want. I want a list of stats (ie. sum(count), group by block num). With a debugger, I could see the

Re: how to extract stats component with solrj 4.9.0

2014-07-28 Thread Edith Au
Thanks Shawn. I found this method FieldStatsInfo().getFacets() in the Solr 4.9.0 doc. But it seems to me the method is missing in my Solrj 4.9.0 distribution. Could this be a bug? or I have a bad distro? On Mon, Jul 28, 2014 at 9:43 AM, Shawn Heisey s...@elyograg.org wrote: On

Getting Started with Enterprise Search using Apache Solr

2014-07-28 Thread Xavier Morera
Hi. Most of the members here are already seasoned search professionals. However I believe there may also be a few who joined because they want to get started on search and IMHO, probably like you, Solr is the best way to start. Therefore I wanted to post a link to a course that I created on

crawling all links of same domain in nutch in solr

2014-07-28 Thread Vivekanand Ittigi
Hi, Can anyone tel me how to crawl all other pages of same domain. For example i'm feeding a website http://www.techcrunch.com/ in seed.txt. Following property is added in nutch-site.xml property namedb.ignore.internal.links/name valuefalse/value descriptionIf true, when adding new links