Re: Solr Search Across Multiple Cores not working when quering on specific field

2011-12-14 Thread pravesh
>>but when i searched on a specific field than it is not working >>http://localhost:8983/solr/core0/select?shards=localhost:8983/solr/core0,localhost:8983/solr/core1&; >>q=mnemonic_value:"United" >>Why distributed search is not working when i search on a particular field.? Since you have multipl

Re: Delta Replication in SOLR

2011-12-14 Thread Walter Underwood
On Dec 14, 2011, at 9:58 PM, mechravi25 wrote: > We would like know whether it is possible to replicate only a certain > documents from master to slave. More like a Delta Replication process. No, it is not. wunder -- Walter Underwood wun...@wunderwood.org

Re: Too many connections in CLOSE_WAIT state on master solr server

2011-12-14 Thread samarth s
Thanks Erick and Mikhail. I'll try this out. On Wed, Dec 14, 2011 at 7:11 PM, Erick Erickson wrote: > I'm guessing (and it's just a guess) that what's happening is that > the container is queueing up your requests while waiting > for the other connections to close, so Mikhail's suggestion > seems

Re: Migrate Lucene 2.9 To SOLR

2011-12-14 Thread Chris Hostetter
: I have a old project that use Lucene 2.9. Its possible to use the index : created by lucene in SOLR? May i just copy de index to data directory of : SOLR, or exists some mechanism to import Lucene index? you can use an index created directly with lucene libraries in Solr, but in order for Solr

Re: How to get SolrServer within my own servlet

2011-12-14 Thread Joey
Hi Chris, There won't be deadlock I think because there is only one place(from my own servlet) can trigger a index. Yes, I am trying to embed Solr application - I could separate my servlet to another app and talk to Sorl via HTTP, but there will be two pieces(Solr and my own app) of software I h

Re: Possible to adjust FieldNorm?

2011-12-14 Thread cnyee
Sorry, I did not give the full output in the first post. For what it looks, the fieldNorm is saying that: 1 match out of 3 words in record 1 is more significant than 2 matches out of 8 words in record 2. That would be true for simple arithmetic, but unsatisfactory in human 'meaning'. Here are the

Re: Arabic suppport,

2011-12-14 Thread Chris Hostetter
: how can I add arabic support to the solr? https://wiki.apache.org/solr/LanguageAnalysis https://wiki.apache.org/solr/LanguageAnalysis#Arabic -Hoss

Re: How to get SolrServer within my own servlet

2011-12-14 Thread Chris Hostetter
: So what I want to do is to modify Solr a bit - add one servlet so I can : trigger a full index of a folder in the file system. ... : I guess there are two SolrServer instances(one is EmbeddedSolrServer, : created by myself and the other is come with Solr itself and they are : holding di

XPath with ExtractingRequestHandler

2011-12-14 Thread Michael Kelleher
I want to restrict the HTML that is returned by Tika to basically: /xhtml:html/xhtml:body//xhtml:div[@class='bibliographicData']/descendant:node() and it seems that the XPath class being used does not support the '//' syntax. Is there anyway to configure Tika to use a different XPath e

Re: NumericRangeQuery: what am I doing wrong?

2011-12-14 Thread Chris Hostetter
I'm a little lost in this thread ... if you are programaticly construction a NumericRangeQuery object to execute in the JVM against a Solr index, that suggests you are writting some sort of SOlr plugin (or uembedding solr in some way) why manually construct the query using options that may or

Re: queryResultCache hit count is not being increased when programmatically adding Lucene queries as filters in the SearchComponent

2011-12-14 Thread Igor Muntyan
Solr version: 3.2.0 -- View this message in context: http://lucene.472066.n3.nabble.com/queryResultCache-hit-count-is-not-being-increased-when-programmatically-adding-Lucene-queries-as-filt-tp3586892p3586904.html Sent from the Solr - User mailing list archive at Nabble.com.

queryResultCache hit count is not being increased when programmatically adding Lucene queries as filters in the SearchComponent

2011-12-14 Thread Igor Muntyan
In my application I need to deal with a very large number of filter queries that I cannot pass as http parameters - instead I add them as filters on the ResponseBuilder: public void process(ResponseBuilder rb) { List filters = rb.getFilters(); if (filters == null) {

Re: NumericRangeQuery: what am I doing wrong?

2011-12-14 Thread Jay Luker
On Wed, Dec 14, 2011 at 2:04 PM, Erick Erickson wrote: > Hmmm, seems like it should work, but there are two things you might try: > 1> just execute the query in Solr. id:1 TO 100]. Does that work? Yep, that works fine. > 2> I'm really grasping at straws here, but it's *possible* that you >  

Re: Solr Join with Dismax

2011-12-14 Thread Chris Hostetter
: I have been doing more tracing in the code. And I think that I understand a : bit more. The problem does not seem to be dismax+join, but : dismax+join+fromIndex. Correct. join+dismax works fine as i already demonstrated... : >> Note: even with that hardcoded "lucene" bug, you can still overr

Re: Solr Join with Dismax

2011-12-14 Thread Pascal Dimassimo
Hi, I have been doing more tracing in the code. And I think that I understand a bit more. The problem does not seem to be dismax+join, but dismax+join+fromIndex. When doing this joined dismax query on the same index: http://localhost:8080/solr/gutenberg/select?q={!join+from=id+to=id+v=$qq}&qq={!d

Re: CRUD on solr Index while replicating between master/slave

2011-12-14 Thread Otis Gospodnetic
Hi Tarun, Sounds like you should stop reindexing *everything*.  Just index deltas and you'll be in a much better shape. Otis  Performance Monitoring SaaS for Solr - http://sematext.com/spm/solr-performance-monitoring/index.html > > From: Tarun Jain >To:

Re: Possible to adjust FieldNorm?

2011-12-14 Thread Chris Hostetter
: From what I can see, the problem there is not with the field norm, but with : the fact that "leadership" is not matching the second document for some : reason. Is it possible that you are having some kind of analysis problem? Agreed ... if those are your full score explanations for those two d

Re: Copy in multivalued field and faceting

2011-12-14 Thread Ahmet Arslan
> I read the document of "Facet.sort=count" which seems to > return the facets > order by the doc hit counts. > > So, suppose one doc has title "value1 value2 value3", and > another doc has > title "value2 value 4 value 5", and use WhitespaceTokenizer > (no matter > designed in single field or mul

Re: Copy in multivalued field and faceting

2011-12-14 Thread yunfei wu
Hi, Eric, Just interested in this topic, so might want to ask further question based on Jul's topic. I read the document of "Facet.sort=count" which seems to return the facets order by the doc hit counts. So, suppose one doc has title "value1 value2 value3", and another doc has title "value2 val

Re: Shutdown hook issue

2011-12-14 Thread François Schiettecatte
I am not an expert on this but the oom-killer will kill off the process consuming the greatest amount of memory if the machine runs out of memory, and you should see something to that effect in the system log, /var/log/messages I think. François On Dec 14, 2011, at 2:54 PM, Adolfo Castro Menna

Re: Solr Join with Dismax

2011-12-14 Thread Pascal Dimassimo
Thanks Hoss! But unfortunately, the dismax parameters (like qf) are not passed over to the fromIndex. In fact, even if using var dereferencing makes Dismax to be selected as the "fromQueryParser", the query that is passed to the JoinQuery object contains nothing to indicate that it should use dism

Re: Shutdown hook issue

2011-12-14 Thread Adolfo Castro Menna
I think I found the issue. The ubuntu server is running OOM-Killer which might be sending a SIGINT to the java process, probably because of memory consumption. Thanks, Adolfo. On Wed, Dec 14, 2011 at 12:44 PM, Otis Gospodnetic < otis_gospodne...@yahoo.com> wrote: > Hi, > > Solr won't shut down b

Re: Possible to adjust FieldNorm?

2011-12-14 Thread Tomás Fernández Löbbe
>From what I can see, the problem there is not with the field norm, but with the fact that "leadership" is not matching the second document for some reason. Is it possible that you are having some kind of analysis problem? On Wed, Dec 14, 2011 at 6:50 AM, cnyee wrote: > Hi, > > Is it possible to

Re: How to get SolrServer

2011-12-14 Thread Tomás Fernández Löbbe
Hi Joey, if what you want is to customize Solr so that you do the indexing code on the server side, you could implement your own RequestHandler, then the only thing you need to do is to add it to the solrconfig.xml and you can call it through http GET method. On Tue, Dec 13, 2011 at 4:42 PM, Schmi

Re: CRUD on solr Index while replicating between master/slave

2011-12-14 Thread Tarun Jain
Hi, We do optimize the whole index because we index our entire content every 4 hrs.  From an application/business point of view the replication time if acceptable. Thanks for the information though. We will try to change this behaviour in the future so that replication time if reduced. Tarun Ja

Re: NumericRangeQuery: what am I doing wrong?

2011-12-14 Thread Erick Erickson
Hmmm, seems like it should work, but there are two things you might try: 1> just execute the query in Solr. id:1 TO 100]. Does that work? 2> I'm really grasping at straws here, but it's *possible* that you need to use the same precisionstep as tint (8?)? There's a constructor that tak

Re: CRUD on solr Index while replicating between master/slave

2011-12-14 Thread Erick Erickson
Whoa! Replicating takes 15 mins? That's a really long time. Are you including about the polling interval here? Or is this just raw replication time? Because this is really suspicious. Are you optimizing your index all the time or something? Replication should pull down ONLY the changed segments. B

Re: Large RDBMS dataset

2011-12-14 Thread Erick Erickson
You can also consider using SolrJ to do this. I posted a small example a couple of days ago. Best Erick On Wed, Dec 14, 2011 at 10:39 AM, Gora Mohanty wrote: > On Wed, Dec 14, 2011 at 3:48 PM, Finotti Simone wrote: >> Hello, >> I have a very large dataset (> 1 Mrecords) on the RDBMS which I wan

Re: Optimal Setup

2011-12-14 Thread Walter Underwood
You need dev, test, and live on separate boxes so that you can do capacity tests. When you are sending queries to find out the max rate before overload, you need to do that on dev or test, not live. Also, you'll need to test new versions of Solr, so you need separate Solr installations. wunde

Optimal Setup

2011-12-14 Thread Dave Stuart
Background: We have around 100 web sites of various sizes (in terms of indexable content) and I'm trying to come up with the best architectural design from a performance perspective. - Each of the sites has a need for DEV, TEST and LIVE indices. - The content on the sites divided into 5 group

Re: cache monitoring tools?

2011-12-14 Thread Dmitry Kan
Thanks, Justin. With zabbix I can gather jmx exposed stats from SOLR, how about munin, what protocol / way it uses to accumulate stats? It wasn't obvious from their online documentation... On Mon, Dec 12, 2011 at 4:56 PM, Justin Caratzas wrote: > Dmitry, > > The only added stress that munin puts

Re: NumericRangeQuery: what am I doing wrong?

2011-12-14 Thread Dmitry Kan
Maybe you should index your values differently? Here is what Lucene's 2.9 javadoc says: To use this, you must first index the numeric values using NumericField(expert: NumericTokenStream

NumericRangeQuery: what am I doing wrong?

2011-12-14 Thread Jay Luker
I can't get NumericRangeQuery or TermQuery to work on my integer "id" field. I feel like I must be missing something obvious. I have a test index that has only two documents, id:9076628 and id:8003001. The id field is defined like so: A MatchAllDocsQuery will return the 2 documents, but any que

Re: CRUD on solr Index while replicating between master/slave

2011-12-14 Thread Otis Gospodnetic
Hi, The slave will get the changes next time it polls the master and master tells it the index has changed. Note that master doesn't replicate to slave, but rather the slave copies changes from the master. Otis  Performance Monitoring SaaS for Solr - http://sematext.com/spm/solr-performan

Re: Shutdown hook issue

2011-12-14 Thread Otis Gospodnetic
Hi, Solr won't shut down by itself just because it's idle. :) You could run it with debugger attached and breakpoint set in the shutdown hook you are talking about and see what calls it. Otis Performance Monitoring SaaS for Solr - http://sematext.com/spm/solr-performance-monitoring/index.

Re: CRUD on solr Index while replicating between master/slave

2011-12-14 Thread Tarun Jain
Hi, We have an index which needs constant updates in the master. One more question.. The scenario is 1) Master starts replicating to slave (takes approx 15 mins) 2) We do some changes to index on master while it is replicating So question is what happens to the changes in master index while it i

Re: Large RDBMS dataset

2011-12-14 Thread Gora Mohanty
On Wed, Dec 14, 2011 at 3:48 PM, Finotti Simone wrote: > Hello, > I have a very large dataset (> 1 Mrecords) on the RDBMS which I want my Solr > application to pull data from. [...] > It works, but it takes 1'38" to parse 100 records: it means 1 rec/s! That > means that digesting the whole data

Using LocalParams in StatsComponent to create a price slider?

2011-12-14 Thread Mark Schoy
Hi, I'm using the StatsComponent to receive to lower and upper bounds of a price field to create a "price slider". If someone sets the price range to $100-$200 I have to add a filter to the query. But then the lower and upper bound are calculated of the filtered result. Is it possible to use Loca

Re: Solr using very high I/O

2011-12-14 Thread Martin Koch
Do you commit often? If so, try committing less often :) /Martin On Wed, Dec 7, 2011 at 12:16 PM, Adrian Fita wrote: > Hi. I experience an issue where Solr is using huge ammounts of I/O. > Basically it uses the whole HDD continously, leaving nothing to the > other processes. Solr is called by a

Re: Large RDBMS dataset

2011-12-14 Thread Martin Koch
Instead of handling it from within solr, I'd suggest writing an external application (e.g. in python using pysolr) that wraps the (fast) SQL query you like. Then retrieve a batch of documents, and write them to solr. For extra speed, don't commit until you're done. /Martin On Wed, Dec 14, 2011 at

Re: Getting Error while running Query

2011-12-14 Thread Gora Mohanty
On Wed, Dec 14, 2011 at 5:34 PM, Sanket Shah wrote: > Thanks Gora for your reply. >   How can I come to know that alfresco or share is running n SOLR? I meant, > when I login, clicking some folders or creating or uploading new files. How > can I know that it is being done by SOLR and not by the

Re: Use Solr to process/analyze docs without indexing

2011-12-14 Thread tesnick
Thanks iorixxx! I think that's exactly what I was looking. -- View this message in context: http://lucene.472066.n3.nabble.com/Use-Solr-to-process-analyze-docs-without-indexing-tp3585263p3585522.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: edismax phrase matching with a non-word char inbetween

2011-12-14 Thread Erick Erickson
What I think is happening here is that WordDelimiterFilterFactory is throwing away your non-alpha-numeric characters. You can see this in admin/analysis, which I've found *extremely* helpful when faced with this kind of question. Best Erick On Tue, Dec 13, 2011 at 10:37 AM, Robert Brown wrote: >

Re: Copy in multivalued field and faceting

2011-12-14 Thread Erick Erickson
I don't quite understand what you're trying to do. MultiValued is a bit misleading. All it means is that you can add the same field multiple times to a document, i.e. (XML example) value1 value2 value3 value4 value5 value6 will succeed if "field" is multiValued and fail if not. This will wo

Re: Too many connections in CLOSE_WAIT state on master solr server

2011-12-14 Thread Erick Erickson
I'm guessing (and it's just a guess) that what's happening is that the container is queueing up your requests while waiting for the other connections to close, so Mikhail's suggestion seems like a good idea. Best Erick On Wed, Dec 14, 2011 at 12:28 AM, samarth s wrote: > The updates to the maste

Faceting with null dates

2011-12-14 Thread kenneth hansen
hello,I have the following faceting parameters, which gives me some unwanted non-null dates in the result set. Is there a way to query the index to not give me non-null dates in return? I.e. I would like to get a result set which contains only non-nulls on the validToDate, but as I am faceting

Re: Use Solr to process/analyze docs without indexing

2011-12-14 Thread Ahmet Arslan
> I would use Solr to analyze / process documents using > stemming analyzers, > stopwordsfilters, etc. and then return the results instead > of indexing. > There is already some api service out-of-box to do this? It > would be easy to > implement? > > I'm thinking of using a RequestHandler to re

Shutdown hook issue

2011-12-14 Thread Adolfo Castro Menna
Hi All, I'm experiencing some issues with solr. From time to time solr goes down. After checking the logs, I see that it's due to the shutdown hook being triggered. I still don't know why it happens but it seems to be related to solr being idle. Does anyone have any insights? I'm using Ubuntu 10.

Use Solr to process/analyze docs without indexing

2011-12-14 Thread tesnick
Hello, I would use Solr to analyze / process documents using stemming analyzers, stopwordsfilters, etc. and then return the results instead of indexing. There is already some api service out-of-box to do this? It would be easy to implement? I'm thinking of using a RequestHandler to receive the d

RE: Getting Error while running Query

2011-12-14 Thread Sanket Shah
Thanks Gora for your reply. How can I come to know that alfresco or share is running n SOLR? I meant, when I login, clicking some folders or creating or uploading new files. How can I know that it is being done by SOLR and not by the old way before alfresco 4.0. I have put the following thi

Re: Getting Error while running Query

2011-12-14 Thread Gora Mohanty
On Wed, Dec 14, 2011 at 5:00 PM, Sanket Shah wrote: > Hi All, > >   I am sorry If I have sent this email at wrong list. If it is then > kindly let me know! > >   I am using Alfresco 4.0 which is having SOLR for Lucene. I am able to > see the SOLR page and also able to fire queris But they do not r

Getting Error while running Query

2011-12-14 Thread Sanket Shah
Hi All, I am sorry If I have sent this email at wrong list. If it is then kindly let me know! I am using Alfresco 4.0 which is having SOLR for Lucene. I am able to see the SOLR page and also able to fire queris But they do not return any results and sometimes giving errors. I am using SOLR

Solr Search Across Multiple Cores not working when quering on specific field

2011-12-14 Thread ravicv
I have two Solr cores. Core0 and core1 Both cores are having same schema and configuration. after indexing both cores data is retried from both cores individually http://localhost:8983/solr/core0/select?q=fieldName:%22United%22 http://localhost:8983/solr/core1/select?q=fieldName:%22United%22 *S

Large RDBMS dataset

2011-12-14 Thread Finotti Simone
Hello, I have a very large dataset (> 1 Mrecords) on the RDBMS which I want my Solr application to pull data from. Problem is that the document fields which I have to index aren't in the same table, but I have to join records with two other tables. Well, in fact they are views, but I don't thin

Re: Copy in multivalued field and faceting

2011-12-14 Thread yunfei wu
Sounds like working by carefully choosing tokenizer, and then use facet.sort and facet.limit parameters to do faceting. Will see any expert's comments on this one. Yunfei On Wed, Dec 14, 2011 at 12:26 AM, darul wrote: > Hello, > > Field for this scenario is "Title" and contains several words.

Possible to adjust FieldNorm?

2011-12-14 Thread cnyee
Hi, Is it possible to adjust FieldNorm? I have a scenario where the search is not producing the desired result because of fieldNorm: Search terms: coaching leadership Record 1: name="Ask the Coach", desc="...",... Record 2: name="Coaching as a Leadership Development Tool Part 1", desc="...",...

Sorting and searching on a field

2011-12-14 Thread Swapna Vuppala
Hi, I have a field in Solr that I want to be sortable. But at the same time, I want to be able to search on that field without using wild cards. Is that possible ? For example, if I have a field "Subject" with a value "This is my first subject", searching in solr as subject:first should give me

Copy in multivalued field and faceting

2011-12-14 Thread darul
Hello, Field for this scenario is "Title" and contains several words. For a specific query, I would like get the top ten words by frequency in a specific field. My idea was the following: - Title in my schema is stored/indexed in a specific field - A copyField copy Title field content into a mu