Re: Regarding improving performance of the solr

2013-09-10 Thread prabu palanisamy
@Shawn: Correctly I am trying to reduce the index size. I am working on reindex the solr with some of the features as indexed and not stored @Jean: I tried with different caches. It did not show much improvement. On Fri, Sep 6, 2013 at 3:17 PM, Shawn Heisey wrote: > On 9/6/2013 2:54 AM, prabu

Error with Solr 4.4.0, Glassfish, and CentOS 6.2

2013-09-10 Thread vhoangvu
Yesterday, I just install latest version of Solr 4.4.0 on Glassfish and CentOS 6.2 and got an error when try to access the administration page. I have checked this version on Mac OS one month ago, it works well. So, please help me clarify what problem. [#|2013-09-10T18:31:36.433+|INFO|oracle-g

when does one core become 'down'

2013-09-10 Thread YouPeng Yang
Hi all In which situations,one core will become down?And how can I simulate these situations Any suggestions will be appreatiated. Regards

Re: Solr4.4 or zookeeper 3.4.5 do not support too many collections? more than 600?

2013-09-10 Thread diyun2008
Thank you Lance for you experience share. That will be useful to me. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr4-4-or-zookeeper-3-4-5-do-not-support-too-many-collections-more-than-600-tp4088689p4089212.html Sent from the Solr - User mailing list archive at Nabble.c

Re: Tweaking boosts for more search results variety

2013-09-10 Thread Sai Gadde
Perfect. This is exactly what we need! I wish there is an option for plugin (or) if there is some feature like this in mainstream Solr release. Still this is a great resource for us. Thanks Marc for pointing to very useful information. Thanks all for the help. On Tue, Sep 10, 2013 at 5:30 PM

Re: "delta-import" giving Total Documents Processed = 0

2013-09-10 Thread Chris Hostetter
: Subject: "delta-import" giving Total Documents Processed = 0 : I am using solr 4.3.1, during Delta-import, i am always getting "Total : Documents Processed" as 0 enenthough it is getting the changed documents. : And no error in the log. I tried with "dih" instead of "dataimporter", still : same

Re: apache bench solr keep alive not working?

2013-09-10 Thread Chris Hostetter
: Chunked is HTTP/1.1 only, so that could be the problem. --wunder well ... yes and no. the *request* ab sends are specified as http/1.0, and in that request it uses the (non-RFC complieant but generally accepted) http/1.0 style keep-alive headers. jetty parses the http/1.0 request fine, bu

Re: apache bench solr keep alive not working?

2013-09-10 Thread Walter Underwood
Chunked is HTTP/1.1 only, so that could be the problem. --wunder On Sep 10, 2013, at 5:41 PM, Chris Hostetter wrote: > > : I've also used apache bench for a bunch of stuff and keep-alive works fine > : with things like Java netty.io servers ... strange that tomcat isn't > : respecting the htt

Re: apache bench solr keep alive not working?

2013-09-10 Thread Chris Hostetter
: I've also used apache bench for a bunch of stuff and keep-alive works fine : with things like Java netty.io servers ... strange that tomcat isn't : respecting the http protocol or headers as i said, it's probably *very* specific to the combination of trying to use a chunked response (beca

Re: apache bench solr keep alive not working?

2013-09-10 Thread Brent Ryan
thanks guys. I saw this other post with curl and verified it working. I've also used apache bench for a bunch of stuff and keep-alive works fine with things like Java netty.io servers ... strange that tomcat isn't respecting the http protocol or headers There must be a bug in this version o

Re: Unexpected value for boolean field in FunctionQuery

2013-09-10 Thread Chris Hostetter
: When I use your url example : http://localhost:8983/solr/select?q=*:*&rows=100&fl=id,inStock,if%28inStock,10,0%29&debugQuery=true : I have : : 10 : (and my document does not have the inStock field) Elodie, this is definitely a tricy one... I can reproduce the problme you are describing, on 4.

Re: Restrict Parsing duplicate file in Solr

2013-09-10 Thread Chris Hostetter
: Thanks for the response. My requirement is make sure I detect file if its : already indexed , neglect instead of replacing the existing one. A trivial way to make Solr reject (ie: actually generate an error, not just ignore) a document that already exists is to use the Optimistic Concurrency

Re: exceeded limit of maxWarmingSearchers=4

2013-09-10 Thread gfbj
*sleep 1.5 seconds* command per file ...FWIW I found in trying to cfindex 35K documents that if I did a cfdirectory list and added a delay per file indexed (and a cfsetting with a REALLY long timeout), CPU use dropped from 58% to ~19% and I got much farther without the dread maxWarmingSearchers=4 e

apache bench solr keep alive not working?

2013-09-10 Thread Brent Ryan
Does anyone know why solr is not respecting keep-alive requests when using apache bench? ab -v 4 -H "Connection: Keep-Alive" -H "Keep-Alive: 3000" -k -c 10 -n 100 " http://host1:8983/solr/test.solr/select?q=*%3A*&wt=xml&indent=true"; Response contains this and if you look at debug output you see

Re: apache bench solr keep alive not working?

2013-09-10 Thread Chris Hostetter
: Does anyone know why solr is not respecting keep-alive requests when using : apache bench? I've seen this before from people trying to test with "ab", but never fully understood it. There is some combination of using ab (which uses HTTP/1.0 and the non-RFC compliant HTTP/1.0 version of optio

Javascript StatelessScriptUpdateProcessor

2013-09-10 Thread Luís Portela Afonso
It's that possible to execute queries on a javascript script on StatelessScriptUpdateProcessor. I'm processing data with a javascript i want to execute a query to the indexed data of solr. I know that the javascript script, has an instance of SolrQueryRequest and SolrQueryResponse, but neither

Re: SolrCloud 4.x hangs under high update volume

2013-09-10 Thread Tim Vaillancourt
Hey guys, Based on my understanding of the problem we are encountering, I feel we've been able to reduce the likelihood of this issue by making the following changes to our app's usage of SolrCloud: 1) We increased our document batch size to 200 from 10 - our app batches updates to reduce HTTP re

Re: apache bench solr keep alive not working?

2013-09-10 Thread Shawn Heisey
On 9/10/2013 2:37 PM, Brent Ryan wrote: Does anyone know why solr is not respecting keep-alive requests when using apache bench? ab -v 4 -H "Connection: Keep-Alive" -H "Keep-Alive: 3000" -k -c 10 -n 100 " http://host1:8983/solr/test.solr/select?q=*%3A*&wt=xml&indent=true"; Response contains th

Re: Help in resolving the below retrieval issue

2013-09-10 Thread Prathik Puthran
I'm using Solr 3.4. This bug is causing the 2nd term i.e. "kumar" to be treated as an exclusion operator? Is it possible to configure the query parser to not treat the '-' as exclusion operator ? If not the only way is to remove the '-' from the query string? Thanks, Prathik On Tue, Sep 10, 20

Re: Help in resolving the below retrieval issue

2013-09-10 Thread Jack Krupansky
Removing stray hyphens (embedded hyphens, like "CD-ROM", are okay) or escaping them with backslash looks like your best bests. There's no query parser option to disable the hyphen as an exlusion operator, although an upgrade to a "modern" Solr should fix the problem. -- Jack Krupansky -Or

Re: "delta-import" giving Total Documents Processed = 0

2013-09-10 Thread suren
I am aware of this..my actual delta query is like below, to test the issue, i restricted the delta query to one record earlier. deltaQuery ="select distinct clai_idn as clai_idn from claim_history where TO_CHAR(EVENT_DTE , '-MM-D

Re: "delta-import" giving Total Documents Processed = 0

2013-09-10 Thread tamanjit.bin...@yahoo.co.in
Your delta query i.e. deltaQuery ="select distinct clai_idn as clai_idn from claim_history where clai_idn=29"> always gets only one row with a fixed "clai_idn". So here you fetch the same row. What you would want is to get all rows after a p

Re: Solr4.4 or zookeeper 3.4.5 do not support too many collections? more than 600?

2013-09-10 Thread Lance Norskog
Yes, Solr/Lucene works fine with other indexes this large. There are many indexes with hundreds of gigabytes and hundreds of millions of documents. My experience years ago was that at this scale, searching worked great, sorting & facets less so, and the real problem was IT: a 200G blob of data

Re: Solr doesnt return answer when searching numbers

2013-09-10 Thread Jack Krupansky
Is your df parameter also set to PackageName? The fl parameter just sets the fields to return, not which fields are queried. -- Jack Krupansky -Original Message- From: Mysurf Mail Sent: Tuesday, September 10, 2013 9:25 AM To: solr-user@lucene.apache.org Subject: Solr doesnt return ans

Re: using tika inside SOLR vs using nutch

2013-09-10 Thread Furkan KAMACI
If you have tens of millions of documents to parse and do want to do that job inside Solr than it means that you will make a workload on Solr. If there are many queries into your Solr node than you should consider that CPU and RAM may not be enough for you while both parsing and somebody is queryin

using tika inside SOLR vs using nutch

2013-09-10 Thread adfel70
Hi What are the pros and cons of both use cases? 1. use nutch to crawl file system + parse files + perform other data manipulation and eventually index to solr. 2. use solr dataimporthandlers and plugins in order to perform this task. Note that I have tens of millions of docs which I need to ha

Re: Facet values for spacial field

2013-09-10 Thread Upayavira
Note that Solr facets on the *indexed* value, not the stored one. If you want to facet on the points themselves, store them in your index alongside as a 'string' field, and facet on that. But as Erick says, that might give you rather a lot of facets, unless things tend to be grouped by location. U

Re: How to facet data from a multivalued field?

2013-09-10 Thread Erick Erickson
You can't facet on fields where indexed="false". When you look at output docs, you're seeing _stored_ not indexed data. Set indexed="true" and re-index... Best, Erick On Tue, Sep 10, 2013 at 5:51 AM, Rah1x wrote: > Hi buddy, > > I am having this problem that I cant even reach to what you did a

Re: Javascript StatelessScriptUpdateProcessor

2013-09-10 Thread Luís Portela Afonso
Solved On Sep 10, 2013, at 4:55 PM, Luís Portela Afonso wrote: > It's that possible to execute queries on a javascript script on > StatelessScriptUpdateProcessor. > I'm processing data with a javascript i want to execute a query to the > indexed data of solr. > > I know that the javascript scr

"delta-import" giving Total Documents Processed = 0

2013-09-10 Thread suren
I am using solr 4.3.1, during Delta-import, i am always getting "Total Documents Processed" as 0 enenthough it is getting the changed documents. And no error in the log. I tried with "dih" instead of "dataimporter", still same issue. Any suggestions please? --- 0 7 db-data-config.x

Re: Facet values for spacial field

2013-09-10 Thread Erick Erickson
You might be able to facet by query, but faceting by location fields doesn't make a huge amount of sense, you'll have lots of facets on individual lat/lon points. What is the use-case you are trying to support here? Best, Erick On Tue, Sep 10, 2013 at 8:43 AM, Christian Köhler - ZFMK wrote: >

Re: Help in resolving the below retrieval issue

2013-09-10 Thread Jack Krupansky
What release of Solr are you using? It appears that the hyphen is being treated as an exclusion operator even though it is followed by a space. Solr 4.4 doesn't appear to do that, but maybe earlier releases had a problem. In any case, be careful with leading hyphen in queries since it does me

Re: Help in resolving the below retrieval issue

2013-09-10 Thread Prathik Puthran
Thanks Erick for the response. I tried to debug the query. Below is the response in the debug node Rahul - kumarRahul - kumar+text:Rahul -text:kumar+text:Rahul -text:kumarLuceneQParserRahul - kumar+text:rahul -text:kumar Does it mean the query parser has parsed it to tokens "Rahul -" and "kumar"

RE: Need help with delta import

2013-09-10 Thread suren
Any update? I am also having the same issue. pls reply. This XML file does not appear to have any style information associated with it. The document tree is shown below. 0 7 db-data-config.xml delta-import idle 2 1 0 2013-09-10 07:46:34 2013-09-10 07:46:34 2013-09-10 07:46:35 2013-09-10 0

Re: Combining Solr score with customized user ratings for a document

2013-09-10 Thread Amit Jha
You can use DB for storing user preferences and later if you want you can flush them to solr as an update along with userid. Or you may add a result pipeline filter Rgds AJ On 13-Feb-2013, at 17:50, Á_o wrote: > Hi: > > I am working on a proyect where we want to recommend our users pr

Re: Unexpected value for boolean field in FunctionQuery

2013-09-10 Thread Elodie Sannier
By the way Yonik which version do you use (4.4.0 or nightly) ? Elodie On 09/10/2013 04:06 PM, Elodie Sannier wrote: I didn't forget to commit my changes. I used commands: java -Durl=http://localhost:8983/solr/collection1/update -jar post.jar ipod_video.xml curl 'http://localhost:8983/solr/colle

Re: Does configuration change requires Zookeeper restart?

2013-09-10 Thread Shawn Heisey
On 9/10/2013 12:45 AM, Upayavira wrote: > Upload changed config files to zookeeper, using the zookeeper cli, which > I think is in example/cloud-scripts. Then use the collections api, over > http, to reload the collection. If you are on 4.3.1 or earlier, you'll need to restart Solr. There's a bug

Re: Unexpected value for boolean field in FunctionQuery

2013-09-10 Thread Elodie Sannier
I didn't forget to commit my changes. I used commands: java -Durl=http://localhost:8983/solr/collection1/update -jar post.jar ipod_video.xml curl 'http://localhost:8983/solr/collection1/update/?commit=true' When I use your url example http://localhost:8983/solr/select?q=*:*&rows=100&fl=id,inStock

Re: Facet Sort with non ASCII Characters

2013-09-10 Thread Toke Eskildsen
On Mon, 2013-09-09 at 13:16 +0200, Sandro Zbinden wrote: > Is there a plan to add support for alphabetical facet sorting with non > ASCII Characters ? Not to my knowledge. I discussed an idea a year ago about handling it with modified ICUCollatorKeys, but that solution does not work well with the

URLDataSource & PlainTextEntityProcessor not working

2013-09-10 Thread Raheel Hasan
Hi, I am trying to load data (as plaint text) from a URL. For this I am using URLDataSource & PlainTextEntityProcesso. However, I have the following not working. I checked access logs of my web server, the url is not even getting called: http://localhost/update_1/test.txt"

Re: Unexpected value for boolean field in FunctionQuery

2013-09-10 Thread Yonik Seeley
I just tried a simple test with the example data, and things seem to be working fine... I tried this: http://localhost:8983/solr/select ?q=*:* &rows=100 &fl=id, inStock, if(inStock,10,0) I saw values of 10 when inStock==true and values of 0 when it was missing or explicitly false. Perhaps y

Unexpected value for boolean field in FunctionQuery

2013-09-10 Thread Elodie Sannier
Hello, I am using the solr version 4.4.0, when I'm using FunctionQuery with boolean fields, it seems that the default field value is "true" for documents without a value in the field. The page http://wiki.apache.org/solr/FunctionQuery#field says "0 is returned for documents without a value i

Solr doesnt return answer when searching numbers

2013-09-10 Thread Mysurf Mail
I am querying using http://...:8983/solr/vault/select?q="design test"&fl=PackageName I get 3 result: - design test - design test 2013 - design test for jobs Now when I query using q="test for jobs" -> I get only "design test for jobs" But when I query using q = 2013 http://...:8983/s

Re: charfilter doesn't do anything

2013-09-10 Thread Jack Krupansky
Okay, I can repro the problem. Yes, in appears that the pattern replace char filter does not default to multiline mode for pattern matching, so on one line and on another line cannot be matched. Now, whether that is by design or a bug or an option for enhancement is a matter for some committ

Facet values for spacial field

2013-09-10 Thread Christian Köhler - ZFMK
Hi, I use the new SpatialRecursivePrefixTreeFieldType field to store geo coordinates (e.g. 14.021666,51.5433353 ). I can retrieve the coordinates just find so I am sure they are indexed correctly. However when I try to create facets from this field, solr returns something which looks like a hash

Help in resolving the below retrieval issue

2013-09-10 Thread Prathik Puthran
Hi, I am facing the below issue where in Solr is not retrieving the indexed word for some cases. This happens whenever the indexed word has string " - " (quotes for clarity) as substring i.e word prefix followed by a space which is followed by '-' again followed by a space and followed by the res

AW: Facet sort descending

2013-09-10 Thread Sandro Zbinden
Hey Peter To sort these on the client side is no problem. But we have a problem using pivot facet queries. If we set the facet.limit=-1 the load can cause OutOfMemoryExceptions on the server side. Thanks again for the patch. I will keep an eye on it. Sandro -Ursprüngliche Nachricht-

Re: Facet sort descending

2013-09-10 Thread Peter Sturge
Hi Sandro, Ah, ok, this is quite simple then - you should be able to sort these any way you like in your client code since the facet data is all there. On the server-side, you can look at https://issues.apache.org/jira/browse/SOLR-1672 - please note this is an old patch for 1.4, so this won't work

Re: charfilter doesn't do anything

2013-09-10 Thread Andreas Owen
ok i am getting there now but if there are newlines involved the regex stops as soon as it reaches a "\r\n" even if i try [\t\r\n.]* in the regex. I have to get rid of the newlines. why isn't whitespaceTokenizerFactory the right element for this? On 10. Sep 2013, at 1:21 AM, Jack Krupansky wro

Re: faceting from multiValued field

2013-09-10 Thread Raheel Hasan
wow, that did it... fantastic... Thanks a lot :) On Tue, Sep 10, 2013 at 3:56 PM, Rafał Kuć wrote: > Hello! > > Your field needs to be indexed in order for faceting to work. > > -- > Regards, > Rafał Kuć > Sematext :: http://sematext.com/ :: Solr - Lucene - ElasticSearch > > > Hi, > > > I am

Re: faceting from multiValued field

2013-09-10 Thread Rafał Kuć
Hello! Your field needs to be indexed in order for faceting to work. -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - ElasticSearch > Hi, > I am having a problem with multiValued field and Faceting > This is the schema: > required="false" omitTermFreqAndPositions="

faceting from multiValued field

2013-09-10 Thread Raheel Hasan
Hi, I am having a problem with multiValued field and Faceting This is the schema: all I get is: Note: the data is correctly placed in the field as the query results shows. However, the facet is not working. Could anyone tell me how to achieve it? Thanks a lot. -- Regards, Raheel Hasan

AW: Facet sort descending

2013-09-10 Thread Sandro Zbinden
Hi @Peter This is actually the requirement. We have. For both sort options (index, count) we would like to have the possibility to add the desc option. Instead of this result q=*:*&facet=true&facet.field=image_text&facet.sort=index&rows=0 12 23 200/int> We would like to a

Solr Suggester - How do I filter autocomplete results

2013-09-10 Thread Mysurf Mail
I want to filter the auto complete results from my suggester Lets say I have a book table Table (Id Guid, BookName String, BookOwner id) I want each user to get a list to autocomplete from its own books. I want to add something like the http://.../solr/vault/suggest?q=c&fq=BookOwner:3 This doe

Re: Solr suggest - How to define solr suggest as case insensitive

2013-09-10 Thread Mysurf Mail
I have added it and it didnt work. Still returning different result to 1=C and q=c On Tue, Sep 10, 2013 at 1:52 AM, Chris Hostetter wrote: > > : This is probably because your dictionary is made up of all lower case > tokens, > : but when you query the spell-checker similar analysis doesnt happen

Re: How to facet data from a multivalued field?

2013-09-10 Thread Rah1x
Hi buddy, I am having this problem that I cant even reach to what you did at first step.. all I get is: This is the schema: Note: the data is correctly placed in the field as the query results shows. However, the facet is not working. Could you please share the schema of what you di

Re: Tweaking boosts for more search results variety

2013-09-10 Thread Marc Sturlese
This is totally deprecated but maybe can be helpful if you want to re-sort some documents https://issues.apache.org/jira/browse/SOLR-1311 -- View this message in context: http://lucene.472066.n3.nabble.com/Tweaking-boosts-for-more-search-results-variety-tp4088302p4089044.html Sent from the Solr

Re: Facet sort descending

2013-09-10 Thread Peter Sturge
Hi, This question could possibly be about rare idr facet counting - i.e. retrun the facets counts with the least values. I remember doing a patch for this years ago, but then it broke when some UninvertedField facet optimization came in around ~3.5 time. It's a neat idea though to have an option t

Re: Data import

2013-09-10 Thread Luís Portela Afonso
OK, that makes sense, but when solr when run dataimport identifies the new an existing document with the same uniquekey that is being indexed,right? Because when the same document exists on the source, it deletes it and creates a new one. Instead of that, is not possible to discard the new docume

AW: Facet Sort with non ASCII Characters

2013-09-10 Thread Sandro Zbinden
Hey Yonik It installed the latest Solr (Solr 4.4) and started the jetty configured in the example directory. To the core collection1 I added three titles. a, b, ä curl http://localhost:8983/solr/update/json -H 'Content-type:application/json' -d '[{"id" : "1", "title" : "a"},{"id" : "2", "titl

Re: solr suggestion -

2013-09-10 Thread Mysurf Mail
Yes. I understood that from the result. But how do I change that behaviour? "Don't do any analysis on the field you are using for suggestion" Please elaborate. On Mon, Sep 9, 2013 at 8:48 PM, tamanjit.bin...@yahoo.co.in < tamanjit.bin...@yahoo.co.in> wrote: > Don't do any analysis on the field