Re: Solr Cloud 4.2 - Distributed Requests failing with NPE

2013-04-25 Thread Sudhakar Maddineni
Thank you Hoss for looking into it. -Sudhakar. On Thu, Apr 25, 2013 at 6:50 PM, Chris Hostetter wrote: > > : "trace":"java.lang.NullPointerException\r\n\tat > : > org.apache.solr.handler.component.HttpShardHandler.checkDistributed(HttpShardHandler.java:340)\r\n\tat > : > org.apache.solr.handler

Re: Facets with 5000 facet fields - Out of memory error during the query time

2013-04-25 Thread sivaprasad
I got more information with the responses.Now, It's time to re look into the number of facets to be configured. Thanks, Siva http://smarttechies.wordpress.com/ -- View this message in context: http://lucene.472066.n3.nabble.com/Facets-with-5000-facet-fields-Out-of-memory-error-during-the-quer

Re: Question on storage and index/data management in solr

2013-04-25 Thread Vinay Rai
Thank you very much Shawn for a detailed response. Let me read all the documentation you pointed to and digest it. Sure, if I do use using solr and need to make this change, I would love to also submit it to the Lucene/Solr project. Regards, Vinay From: Shawn

Re: How do set compression for compression on stored fields in SOLR 4.2.1

2013-04-25 Thread Chris Hostetter
: Subject: How do set compression for compression on stored fields in SOLR 4.2.1 : : https://issues.apache.org/jira/browse/LUCENE-4226 : It mentions that we can set compression mode: : FAST, HIGH_COMPRESSION, FAST_UNCOMPRESSION. The compression details are hardcoded into the various codecs. If y

Re: Solr Cloud 4.2 - Distributed Requests failing with NPE

2013-04-25 Thread Chris Hostetter
: "trace":"java.lang.NullPointerException\r\n\tat : org.apache.solr.handler.component.HttpShardHandler.checkDistributed(HttpShardHandler.java:340)\r\n\tat : org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:182)\r\n\tat yea, definitely a bug. Raintung repor

Re: Luke misreporting index-time boosts?

2013-04-25 Thread Chris Hostetter
: Looking at output from Luke, however, all fields are reported as having a : boost of 1.0. : : The following possibilities occur to me. : : (1) The entire index-time-boosting approach is misconceived Yes. see Erick's comments about index boosts vs query boosts and why what you are trying to d

RE: Using another way instead of DIH

2013-04-25 Thread xiaoqi
Thanks for help . "data-config.xml" ? i can not find this file , u mean data-import.xml or solrconfig.xml ? -- View this message in context: http://lucene.472066.n3.nabble.com/Using-another-way-instead-of-DIH-tp4058937p4059067.html Sent from the Solr - User mailing list archive at Nabble.co

Re: Facets with OR clause

2013-04-25 Thread Erick Erickson
If you're talking about _filter queries_, Kai's answer is good But your question is confusing. You talk about facet queries, but then use fq, which is "filter query" and has nothing to do with facets at all unless you're talking about turning facet information into filter queries.. FWIW, Eric

Re: Luke misreporting index-time boosts?

2013-04-25 Thread Erick Erickson
I think you're kinda missing the idea of index time boosting. The semantic of this (as I remember Chris Hostetter explaining) is "this document's content is more important than other document's content". By doing an index-time boost that's the same for all your documents, you're effectively doing

Re: Query specific replica

2013-04-25 Thread Erick Erickson
bq: I was wondering wether it is possible to query the same core every request, Not that I know of. You can ping a single node by appending &distrib=false, but that won't then look at multiple shards. If you don't have any shards, this would work I think... Best Erick On Tue, Apr 23, 2013 at 6:3

Re: Too many close, count -1

2013-04-25 Thread Erick Erickson
One outside possibility (and 4.3 should refuse to start if this is the case). Is it possible that more than one of your cores has the same name? FWIW, Erick On Tue, Apr 23, 2013 at 5:30 PM, Chris Hostetter wrote: > > : Subject: Re: Too many close, count -1 > > Thanks for the details, nothing jum

Re: How To Make Index Backup at SolrCloud?

2013-04-25 Thread Otis Gospodnetic
You can use the index backup command that's part of index replication, check the Wiki. Otis Solr & ElasticSearch Support http://sematext.com/ On Apr 25, 2013 5:23 PM, "Furkan KAMACI" wrote: > I use SolrCloud. Let's assume that I want to move all indexes from one > place to another. There maybe t

Re: SolrJ Custom RowMapper

2013-04-25 Thread Sudhakar Maddineni
Hey Luis, Check this example in the source:TestDocumentObjectBinder https://svn.apache.org/repos/asf/lucene/dev/tags/lucene_solr_4_2_1/solr/solrj/src/test/org/apache/solr/client/solrj/beans/TestDocumentObjectBinder.java Thx,Sudhakar. On Thu, Apr 25, 2013 at 7:56 AM, Luis Lebolo wrote: > Hi Al

Re: Need to log query request before it is processed

2013-04-25 Thread Sudhakar Maddineni
HI Tim, Have you tried by enabling the logging levels on httpclient, which is used by solrj classes internally? Thx,Sudhakar. On Thu, Apr 25, 2013 at 10:12 AM, Timothy Potter wrote: > I would like to log query requests before they are processed. > Currently, it seems they are only logged afte

Re: How To Make Index Backup at SolrCloud?

2013-04-25 Thread Timothy Potter
Hi Furkan, So here's what I do (not saying this is the best method, but definitely works great albeit with a little work on my part) The replication handler (which must be enabled for Solr cloud) supports a backup command, e.g. .../replication?command=backup&location=/mnt/backups >From what I've

Re: Massive Positions Files

2013-04-25 Thread Jack Krupansky
These are the "postings" for all terms - the lists of positions for every occurrence of every term for all documents. Sounds to me like it could be huge. Did you try a back of the envelope calculation? 3.25 GB divided by 180K = 18 K per doc (call it 2K). How many "words" in a document? You sa

Re: filter before facet

2013-04-25 Thread Daniel Tyreus
On Thu, Apr 25, 2013 at 12:35 AM, Toke Eskildsen wrote: > > > > This leads me to believe that the FQ is being applied AFTER the facets > are > > calculated on the whole data set. For my use case it would make a ton of > > sense to apply the FQ first and then facet. Is it possible to specify > thi

How To Make Index Backup at SolrCloud?

2013-04-25 Thread Furkan KAMACI
I use SolrCloud. Let's assume that I want to move all indexes from one place to another. There maybe two reasons for that: First one is that: I will close all my system and I will use new machines with previous indexes (if it is a must they may have same network topology) at anywhere else after so

Re: Reordered DBQ.

2013-04-25 Thread Marcin Rzewucki
OK. Thanks for explanation. On 23 April 2013 23:16, Yonik Seeley wrote: > On Tue, Apr 23, 2013 at 3:51 PM, Marcin Rzewucki > wrote: > > Recently I noticed a lot of "Reordered DBQs detected" messages in logs. > As > > far as I checked in logs it could be related with deleting documents, but > >

Re: Cloudspace and Solr Support Page

2013-04-25 Thread Jan Høydahl
Hi, Just give your WIKI user name and we'll give you access to edit that page to add yourself. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 25. apr. 2013 kl. 21:39 skrev Nina Talley : > Hi there, > > We offer Solr support

Massive Positions Files

2013-04-25 Thread Mike
Hi All, I'm indexing a pretty large collection of documents (about 500K relatively long documents taking up >1TB space, mostly in MS Office formats), and am confused about the file sizes in the index. I've gotten through about 180K documents, and the *.pos files add up to 325GB, while the all of

Cloudspace and Solr Support Page

2013-04-25 Thread Nina Talley
Hi there, We offer Solr support and were wondering how we would go about being added to the Solr Support page ? Thanks so much for your time! -- [image: Cloudspace.com] Nina TalleyAccount ManagerOffice: 877.823.8808 11551 Uni

Atomic update issue with 4.0 and 4.2.1

2013-04-25 Thread David Fennessey
Hi everyone , We have hit this strange bug using the atomic update functionality of both SOLR 4.0 and SOLR 4.2.1. We're currently posting a JSON formatted file to the core's updater using a simple curl method however we've run a very bizarre error where periodically it will fail and return a 4

Re: Solr metrics in Codahale metrics and Graphite?

2013-04-25 Thread Shawn Heisey
On 4/25/2013 6:30 AM, Dmitry Kan wrote: > We are very much interested in 3.4. > > On Thu, Apr 25, 2013 at 12:55 PM, Alan Woodward wrote: >> This is on top of trunk at the moment, but would be back ported to 4.4 if >> there was interest. This will be bad news, I'm sorry: All remaining work on 3.

Re: Question on storage and index/data management in solr

2013-04-25 Thread Shawn Heisey
On 4/25/2013 8:39 AM, Vinay Rai wrote: > 1. Keep each of last 24 hours segments separate. > 2. Segments generated between last 48 to 24 hours to be merged into one. > Similarly, for segments created between 72 to 48 hours and so on for last 1 > week. > 3. Similarly, merge previous 4 week's data i

Re: What is the difference between a Join Query and Embedded Entities in Solr DIH?

2013-04-25 Thread Shawn Heisey
On 4/25/2013 8:17 AM, Gustav wrote: > Are these two methods functionally different? Is there a performance > difference? > > Another though would be that, if using join tables in MySQL, using the SQL > query method with multiple joins could cause multiple documents to be > indexed instead of one.

Re: Problem with solr deployment on weblogic 10.3

2013-04-25 Thread Shawn Heisey
On 4/25/2013 12:04 AM, Shawn Heisey wrote: > It looks like the solution is adding some config to the weblogic.xml > file in the solr.war so that weblogic prefers application classes. I > filed SOLR-4762. I do not know if this change might have unintended > consequences. > > http://ananthkannan.b

Need to log query request before it is processed

2013-04-25 Thread Timothy Potter
I would like to log query requests before they are processed. Currently, it seems they are only logged after being processed. I've tried enabling a finer logging level but that didn't seem to help. I've enabled request logging in Jetty but most queries come in as POSTs from SolrJ I was thinking of

Re: how to get & display Jessionid with solr results

2013-04-25 Thread Michael Della Bitta
You should look into the documentation of your load balancer to see how you can enable sticky sessions. If you've already done that and the load balancer requires jsessionid rather than using it's own sticky session method, it looks like documentation for using jsessionid with Jetty is here: http:/

Re: Deletes and inserts

2013-04-25 Thread Michael Della Bitta
We've successfully reused the same config in Zookeeper across multiple collections and using aliases. Could you describe your problem? What does the error say? Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 w

Re: How to Clean Zookeeper Data for Solr

2013-04-25 Thread Furkan KAMACI
Ok, it works 2013/4/25 Mark Miller > I think it's numShards, not numshards. > > - Mark > > On Apr 25, 2013, at 12:07 PM, Furkan KAMACI > wrote: > > > Hi; > > If you can help it would be nice: > > > > I have erased the data. I use that commands: > > > > Firstly I do that: > > > > java -Xms512M -

Re: How do set compression for compression on stored fields in SOLR 4.2.1

2013-04-25 Thread Otis Gospodnetic
Hi, Is the question how/where to set that? This is what I found in my repo checkout: $ ffxg COMPRE ./core/src/test-files/solr/collection1/conf/solrconfig-slave.xml: COMPRESSION Hm, but that's about replication compression. Maybe we don't have any examples of this in configs? Otis -- Solr & E

Re: How to Clean Zookeeper Data for Solr

2013-04-25 Thread Mark Miller
I think it's numShards, not numshards. - Mark On Apr 25, 2013, at 12:07 PM, Furkan KAMACI wrote: > Hi; > If you can help it would be nice: > > I have erased the data. I use that commands: > > Firstly I do that: > > java -Xms512M -Xmx5120M -Dsolr.solr.home=/home/solr-4.2.1/solr > -Dsolr.data.

Re: How to Clean Zookeeper Data for Solr

2013-04-25 Thread Furkan KAMACI
Ooppss, I wrote numshards, I think it should be numShards 2013/4/25 Michael Della Bitta > Today I learned there's a clear command in the command line util. :) > > Michael Della Bitta > > > Appinions > 18 East 41st Street, 2nd Floor > New York, NY

Re: How to Clean Zookeeper Data for Solr

2013-04-25 Thread Michael Della Bitta
Today I learned there's a clear command in the command line util. :) Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Thu, Apr 25, 2013 at 12:00 PM, Mark Mill

Re: How to Clean Zookeeper Data for Solr

2013-04-25 Thread Furkan KAMACI
Hi; If you can help it would be nice: I have erased the data. I use that commands: Firstly I do that: java -Xms512M -Xmx5120M -Dsolr.solr.home=/home/solr-4.2.1/solr -Dsolr.data.dir=/home/solr-4.2.1/solr/data -Dnumshards=2 -Dbootstrap_confdir=/home/solr-4.2.1/solr/collection1/conf -Dcollection.co

Re: How to Clean Zookeeper Data for Solr

2013-04-25 Thread Mark Miller
Of course deleting the collection and then recreating it should also work - if it doesn't, there is a bug to address. - Mark On Apr 25, 2013, at 12:00 PM, Mark Miller wrote: > What are you doing to clean zk? > > You should be able to simply use the ZkCli clear cmd: > > http://wiki.apache.org

Re: How to Clean Zookeeper Data for Solr

2013-04-25 Thread Mark Miller
What are you doing to clean zk? You should be able to simply use the ZkCli clear cmd: http://wiki.apache.org/solr/SolrCloud#Command_Line_Util Just make sure you stop your Solr instances before clearing it. Clearing out zk from under a running Solr instance is not a good thing to do. This shoul

Re: How to Clean Zookeeper Data for Solr

2013-04-25 Thread Furkan KAMACI
You said: Otherwise, use rmr to delete these files and folders. Can you give an example? 2013/4/25 Otis Gospodnetic > Nice. Sounds like FAQ/Wiki material, Mike! :) > > Otis > -- > Solr & ElasticSearch Support > http://sematext.com/ > > > > > > On Thu, Apr 25, 2013 at 11:33 AM, Michael Della B

Re: How to Clean Zookeeper Data for Solr

2013-04-25 Thread Otis Gospodnetic
Nice. Sounds like FAQ/Wiki material, Mike! :) Otis -- Solr & ElasticSearch Support http://sematext.com/ On Thu, Apr 25, 2013 at 11:33 AM, Michael Della Bitta wrote: > This is what I have done. > > 1. Turn off all your Solr nodes. > > 2. Ssh to one of your zookeeper machines and run Zookeepe

Re: Deletes and inserts

2013-04-25 Thread Jon Strayer
Thanks Michael, How do you handle configurations in zookeeper? I tried reusing the same configuration but I'm getting an error message that may mean that doesn't work. Or maybe I'm doing something wrong. On Wed, Apr 24, 2013 at 12:50 PM, Michael Della Bitta < michael.della.bi...@appinions.com

RE: Using another way instead of DIH

2013-04-25 Thread Dyer, James
If you post your data-config.xml here, someone might be able to find something you could change to speed things up. If the issue is parallelization, then you could possibly partition your data somehow and then run multiple DIH request handlers at the same time. This might be easier than writin

Re: How to Clean Zookeeper Data for Solr

2013-04-25 Thread Michael Della Bitta
This is what I have done. 1. Turn off all your Solr nodes. 2. Ssh to one of your zookeeper machines and run Zookeeper's CLI. On my machine, it's in /usr/lib/zookeeper/bin. 3. If you've chrooted Solr, just rmr /solr_chroot_dir. Otherwise, use rmr to delete these files and folders: clusterstate.

Re: Exact matching in Solr 3.6.1

2013-04-25 Thread Sandeep Mestry
Agree with Jack. The current field type text_general is designed to match the query tokens instead of exact matches - so it's not able to fulfill your requirements. Can you use flat file as spell check dictionary instead and that way you can sear

How to Clean Zookeeper Data for Solr

2013-04-25 Thread Furkan KAMACI
I have a Zookeepeer ensemble with three machines. I have started a cluster with one shard. However I decided to change my shard number. I want to clean Zookeeper data but whatever I do I always get one shard and rest of added Solr nodes are as replica. What should I do?

Re: Exact matching in Solr 3.6.1

2013-04-25 Thread Jack Krupansky
Well then just do an exact match ONLY! It sounds like you haven't worked out the inconsistencies in your requirements. To be clear: We're not offering you "solutions" - that's your job. We're only pointing out tools that you can use. It is up to you to utilize the tools wisely to implement y

Re: Using Solr For a Real Search Engine

2013-04-25 Thread Furkan KAMACI
Thanks, Otis I got it. 2013/4/25 Otis Gospodnetic > Hi, > > No, start.jar is not deployed. That *is* Jetty. > This is what the real Embedded Jetty is about: > http://wiki.eclipse.org/Jetty/Tutorial/Embedding_Jetty > > What we have here is Solr is just an *included* Jetty, so it's easier > to ge

Re: what is the maximum XML file size to import?

2013-04-25 Thread Otis Gospodnetic
Hi, Even if you could import giant files, I'd avoid it because it feels like just asking for trouble. Chunk the file into smaller pieces. You can index such smaller pieces in parallel, too, and end up with faster indexing as the result. Otis -- Solr & ElasticSearch Support http://sematext.com/

Re: Using Solr For a Real Search Engine

2013-04-25 Thread Otis Gospodnetic
Hi, No, start.jar is not deployed. That *is* Jetty. This is what the real Embedded Jetty is about: http://wiki.eclipse.org/Jetty/Tutorial/Embedding_Jetty What we have here is Solr is just an *included* Jetty, so it's easier to get started. That's all. :) Otis -- Solr & ElasticSearch Support ht

Using another way instead of DIH

2013-04-25 Thread xiaoqi
hi,all i using DIH to build index is slow , when it fetch 2 million rows , it will spend 20 minutes , very slow. i am not very familar with solr , try to using lucene direct building index file from db then move to solr folder. i am not sure ,that is right way. or any other good way? thank

SolrJ Custom RowMapper

2013-04-25 Thread Luis Lebolo
Hi All, Does SolrJ have an option for a custom RowMapper or BeanPropertyRowMapper (I'm using Spring/JDBC terms). I know the QueryResponse has a getBeans method, but I would like to create my own mapping and plug it in. Any pointers? Thanks, Luis

RE: What is the difference between a Join Query and Embedded Entities in Solr DIH?

2013-04-25 Thread Dyer, James
Gustav, DIH should give you the same results in both scenarios. The performance trade-offs depend on your data. In your case, it looks like there is a 1-to-1 or many-to-1 relationship between "item" and "member", so use the SQL Join. You'll get all of your data in one query and you'll be usi

Re: What is the difference between a Join Query and Embedded Entities in Solr DIH?

2013-04-25 Thread Alexandre Rafalovitch
I think JOIN is more performant as - by default - DIH will run an inner query for each outer one. You can use cached source, but JOIN will be still more efficient. The nested entities are more useful when the sources are heterogeneous (e.g. DB and XML) or when you need to do custom transformers in

Re: Exact matching in Solr 3.6.1

2013-04-25 Thread vsl
Thanks for your reply but this solution does not fullfil my requirment because other documents (not exact matched) will be returned as well. -- View this message in context: http://lucene.472066.n3.nabble.com/Exact-matching-in-Solr-3-6-1-tp4058865p4058929.html Sent from the Solr - User mailing

Question on storage and index/data management in solr

2013-04-25 Thread Vinay Rai
Hi, I am relatively new to solr and evaluating it for my project. I would have lots of data coming in at a fast rate (say 10 MB per sec) and I would need the recent data (last 24 hours, or last 100GB) to be searchable faster than the old data. I did a bit of reading on the controls provided by

What is the difference between a Join Query and Embedded Entities in Solr DIH?

2013-04-25 Thread Gustav
Hello guys, i saw this thread on stackoverflow, but still not satisfied with the answers. I am trying to index data across multiple tables using Solr's Data Import Handler. The official wiki on the DIH suggests using embedded entities to link multiple tables like so: Are th

Re: [solr 3.4] anomaly during distributed facet query with 102 shards

2013-04-25 Thread Dmitry Kan
Thanks, Yonik. Yes, I supposed that. We are in the pre-release phase, so we have the pressure. Solr 3.4. Would setting up 4.2.1 router work with 3.4 shards? On 25 Apr 2013 17:11, "Yonik Seeley" wrote: > On Thu, Apr 25, 2013 at 8:32 AM, Dmitry Kan wrote: > > Are there any distrib facet gurus on

Re: [solr 3.4] anomaly during distributed facet query with 102 shards

2013-04-25 Thread Yonik Seeley
On Thu, Apr 25, 2013 at 8:32 AM, Dmitry Kan wrote: > Are there any distrib facet gurus on the list? I would be ready to try > sensible ideas, including on the source code level, if someone of you could > give me a hand. The Lucene/Solr Revolution conference is coming up next week, so I think many

Re: Solr maven install - authorization problem when downloading maven.restlet.org dependencies

2013-04-25 Thread Steve Rowe
Hi Shahar, On a Windows 7 box, after downloading solr-4.2.1-src.tgz from one of the Apache mirrors and unpacking it, I did the following from a cmd window: PROMPT> cd solr-4.2.1 PROMPT> ant get-maven-poms PROMPT> cd maven-build PROMPT> mvn install Is the above what you did? After a while, I se

Re: Did something change with Payloads?

2013-04-25 Thread hariistou
Hi Jim, I faced almost the same issue with payloads recently, and thought I would rather write about it. Please see the link below (my blog). I hope it helps. http://hnagtech.wordpress.com/2013/04/19/using-payloads-with-solr-4-x/

Re: Preparing Solr 4.2.1 for IntelliJ fails - invalid sha1

2013-04-25 Thread Steve Rowe
Hi Shahar, I suspect you may have an older version of Ivy installed - the errors you're seeing look like IVY-1194 , which was fixed in Ivy 2.2.0. Lucene/Solr uses Ivy 2.3.0. Take a look in C:\Users\account\.ant\lib\ and remove older versions of

Re: Solr maven install - authorization problem when downloading maven.restlet.org dependencies

2013-04-25 Thread Dmitry Kan
Building the solr 4.2.1 worked fine for me. Here is the relevant portion of ivy-settings.xml that I had to change: Dmitry On Thu, Apr 25, 2013 at 3:53 PM, Shahar Davidson wrote: > Hi, > > I'm trying to build Solr 4.2.x with Maven and I'm ge

FieldCache insanity with field used as facet and group

2013-04-25 Thread Elodie Sannier
Hello, I am using the Lucene FieldCache with SolrCloud and I have "insane" instances with messages like: VALUEMISMATCH: Multiple distinct value objects for SegmentCoreReader(​owner=_11i(​4.2.1):C4493997/853637)+merchantid 'SegmentCoreReader(​owner=_11i(​4.2.1):C4493997/853637)'=>'merchantid',

Re: Exact matching in Solr 3.6.1

2013-04-25 Thread Majirus FANSI
Hi Pawel, If you are searching on any field of type "text_general" as defined in your schema, you are stuck with the porter stemmer. In fact in your setting solr is not aware of a term like "cats", but "cat". Thus no way to do exact match with "cats" in this case. What you can do is creating a new

Solr maven install - authorization problem when downloading maven.restlet.org dependencies

2013-04-25 Thread Shahar Davidson
Hi, I'm trying to build Solr 4.2.x with Maven and I'm getting the following error in solr-core: [INFO] [INFO] BUILD FAILURE [INFO] [INFO] Total time:

Re: Exact matching in Solr 3.6.1

2013-04-25 Thread vsl
Exact matching is just one of my cases. Currently I perform search on field with given definition: This field definition fullfils all other requirmen

Re: [solr 3.4] anomaly during distributed facet query with 102 shards

2013-04-25 Thread Dmitry Kan
Are there any distrib facet gurus on the list? I would be ready to try sensible ideas, including on the source code level, if someone of you could give me a hand. Dmitry On Wed, Apr 24, 2013 at 3:08 PM, Dmitry Kan wrote: > Hello list, > > We deal with an anomaly when doing a distributed facet

Re: Solr metrics in Codahale metrics and Graphite?

2013-04-25 Thread Dmitry Kan
We are very much interested in 3.4. On Thu, Apr 25, 2013 at 12:55 PM, Alan Woodward wrote: > This is on top of trunk at the moment, but would be back ported to 4.4 if > there was interest. > > Alan Woodward > www.flax.co.uk > > > On 25 Apr 2013, at 10:32, Dmitry Kan wrote: > > > Hi Alan, > > >

Re: Exact matching in Solr 3.6.1

2013-04-25 Thread Jack Krupansky
It sounds as if your field type is doing stemming - mapping "cats" to "cat". That is a valuable feature of search, but if you wish to turn it off... go ahead and do so by editing the field type. But just be aware that turning off stemming is a great loss of search flexibility. Who knows, maybe

Re: Exact matching in Solr 3.6.1

2013-04-25 Thread vsl
I will explain my case in the example below: We have three documents with given content: First document: london cats glenvilet Second document london cat glenvilet leeds Third document london cat glenvilet Search term: "cats" AND London NOT Leeds Expected result: First document Current resu

Re: Exact matching in Solr 3.6.1

2013-04-25 Thread Sandeep Mestry
I think in that case, making a field String type is your option, however remember that it'd be case sensitive. Another approach is to create a case insensitive field type and doing searches on those fields only. Can you provide your fie

Re: Exact matching in Solr 3.6.1

2013-04-25 Thread Jack Krupansky
As indicated previously, yes, exact matching is possible in Solr. You, the developer, have full control over the exactness or inexactness of all queries. If any query is inexact in some way, it is solely due to decisions that you, the developer, have made. Generally speaking, inexactness, fuzz

Re: Exact matching in Solr 3.6.1

2013-04-25 Thread vsl
Thanks for your reply. I am using edismax as well. What I want to get is the exact match without other results that could be close to the given term. -- View this message in context: http://lucene.472066.n3.nabble.com/Exact-matching-in-Solr-3-6-1-tp4058865p4058876.html Sent from the Solr - User

Re: Solr faceted search UI

2013-04-25 Thread Majirus FANSI
Hi Rocha, In your webapp I guess you have at list a view and a service layers. The indexing and search modules should preferably be hosted at the service layer. I recommend you read the Api doc ( http://lucene.apache.org/solr/4_2_1/solr-solrj/index.html) to get a sense of what you can do with SolrJ

Preparing Solr 4.2.1 for IntelliJ fails - invalid sha1

2013-04-25 Thread Shahar Davidson
Hi all, I'm trying to run 'ant idea' on 4.2.* and I'm getting "invalid sha1" error messages. (see below) I'll appreciate any help, Shahar === . . . resolve ivy:retrieve :: problems summary :: WARNINGS problem while do

Re: Exact matching in Solr 3.6.1

2013-04-25 Thread Sandeep Mestry
Hi Pawel, Not sure which parser you are using, I am using edismax and tried using the bq parameter to boost the results having exact matches at the top. You may try something like: q="cats" AND London NOT Leeds&bq="cats"^50 In edismax, pf and pf2 parameters also need some tuning to get the result

Re: Solr metrics in Codahale metrics and Graphite?

2013-04-25 Thread Alan Woodward
This is on top of trunk at the moment, but would be back ported to 4.4 if there was interest. Alan Woodward www.flax.co.uk On 25 Apr 2013, at 10:32, Dmitry Kan wrote: > Hi Alan, > > Great! What is the solr version you are patching? > > Speaking of graphite, we have set it up recently to moni

Exact matching in Solr 3.6.1

2013-04-25 Thread vsl
Hi, is it possible to get exact matched result if the search term is combined e.g. "cats" AND London NOT Leeds In the previus threads I have read that it is possible to create new field of String type and perform phrase search on it but nowhere the above mentioned combined search term had been t

Re: Solr metrics in Codahale metrics and Graphite?

2013-04-25 Thread Dmitry Kan
Hi Alan, Great! What is the solr version you are patching? Speaking of graphite, we have set it up recently to monitor our shard farm. So far since the RAM usage has been most important metric we were fine with pidstat command and a little script generating stats for carbon. Having some additiona

Re: Solr 3.6.1: changing a field from stored to not stored

2013-04-25 Thread Majirus FANSI
Good to know I missed something about solr replication. Thanks Jan On 24 April 2013 17:42, Jan Høydahl wrote: > > I would create a new core as slave of the existing configuration without > > replicating the core schema and configuration. This way I can get the > > This won't work, as master/sla

Re: Solr metrics in Codahale metrics and Graphite?

2013-04-25 Thread Alan Woodward
Hi Walter, Dmitry, I opened https://issues.apache.org/jira/browse/SOLR-4735 for this, with some work-in-progress. Have a look! Alan Woodward www.flax.co.uk On 23 Apr 2013, at 07:40, Dmitry Kan wrote: > Hello Walter, > > Have you had a chance to get something working with graphite, codahale

Re: JVM Parameters to Startup Solr?

2013-04-25 Thread Furkan KAMACI
Could you explain that what you mean with such kind of scripts? What it checks and do exactly? 2013/4/25 Toke Eskildsen > On Wed, 2013-04-24 at 18:03 +0200, Mark Miller wrote: > > On Apr 24, 2013, at 12:00 PM, Mark Miller wrote: > > > > >> -XX:OnOutOfMemoryError="kill -9 %p" -XX:+HeapDumpOnOut

Re: solr.StopFilterFactory doesn't work with wildcard

2013-04-25 Thread Dmitry Baranov
1) I use StopFilterFactory in "multiterm" analyzer because without it "query" analizer doesn't work with multi-terms, in particular terms with wildcard. 2) I expect that: search_string_ss_i:(hp* pavilion* series* d4*) search_string_ss_i:(hp* pavilion* series* d4*) search_string_ss_i:hp* +search_str

Re: JVM Parameters to Startup Solr?

2013-04-25 Thread Toke Eskildsen
On Wed, 2013-04-24 at 18:03 +0200, Mark Miller wrote: > On Apr 24, 2013, at 12:00 PM, Mark Miller wrote: > > >> -XX:OnOutOfMemoryError="kill -9 %p" -XX:+HeapDumpOnOutOfMemoryError > > The way I like to handle this is to have the OOM trigger a little script or > set of cmds that logs the issue a

Re: filter before facet

2013-04-25 Thread Toke Eskildsen
On Wed, 2013-04-24 at 23:10 +0200, Daniel Tyreus wrote: > But why is it slow to generate facets on a result set of 0? Furthermore, > why does it take the same amount of time to generate facets on a result set > of 2000 as 100,000 documents? The default faceting method for your query is field cache

Re: Using Solr For a Real Search Engine

2013-04-25 Thread Furkan KAMACI
Hi Otis; You are right. start.jar starts up an Jetty and there is a war file under example directory and deploys start.jar to itself, is that true? 2013/4/25 Otis Gospodnetic > Suggestion : > Don't call this embedded Jetty to avoid confusion with the actual embedded > jetty. > > Otis > Solr & E

Re: Update on shards

2013-04-25 Thread Arkadi Colson
Hi It seems not to work in my case. We are using the solr php module for talking to Solr. Currently we have 2 collections 'intradesk' and 'lvs' for 10 solr hosts (shards: 5 - repl: 2). Because there is no more disc space I created 6 new hosts for collection 'messages' (shards: 3 - repl: 2).