Re: Update on shards

2013-04-23 Thread Arkadi Colson
We are using tomcat so we'll just wait. Hopefully it's fixed in 4.3 but we have a workaround for now so... What exactly is the difference between jetty and tomcat? We are using tomcat because we've read somewhere that it should be more robust in heavily loaded production environments. Arkad

Re: Solr index searcher to lucene index searcher

2013-04-23 Thread parnab kumar
Hi , Thanks Chris. I had been using Nutch 1.1 . The Nutch IndexSearcher used to call the lucene IndexSearcher . As the documents are collected in TopDocs in Lucene , before that is passed back to Nutch , i used to look into the top K matching documents , consult some external repository an

Re: Dataimport handler

2013-04-23 Thread William Bell
I also get this. 4.2+ On Fri, Apr 19, 2013 at 10:43 PM, Eric Myers wrote: > I have multiple parallel entities in my document and when I run an import > there are times like > xxx.last_index_time > where xxx is the name of the entity. > > I tried accessing these using dih.xxx.last_index_time but

Fields issue 4.2.1

2013-04-23 Thread William Bell
I am getting no results when using dynamic field, and the name begins with numbers. This is okay on 3.6, but does not work in 4.2. dynamic name: 1234566_user fl=1234566_user If I change it to name: user_1234566 it works. This appears to be a bug. -- Bill Bell billnb...@gmail.com cell 720-25

SOLR 4.3

2013-04-23 Thread William Bell
Can we get this in please to 4.3? https://issues.apache.org/jira/browse/SOLR-4746 -- Bill Bell billnb...@gmail.com cell 720-256-8076

Re: Book text with chapter line number

2013-04-23 Thread Walter Underwood
If you can represent your books in XML, then MarkLogic could do the job very cleanly. It isn't free, but it is very good. wunder On Apr 23, 2013, at 6:47 PM, Jason Funk wrote: > Is there a better tool than Solr to use for my situation? > > > On Apr 23, 2013, at 5:04 PM, Jack Krupansky wrote:

Re: Book text with chapter line number

2013-04-23 Thread Jason Funk
Is there a better tool than Solr to use for my situation? On Apr 23, 2013, at 5:04 PM, Jack Krupansky wrote: > There is no simple, obvious, and direct approach, right out of the box. Sure, > you can highlight passages of raw text, right out of the box, but that won't > give you chapters, page

How to let Solr load libs from within my JAR?

2013-04-23 Thread Xi Shen
Hi, I developed a data import handler, it has some dependent libraries. I deployed them in a parallel folder with my JAR and included the path in solrconfig.xml. It works fine. But I am thinking maybe I can pack those JAR libs within my JAR, but I got NoClassDefFoundError exception when executing

How to get/set customized Solr data source properties?

2013-04-23 Thread Xi Shen
Hi, I am working on a DataSource implementation. I want to get some customized properties when the *DataSource.init* method is called. I tried to add the attributes in the *dataSource* element, but I cannot read it from the *initProps* object. Either I added the attribute in a wrong element, or I

Re: EdgeGram filter

2013-04-23 Thread Jack Krupansky
Yeah, that’s the problem with Lucene/Solr doc. Sigh. Somebody needs to cover this material in a book. Oh yeah... we are - I'll let you know when early access begins. Actually, I just finished writing the chapter with full descriptions and examples for all the filters. -- Jack Krupansky -O

Re: EdgeGram filter

2013-04-23 Thread alxsss
Hi, I did not find any descriptions, except constructor and method names. Thanks. Alex. -Original Message- From: Markus Jelsma To: solr-user Sent: Tue, Apr 23, 2013 12:08 pm Subject: RE: EdgeGram filter Always check the javadocs. There's a lot of info to be found there: http

Re: Export Index and Re-Index XML

2013-04-23 Thread Jack Krupansky
When you export, explicitly list only the fields that you normally specify when adding a document. So, exclude _version_, which Solr will add. -- Jack Krupansky -Original Message- From: Kalyan Kuram Sent: Tuesday, April 23, 2013 8:40 PM To: solr-user@lucene.apache.org Subject: RE: Exp
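The advice above can be sketched as a small filter over an exported CSV file. This is only an illustration under stated assumptions: the helper name `drop_internal_fields` and the sample columns `id` and `title` are hypothetical, and only the `_version_` field name comes from the thread.

```python
import csv
import io

def drop_internal_fields(csv_text, internal=("_version_",)):
    """Return csv_text with the columns named in `internal` removed.

    Hypothetical helper: Solr itself does not ship this function.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    # Keep only the fields you would normally send when adding a document.
    kept = [name for name in reader.fieldnames if name not in internal]
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=kept,
                            extrasaction="ignore", lineterminator="\n")
    writer.writeheader()
    for row in reader:
        writer.writerow(row)  # extra keys such as _version_ are ignored
    return out.getvalue()

# Made-up sample export, for illustration only.
exported = "id,title,_version_\n1,First article,1433\n2,Second article,1434\n"
cleaned = drop_internal_fields(exported)
```

The cleaned text could then be posted to the CSV update handler; alternatively, listing the wanted fields explicitly at export time avoids the extra step.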

RE: Export Index and Re-Index XML

2013-04-23 Thread Kalyan Kuram
Thanks for the help, I could successfully export the file as CSV and import it into my local box. Now I have a different problem: I tried to re-index the content using post.sh and changing URL=http://dev-core-solr1:8983/solr/ZinioArticles/update/csv and now I see this error Befo

RE: Solr 3.6.1: changing a field from stored to not stored

2013-04-23 Thread Petersen, Robert
Hey I just want to verify one thing before I start doing this: function queries only require fields to be indexed but don't require them to be stored right? -Original Message- From: Petersen, Robert [mailto:robert.peter...@mail.rakuten.com] Sent: Tuesday, April 23, 2013 4:39 PM To: sol

RE: Solr 3.6.1: changing a field from stored to not stored

2013-04-23 Thread Petersen, Robert
Good info, Thanks Hoss! I was going to add a more specific fl= parameter to my queries at the same time. Currently I am doing fl=*,score so that will have to be changed. -Original Message- From: Chris Hostetter [mailto:hossman_luc...@fucit.org] Sent: Tuesday, April 23, 2013 4:18 PM T

Re: Solr 3.6.1: changing a field from stored to not stored

2013-04-23 Thread Chris Hostetter
: index? I noticed I am unnecessarily storing some fields in my index and : I'd like to stop storing them without having to 'reindex the world' and : let the changes just naturally percolate into my index as updates come : in the normal course of things. Do you guys think I could get away wit

Re: Update on shards

2013-04-23 Thread Furkan KAMACI
Mark, by the way is there an approximate date about when will Solr 4.3 be ready? 2013/4/23 Mark Miller > We have a 3rd release candidate for 4.3 being voted on now. > > I have never tested this feature with Tomcat - only Jetty. Users have > reported it does not work with Tomcat. That leads one t

Re: Does SolrCloud supports QueryElevationComponent?

2013-04-23 Thread Furkan KAMACI
Thanks. It seems that the wiki should be updated at Lucidworks side. 2013/4/24 Mark Miller > No, I'm fairly sure we added support a year or less back. > > - Mark > > On Apr 23, 2013, at 5:56 PM, Furkan KAMACI wrote: > > > When I read Lucidworks' Solr Guide I saw that: > > > > "Distributed searc

Re: Does SolrCloud supports QueryElevationComponent?

2013-04-23 Thread Mark Miller
No, I'm fairly sure we added support a year or less back. - Mark On Apr 23, 2013, at 5:56 PM, Furkan KAMACI wrote: > When I read Lucidworks' Solr Guide I saw that: > > "Distributed searching does not support the QueryElevationComponent, which > configures the > top results for a given query re

Re: Solr 3.6.1: changing a field from stored to not stored

2013-04-23 Thread Koji Sekiguchi
(13/04/24 7:09), Petersen, Robert wrote: Hi guys, What would happen if I changed a field definition on an existing field in an existing index from stored to not stored? Would solr just party on ignoring the fact that this field's data is stored in the current index? I noticed I am unnecessa

Re: Using Lucene Check Index Tool at SolrCloud?

2013-04-23 Thread Shawn Heisey
On 4/23/2013 4:32 PM, Furkan KAMACI wrote: I want to use SolrCloud at my system. I know that there are many automated operations at SolrCloud one of them includes version system of the documents and so checking for consistency. When I read about documentations I saw that there is a tool called ch

Using Lucene Check Index Tool at SolrCloud?

2013-04-23 Thread Furkan KAMACI
I want to use SolrCloud at my system. I know that there are many automated operations at SolrCloud one of them includes version system of the documents and so checking for consistency. When I read about documentations I saw that there is a tool called check index tool for Lucene. Does it meaningfu

Query specific replica

2013-04-23 Thread Manuel Le Normand
Hello, Since i replicated my shards (i have 2 cores per shard now), I get a remarkable decrease in qTime. I assume it happens since my memory has to split between twice more cores than it used to. In my low qps rate use-case, I use replications as shard backup only (in case one of my servers goes

Re: Autocommit and replication have been slowing down

2013-04-23 Thread Shawn Heisey
On 4/23/2013 3:44 PM, gustavonasu wrote: If I understand well the autoWarmCount is the number of elements used from the cache for new searches. I guess that this isn't the problem because after the commit property increases on the "UPDATE HANDLERS" (admin UI) I can see the new docs in the searche

Re: Listing Priority

2013-04-23 Thread Furkan KAMACI
Let's assume that I have written an update processor and extracted the domain and checked it with my predefined list. What should I do at indexing process and select? 2013/4/15 Alexandre Rafalovitch > You may find the work and code contributions by Jan Høydahl quite > relevant. See the presenta

Re: Book text with chapter line number

2013-04-23 Thread Jack Krupansky
There is no simple, obvious, and direct approach, right out of the box. Sure, you can highlight passages of raw text, right out of the box, but that won't give you chapters, pages, and line numbers. To do all of that, you would have to either: 1. Add chapter, page, and line number as part of t

Does SolrCloud supports QueryElevationComponent?

2013-04-23 Thread Furkan KAMACI
When I read Lucidworks' Solr Guide I saw that: "Distributed searching does not support the QueryElevationComponent, which configures the top results for a given query regardless of Lucene's scoring" is that still true for SolrCloud?

Re: minGramSize

2013-04-23 Thread Walter Underwood
Perhaps he needs different analyzer chains for index and query. Create the edge ngrams when indexing, but not when querying. wunder On Apr 23, 2013, at 2:44 PM, Jack Krupansky wrote: > Why are you bothering to use an Edge/NGram filter if you are setting the > minGramSize to the token size?!! I
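A toy model of the suggestion above, assuming a plain in-memory dictionary in place of a real Lucene index: edge n-grams are produced only when indexing, while the query term is looked up as typed. The function names and the sample documents are hypothetical and are not Solr APIs.

```python
def edge_ngrams(token, min_gram=2, max_gram=15):
    """Return the leading substrings of token, from min_gram to max_gram chars."""
    longest = min(max_gram, len(token))
    return [token[:n] for n in range(min_gram, longest + 1)]

def build_index(docs, min_gram=2, max_gram=15):
    """Index-time chain: lowercase, split on whitespace, emit edge n-grams."""
    index = {}
    for doc_id, text in docs.items():
        for token in text.lower().split():
            for gram in edge_ngrams(token, min_gram, max_gram):
                index.setdefault(gram, set()).add(doc_id)
    return index

def search(index, query):
    """Query-time chain: lowercase only, no n-gramming of the query term."""
    return sorted(index.get(query.lower(), set()))

# Made-up documents, for illustration only.
index = build_index({1: "abcdef", 2: "abxyz"})
hits_abc = search(index, "abc")  # only the document whose token starts with "abc"
hits_bc = search(index, "bc")    # no hit: "bc" is never a leading substring
```

Because the query is not split into grams, typing `abc` looks up exactly `abc`, which is the behaviour the original poster asked for without tying minGramSize to the query length.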

Re: minGramSize

2013-04-23 Thread Jack Krupansky
Why are you bothering to use an Edge/NGram filter if you are setting the minGramSize to the token size?!! I mean, why bother - just skip the Edge/NGram filter and it would give the same result - setting minGramSize to the token size means that there would be only a single gram and it would be i

Re: Autocommit and replication have been slowing down

2013-04-23 Thread gustavonasu
Hi Shawn, Thanks for the answer. If I understand well the autoWarmCount is the number of elements used from the cache for new searches. I guess that this isn't the problem because after the commit property increases on the "UPDATE HANDLERS" (admin UI) I can see the new docs in the searches result

Re: Solr index searcher to lucene index searcher

2013-04-23 Thread Chris Hostetter
: > > . For any query it passes through the search handler and solr finally : > > directs it to lucene Index Searcher. As results are matched and collected : > > as TopDocs in lucene i want to inspect the top K Docs , reorder them by : > > some logic and pass the final TopDocs to solr which solr m

Re: Too many close, count -1

2013-04-23 Thread Chris Hostetter
: Subject: Re: Too many close, count -1 Thanks for the details, nothing jumps out at me, but we're now tracking this in SOLR-4753... https://issues.apache.org/jira/browse/SOLR-4753 -Hoss

Re: Reordered DBQ.

2013-04-23 Thread Yonik Seeley
On Tue, Apr 23, 2013 at 3:51 PM, Marcin Rzewucki wrote: > Recently I noticed a lot of "Reordered DBQs detected" messages in logs. As > far as I checked in logs it could be related with deleting documents, but > not sure. Do you know what is the reason of those messages ? For high throughput index

Book text with chapter line number

2013-04-23 Thread Jason Funk
Hello. I'm trying to figure out if Solr is going to work for a new project that I am wanting to build. At its heart it's a book text searching application. Each book is broken into chapters and each chapter is broken into lines. I want to be able to search these books and return relevant secti

minGramSize

2013-04-23 Thread hassancrowdc
Hi, I want the minGramSize in my ngram filter to be the size of the word passed in the query. How can I do that? If I set the min size to 2 and type abc, it gives me results for ab and bc. I just want abc: whatever the length of my word is, I want that to be the minGramSize. How can I do that?

Re: Using Solr For a Real Search Engine

2013-04-23 Thread Furkan KAMACI
Thanks for the answers. I will go with embedded Jetty for my SolrCloud. If I face with something important I would want to share my experiences with you. 2013/4/23 Shawn Heisey > On 4/23/2013 2:25 PM, Furkan KAMACI wrote: > >> Is there any documentation that explains using Jetty as embedded or n

Re: Using Solr For a Real Search Engine

2013-04-23 Thread Shawn Heisey
On 4/23/2013 2:25 PM, Furkan KAMACI wrote: Is there any documentation that explains using Jetty as embedded or not? I use Solr deployed at Tomcat but after your message I will consider Jetty. If we think about other issues, i.e. when I want to update my Solr jars/wars etc. (this is just a foo

Aw: Re: Support of field variants in solr

2013-04-23 Thread Timo Schmidt
Ok, thanks for this hint. I have two further questions to understand it completely. Setting up a custom request handler makes it easier to avoid all the mapping parameters in the query, but it would also be possible with one request handler and all mapping in the request arguments, right? What about

RE: DirectSolrSpellChecker : vastly varying spellcheck QTime times.

2013-04-23 Thread Dyer, James
If you enable debug-level logging for class org.apache.solr.spelling.SpellCheckCollator, you should get a log message for every collation it tries like this: Collation: will return zzz hits. James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: SandeepM [mailto

Re: Using Solr For a Real Search Engine

2013-04-23 Thread Furkan KAMACI
According to answers here for a huge crawling system and high response time searching SolrCloud system I will try Jetty. If anyone has a good reason they can explain it here, you are right. By the way, Shawn when I read your answer I understand that I should choose embedded Jetty, is that right? 2

Re: Using Solr For a Real Search Engine

2013-04-23 Thread Shawn Heisey
On 4/23/2013 1:52 PM, Furkan KAMACI wrote: Thanks for the answer. If I find something that explains using embedded Jetty or Jetty, or Tomcat it would be nice. 2013/4/23 Mark Miller Tomcat should work just fine in most cases. The downside to Tomcat is that all of the devs generally run Jetty s

Re: Using Solr For a Real Search Engine

2013-04-23 Thread Furkan KAMACI
Is there any documentation that explains using Jetty as embedded or not? I use Solr deployed at Tomcat but after your message I will consider Jetty. If we think about other issues, i.e. when I want to update my Solr jars/wars etc. (this is just a foo example), are there any pros and cons for Tomcat or Je

Re: Using Solr For a Real Search Engine

2013-04-23 Thread Timothy Potter
My 2 cents on this is if you have a choice, just stick with Jetty. This article has some pretty convincing information: http://www.openlogic.com/wazi/bid/257366/Power-Java-based-web-apps-with-Jetty-application-server The folks over at OpenLogic definitely know their stuff when it comes to support

Re: Is there a way to load multiple schema when using zookeeper?

2013-04-23 Thread Furkan KAMACI
I will use Nutch with map reduce to crawl huge data and use SolrCloud for many users with high response time. Actually I wonder about performance issues separating Zookeeper cluster or using them for both Hbase and Solr. 2013/4/23 Shawn Heisey > On 4/23/2013 1:46 PM, Furkan KAMACI wrote: > >> If

Re: Is there a way to load multiple schema when using zookeeper?

2013-04-23 Thread Shawn Heisey
On 4/23/2013 1:46 PM, Furkan KAMACI wrote: If I have a Zookeeper Cluster for my Hbase Cluster already, can I use same Zookeeper cluster for my SolrCloud too? Yes, you can. It is strongly recommended that you use a chroot with the zkHost parameter if you are sharing zookeeper. It's a really goo
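As a rough illustration of the chroot recommendation above: in a ZooKeeper connect string, the chroot path is appended once, after the last host:port pair. The helper below is a hypothetical sketch, and the host names and the `/solr` path are made-up examples rather than values from the thread.

```python
def build_zk_host(servers, chroot=""):
    """Build a zkHost-style connect string, optionally ending in a chroot path.

    Hypothetical helper: the chroot goes once at the end of the whole
    string, not after every server.
    """
    hosts = ",".join(servers)
    if not chroot:
        return hosts
    if not chroot.startswith("/"):
        raise ValueError("chroot must start with '/'")
    return hosts + chroot.rstrip("/")

# Made-up ensemble shared with another service; Solr is kept under /solr.
zk_host = build_zk_host(["zk1:2181", "zk2:2181", "zk3:2181"], "/solr")
```

With a value of this shape passed as zkHost, Solr's znodes would live under `/solr` and stay separate from whatever HBase keeps in the same ensemble.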

Re: Is there a way to load multiple schema when using zookeeper?

2013-04-23 Thread Timothy Potter
Yes - better use of existing resources. In this case, the chroot would be helpful to keep Solr znodes separate from HBase. For the most part, Solr in steady-state doesn't put a lot of stress on Zookeeper, for the most part my zk nodes are snoozing. On Tue, Apr 23, 2013 at 1:46 PM, Furkan KAMACI w

RE: DirectSolrSpellChecker : vastly varying spellcheck QTime times.

2013-04-23 Thread SandeepM
James, Is there a way to determine how many times the collations were tried? Is there a parameter that can be issued that can return this in debug information? This would be very helpful. Appreciate your help with this. Thanks. -- Sandeep -- View this message in context: http://lucene.47206

Too many unique terms

2013-04-23 Thread Manuel Le Normand
Hi there, Looking at one of my shards (about 1M docs) i see lot of unique terms, more than 8M which is a significant part of my total term count. These are very likely useless terms, binaries or other meaningless numbers that come with few of my docs. I am totally fine with deleting them so these t

Re: Using Solr For a Real Search Engine

2013-04-23 Thread Furkan KAMACI
Thanks for the answer. If I find something that explains using embedded Jetty or Jetty, or Tomcat it would be nice. 2013/4/23 Mark Miller > Tomcat should work just fine in most cases. The downside to Tomcat is that > all of the devs generally run Jetty since it's the default. Also, all of > our

Reordered DBQ.

2013-04-23 Thread Marcin Rzewucki
Hi, Recently I noticed a lot of "Reordered DBQs detected" messages in logs. As far as I checked in logs it could be related with deleting documents, but not sure. Do you know what is the reason of those messages ? Apr 23, 2013 1:20:14 AM org.apache.solr.search.SolrIndexSearcher INFO: Opening Sea

Re: Solr index searcher to lucene index searcher

2013-04-23 Thread Joel Bernstein
As Timothy mentioned, Solr has the PostFIlter mechanism, but it's not really suited for ranking/sorting changes. To effect the ranking you'd need to work with the TopScoreDocCollector which Solr does not give you access to. If you're doing distributed search you'd need to account for the ranking al

Re: Is there a way to load multiple schema when using zookeeper?

2013-04-23 Thread Furkan KAMACI
If I have a Zookeeper Cluster for my Hbase Cluster already, can I use same Zookeeper cluster for my SolrCloud too? 2013/4/23 Timothy Potter > Ah cool, thanks for clarifying Chris - some of that multi-config > management stuff gets confusing but much clearer from your > description. > > Cheers, > T

Re: Using Solr For a Real Search Engine

2013-04-23 Thread Mark Miller
Tomcat should work just fine in most cases. The downside to Tomcat is that all of the devs generally run Jetty since it's the default. Also, all of our unit tests run against Jetty - in fact, a specific version of Jetty. Usually, Solr will run fine in other webapps. Many, many users run Solr in

Re: Update on shards

2013-04-23 Thread Mark Miller
We have a 3rd release candidate for 4.3 being voted on now. I have never tested this feature with Tomcat - only Jetty. Users have reported it does not work with Tomcat. That leads one to think it may have a problem in other containers as well. A previous contributor donated a patch that explici

Re: spellcheck: change in behavior and QTime

2013-04-23 Thread SandeepM
I apologize for the length of the previous message. I do see a problem with spellcheck becoming faster (notice QTime). I also see an increase in the number of cache hits if spellcheck=false is run one time followed by the original spellcheck query. Seems like spellcheck=false alters the behavior

Re: Is there a way to load multiple schema when using zookeeper?

2013-04-23 Thread Timothy Potter
Ah cool, thanks for clarifying Chris - some of that multi-config management stuff gets confusing but much clearer from your description. Cheers, Tim On Tue, Apr 23, 2013 at 11:36 AM, Chris Hostetter wrote: > > : Yes, you can effectively chroot all the configs for a collection (to > : support mul

Re: Using Solr For a Real Search Engine

2013-04-23 Thread Furkan KAMACI
At first I will work on 100 Solr nodes and I want to use Tomcat as container and deploy Solr as a war. I just wonder what folks are using for large systems and what kind of problems or benefits they have with their choices. 2013/3/26 Otis Gospodnetic > Hi, > > This question is too open-ended fo

Re: SolrEntityProcessor doesn't grok responseHeader tag in Ancient Solr 1.2 source

2013-04-23 Thread P Williams
Thanks Erik. I remember Solr Flare :) On Tue, Apr 23, 2013 at 11:56 AM, Erik Hatcher wrote: > You might be out of luck with the SolrEntityProcessor I'd recommend > writing a simple little script that pages through /select?q=*:* from the > source Solr and write to the destination Solr. Bac

Re: Update on shards

2013-04-23 Thread Furkan KAMACI
Hi Mark; All in all you say that when 4.3 is tagged at repository (I mean when it is ready) this feature will work for Tomcat too at a stable version? 2013/4/23 Mark Miller > > On Apr 23, 2013, at 2:49 PM, Shawn Heisey wrote: > > > What exactly is the 'request proxying' thing that doesn't wor

Re: dataimporthandler does not distribute documents on solr cloud

2013-04-23 Thread Joel Bernstein
Actually, it is Solr 4.1+ where the implicit router will be used if numShards is not specified. On Tue, Apr 23, 2013 at 2:52 PM, Joel Bernstein wrote: > What version of Solr are you using? In Solr 4.2+ if you don't specify > numShards when creating the collection, the implicit document router w

Re: Update on shards

2013-04-23 Thread Mark Miller
On Apr 23, 2013, at 2:49 PM, Shawn Heisey wrote: > What exactly is the 'request proxying' thing that doesn't work on tomcat? Is > this something different from basic SolrCloud operation where you send any > kind of request to any server and they get directed where they need to go? I > haven'

Re: What is cluster overseer at SolrCloud?

2013-04-23 Thread Furkan KAMACI
Thanks for the explanation. 2013/4/23 Mark Miller > > On Apr 23, 2013, at 2:53 PM, Furkan KAMACI wrote: > > > When I read about SolrCloud wiki there writes something about "cluster > > overseer". What is the role of that at read and write processes? How can > I > > see which node is overseer at

RE: EdgeGram filter

2013-04-23 Thread Markus Jelsma
Always check the javadocs. There's a lot of info to be found there: http://lucene.apache.org/core/4_0_0-BETA/analyzers-common/org/apache/lucene/analysis/miscellaneous/LimitTokenCountFilterFactory.html -Original message- > From:alx...@aim.com > Sent: Tue 23-Apr-2013 21:06 > To: solr-us

Re: What is cluster overseer at SolrCloud?

2013-04-23 Thread Mark Miller
On Apr 23, 2013, at 2:53 PM, Furkan KAMACI wrote: > When I read about SolrCloud wiki there writes something about "cluster > overseer". What is the role of that at read and write processes? How can I > see which node is overseer at my cluster? The Overseer's main responsibility is to write the

Re: EdgeGram filter

2013-04-23 Thread alxsss
Hi, I was unable to find more info about LimitTokenCountFilterFactory in solr wiki. Is there any other place to get thorough description of what it does? Thanks. Alex. -Original Message- From: Jack Krupansky To: solr-user Sent: Tue, Apr 23, 2013 11:36 am Subject: Re: EdgeGr

What is cluster overseer at SolrCloud?

2013-04-23 Thread Furkan KAMACI
When I read about SolrCloud wiki there writes something about "cluster overseer". What is the role of that at read and write processes? How can I see which node is overseer at my cluster?

Re: dataimporthandler does not distribute documents on solr cloud

2013-04-23 Thread Joel Bernstein
What version of Solr are you using? In Solr 4.2+ if you don't specify numShards when creating the collection, the implicit document router will be used. DIH running under the implicit document router most likely would not distribute documents. If this is the case you'll need to recreate the colle

Re: Update on shards

2013-04-23 Thread Shawn Heisey
On 4/23/2013 10:14 AM, Mark Miller wrote: If you use jetty - which you should :) It's what we test with. Tomcat only gets user testing. If you use tomcat, this won't work in 4.2 or 4.2.1, but probably will in 4.3 (we are voting on 4.3 now). No clue on other containers. - Mark On Apr 23, 201

Re: Autocommit and replication have been slowing down

2013-04-23 Thread Shawn Heisey
On 4/23/2013 11:27 AM, gustavonasu wrote: We migrated recently from Solr 1.4 to 3.6.1. In the new version we have noticed that after some hours (around 8) the autocommit is taking more time to be executed. In the new version we have noticed that after some hours the autocommit is takin

Re: Update on shards

2013-04-23 Thread Mark Miller
This request proxying only applies to the read side. The write side forwards updates around, it doesn't proxy requests. - Mark On Apr 23, 2013, at 2:33 PM, Furkan KAMACI wrote: > Sorry, but I want to clear things up in my mind. Is there any > documentation that explains Solr proxying? Is

Re: Update on shards

2013-04-23 Thread Jack Krupansky
Yeah, I'm confused now too. Do all Solr nodes in a distributed cloud really have to run in the same container type?? Why isn't it just raw HTTP for one cloud node to talk to another? I mean each node could/should be on another machine, right? -- Jack Krupansky -Original Message- From:

Re: EdgeGram filter

2013-04-23 Thread Jack Krupansky
Well, you could copy to another field (using copyField) and then have an analyzer with a LimitTokenCountFilterFactory that accepts only 1 token, and then apply the EdgeNGramFilter to that one token. But you would have to query explicitly against that other field. Since you are using dismax, you
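The effect of the chain described above (keep only the first token, then edge-n-gram it) can be modelled in a few lines. This is a plain-Python stand-in and not the Lucene filters themselves; the function name and default gram sizes are assumptions, and the sample text is the 'difficult contents' document from the original question.

```python
def first_token_edge_ngrams(text, min_gram=2, max_gram=15):
    """Model a chain that limits the stream to 1 token, then edge-n-grams it."""
    # Stand-in for limiting the token count to 1: keep the first token only.
    tokens = text.lower().split()[:1]
    grams = set()
    for token in tokens:
        longest = min(max_gram, len(token))
        for n in range(min_gram, longest + 1):
            grams.add(token[:n])  # leading substrings only
    return grams

grams = first_token_edge_ngrams("difficult contents")
matches_dif = "dif" in grams  # prefix of the first word
matches_con = "con" in grams  # prefix of the second word, which was dropped
```

In this model a query for `dif` finds the document through the copied field, while `con` does not, because the second word never reaches the n-gram step.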

Re: Update on shards

2013-04-23 Thread Furkan KAMACI
Sorry, but I want to clear things up in my mind. Is there any documentation that explains Solr proxying? Is it the same thing as this: when I use SolrCloud and send a document to any of the nodes in my cluster, the document will be routed to the leader of the appropriate shard? So you mean I can n

Re: Update on shards

2013-04-23 Thread Mark Miller
The request proxying does not work with tomcat without calling an explicit flush in the code - jetty (which the unit tests are written against) worked without this flush. The flush is added to 4.3. - Mark On Apr 23, 2013, at 2:02 PM, Furkan KAMACI wrote: > Oops, Mark you said: "If you use t

Re: Update on shards

2013-04-23 Thread Furkan KAMACI
Oops, Mark you said: "If you use tomcat, this won't work in 4.2 or 4.2.1" Can you explain more what won't work on Tomcat and what will change in 4.3? 2013/4/23 Mark Miller > If you use jetty - which you should :) It's what we test with. Tomcat only > gets user testing. > > If you use tomcat, thi

Re: SolrEntityProcessor doesn't grok responseHeader tag in Ancient Solr 1.2 source

2013-04-23 Thread Erik Hatcher
You might be out of luck with the SolrEntityProcessor I'd recommend writing a simple little script that pages through /select?q=*:* from the source Solr and write to the destination Solr. Back in the day there was this fun little beast
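The paging script suggested above might be shaped roughly like this. It is a sketch under stated assumptions and not the script from the thread: `fetch_page` and `send_docs` are hypothetical callables standing in for the real HTTP calls (for example a `/select?q=*:*` request with `start` and `rows` on the source, and an update request on the destination), so the loop can be shown without a running Solr.

```python
def migrate(fetch_page, send_docs, rows=100):
    """Copy all documents from a source to a destination, one page at a time.

    fetch_page(start, rows) -> list of documents (empty when exhausted)
    send_docs(docs)         -> writes one page to the destination
    Both callables are hypothetical placeholders for real HTTP requests.
    """
    start = 0
    copied = 0
    while True:
        docs = fetch_page(start, rows)
        if not docs:
            break  # ran past the last document
        send_docs(docs)
        copied += len(docs)
        start += rows  # move the window to the next page
    return copied

# In-memory stand-ins for the two Solr instances, for illustration only.
source = [{"id": str(i)} for i in range(250)]
destination = []
copied = migrate(lambda start, rows: source[start:start + rows],
                 destination.extend)
```

Keeping the transport behind two callables means the same loop works whichever response format the old source instance can actually produce.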

SolrEntityProcessor doesn't grok responseHeader tag in Ancient Solr 1.2 source

2013-04-23 Thread P Williams
Hi, I'd like to use the SolrEntityProcessor to partially migrate an old index to Solr 4.1. The source is pretty old (dated 2006-06-10 16:05:12Z)... maybe Solr 1.2? My data-config.xml is based on the SolrEntityProcessor example a

Autocommit and replication have been slowing down

2013-04-23 Thread gustavonasu
Hi, We migrated recently from Solr 1.4 to 3.6.1. In the new version we have noticed that after some hours (around 8) the autocommit is taking more time to be executed. We configured

Re: Is there a way to load multiple schema when using zookeeper?

2013-04-23 Thread Chris Hostetter
: Yes, you can effectively chroot all the configs for a collection (to : support multiple collections in same ensemble) - see wiki: : http://wiki.apache.org/solr/SolrCloud#Zookeeper_chroot I don't think chroot is suitable for what's being asked about here ... that would completely isolate two cl

Is there a way to load multiple schema when using zookeeper?

2013-04-23 Thread bbarani
I have used multiple schema files by using multiple cores but not sure if I will be able to use multiple schema configuration when integrating SOLR with zookeeper. Can someone please let me know if its possible and if so, how? -- View this message in context: http://lucene.472066.n3.nabble.com

Re: Is there a way to load multiple schema when using zookeeper?

2013-04-23 Thread Timothy Potter
Yes, you can effectively chroot all the configs for a collection (to support multiple collections in same ensemble) - see wiki: http://wiki.apache.org/solr/SolrCloud#Zookeeper_chroot On Tue, Apr 23, 2013 at 11:23 AM, bbarani wrote: > > I have used multiple schema files by using multiple cores but

Re: Solr index searcher to lucene index searcher

2013-04-23 Thread Timothy Potter
Take a look at Solr's DelegatingCollector - this article might be of interest too: http://hokiesuns.blogspot.com/2012/11/using-solrs-postfiltering-to-collect.html On Tue, Apr 23, 2013 at 10:32 AM, parnab kumar wrote: > Hi , > > Timothy,Thanks for pointing out . But i have a specific req

Re: Solr index searcher to lucene index searcher

2013-04-23 Thread Otis Gospodnetic
Perhaps http://search-lucene.com/?q=custom+hits+collector ? Otis -- Solr & ElasticSearch Support http://sematext.com/ On Tue, Apr 23, 2013 at 12:32 PM, parnab kumar wrote: > Hi , > > Timothy,Thanks for pointing out . But i have a specific requirement > . For any query it passes thro

Re: DocValues with docValuesFormat="Disk"

2013-04-23 Thread Mou
Hi, If you use a codec which is not default, you need to download/build lucene codec jars and put them in the solr_home/lib directory and add the codecfactory in the solr config file. Look here for detailed instructions: http://wiki.apache.org/solr/SimpleTextCodecExample Best, Mou -- View this messa

Re: Solr index searcher to lucene index searcher

2013-04-23 Thread parnab kumar
Hi , Timothy,Thanks for pointing out . But i have a specific requirement . For any query it passes through the search handler and solr finally directs it to lucene Index Searcher. As results are matched and collected as TopDocs in lucene i want to inspect the top K Docs , reorder them by

Re: Update on shards

2013-04-23 Thread Mark Miller
If you use jetty - which you should :) It's what we test with. Tomcat only gets user testing. If you use tomcat, this won't work in 4.2 or 4.2.1, but probably will in 4.3 (we are voting on 4.3 now). No clue on other containers. - Mark On Apr 23, 2013, at 10:59 AM, Michael Della Bitta wrote:

EdgeGram filter

2013-04-23 Thread hassancrowdc
Hi, I want to edgeNgram let's say this document that has 'difficult contents' so that if i query (using dismax) q=dif it shows me this result. This is working fine. But now if i search for q=con it gives me this document as well. is there any way to only show this document when i search for 'dif'

Re: Solr index searcher to lucene index searcher

2013-04-23 Thread Timothy Potter
org.apache.solr.search.SolrIndexSearcher On Tue, Apr 23, 2013 at 9:51 AM, parnab kumar wrote: > Hi , > > Can anyone please point out from where a solr search originates > and how it passes to the lucene index searcher and back to solr . I > actually want to know which class in solr

Solr index searcher to lucene index searcher

2013-04-23 Thread parnab kumar
Hi , Can anyone please point out from where a solr search originates and how it passes to the lucene index searcher and back to solr . I actually want to know which class in solr directly calls the lucene Index Searcher . Thanks. Pom

Re: What to test, calculate, measeure for a pre-production version of SolrCloud?

2013-04-23 Thread Timothy Potter
The other thing to keep in the back of your mind as you go through this process is that search is addicting to most organizations. Meaning your Solr solution may quickly become a victim of its own success. The queries we tested before going production 5+ months ago and the queries we handle today a

Re: Update on shards

2013-04-23 Thread Michael Della Bitta
I believe as of 4.2 you can talk to any host in the cloud. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Tue, Apr 23, 2013 at 10:45 AM, Arkadi Colson wrot

Re: Querying only for "+" character causes org.apache.lucene.queryParser.ParseException

2013-04-23 Thread Jorge Luis Betancourt Gonzalez
Hi Jérôme: Thanks for your suggestion, I'll do as you told me to allow searching for these specific tokens. I've also taken into account the option of adding the "quote if length is 1" logic at the application level, but I would like to keep this logic inside of Solr (if possible), this is why

Update on shards

2013-04-23 Thread Arkadi Colson
Hi, Is it correct that when inserting or updating a document in Solr you have to talk to a Solr host where at least one shard of that collection is stored? For select you can talk to any host within the collection.configName? BR, Arkadi

Re: Querying only for "+" character causes org.apache.lucene.queryParser.ParseException

2013-04-23 Thread Jérôme Étévé
If you want to allow your users to search for '+', you can also define '+' as a regular ALPHA character: In config: delimiter_types.txt: # # We let +, # and * be part of normal words. # This is to let c++, c#, c* and R&D as words. # + => ALPHA # => ALPHA * => ALPHA & => ALPHA @ => ALPH

Re: What to test, calculate, measeure for a pre-production version of SolrCloud?

2013-04-23 Thread Jack Krupansky
Another aspect I neglected to mention: Think about distinguishing between "development", "test", and "production" systems - all separately. Your development system is where you try out ideas and experiment - your proof of concept. Your "test" or "pre-production" system is where you verify that

Re: Querying only for "+" character causes org.apache.lucene.queryParser.ParseException

2013-04-23 Thread Jorge Luis Betancourt Gonzalez
Hi Kai: Thanks for your reply. From what I've understood, this logic must be included in my application. Would it be possible to, for instance, use some regular expression at query time in my schema to avoid a query that contains only these characters? for instance + and + would be a good c

Re: Problem with solr, HTTP/Request.php and tomcat.

2013-04-23 Thread Shawn Heisey
On 4/23/2013 7:30 AM, Viviane Ventura wrote: > I'm using solr with tomcat and i need to add a record using > HTTP/Request.php (PEAR). > So, i created a test file with the following code: > > require_once "HTTP/Request.php"; At a quick glance (and not having much experience with PHP) your code lo

Re: What to test, calculate, measeure for a pre-production version of SolrCloud?

2013-04-23 Thread Jack Krupansky
To be clear, there are no solid and reliable prediction rules for Solr - for the simple reason that there are too many non-linear variables - you need to stand up a "proof of concept" system, load it with representative data and execute representative queries and then measure that system. You ca

Re: what is the maximum XML file size to import?

2013-04-23 Thread Alexandre Rafalovitch
DataImportHandler might be a better way to import very large XML files if it can be loaded from Solr-local file system. Regards, Alex. Personal blog: http://blog.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch - Time is the quality of nature that keeps events from h

Re: Querying only for "+" character causes org.apache.lucene.queryParser.ParseException

2013-04-23 Thread Kai Becker
Hi, you need to escape that char in search terms. Special chars are + - ! ( ) { } [ ] ^ " ~ * ? : \ / at the moment. The %2B is just the url encoding, but it will still be a + for Solr, so just put a \ in front of the chars I mentioned. Cheers, Kai Am 23.04.2013 um 15:41 schrieb Jorge Luis Bet
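
The escaping described above could be done on the application side before the query is sent. The following is a minimal Python sketch, assuming the special characters are exactly the ones listed in the message; escape_term and SPECIAL_CHARS are hypothetical names, not part of any Solr client.

```python
from urllib.parse import quote

# The single characters listed in the message as special in query syntax.
SPECIAL_CHARS = set('+-!(){}[]^"~*?:\\/')


def escape_term(term):
    """Put a backslash in front of every special character in a search term."""
    return "".join("\\" + ch if ch in SPECIAL_CHARS else ch for ch in term)


# Escape first, then URL-encode the escaped term for use in the q parameter.
escaped = escape_term("+")
encoded = quote(escaped, safe="")
print(escaped)
print(encoded)
```

The order matters in this sketch: the backslash is added to the raw term, and URL encoding is applied afterwards, since the encoding only protects the character in transit and Solr still sees the decoded value.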
