Re: Solr hangs on distributed updates

2014-12-16 Thread Peter Keegan
> As of 4.10, commits/optimize etc are executed in parallel. Excellent - thanks. On Tue, Dec 16, 2014 at 6:51 AM, Shalin Shekhar Mangar < shalinman...@gmail.com> wrote: > > On Tue, Dec 16, 2014 at 11:34 AM, Peter Keegan > wrote: > > > > > A distributed up

Re: Solr hangs on distributed updates

2014-12-16 Thread Peter Keegan
tPan.534d8d6d.74b0dc51.13a79@airmetal.local%3E On Mon, Dec 15, 2014 at 8:11 PM, Shalin Shekhar Mangar < shalinman...@gmail.com> wrote: > > On Mon, Dec 15, 2014 at 8:41 PM, Peter Keegan > wrote: > > > > If a timeout occurs, does the distributed update then go to the next >

Re: Solr hangs on distributed updates

2014-12-15 Thread Peter Keegan
uests only. The socket and > connection timeout inside the shardHandlerFactory section apply for > inter-shard search requests. > > On Fri, Dec 12, 2014 at 8:38 PM, Peter Keegan > wrote: > > > Btw, are the following timeouts still supported in solr.xml, and do th

Re: Solr hangs on distributed updates

2014-12-15 Thread Peter Keegan
I added distribUpdateConnTimeout and distribUpdateSoTimeout to solr.xml and the commit did timeout.(btw, is there any way to view solr.xml in the admin console?). Also, although we do have an init.d start/stop script for Solr, the 'stop' command was not executed during shutdown because there was n

Re: Solr hangs on distributed updates

2014-12-12 Thread Peter Keegan
The AMIs are Red Hat (not Amazon's) and the instances are properly sized for the environment (t1.micro for ZK, m3.xlarge for Solr). I do plan to add hooks for a clean shutdown of Solr when the VM is shut down, but if Solr takes too long, AWS may clobber it anyway. One frustrating part of auto scali

Re: Solr hangs on distributed updates

2014-12-12 Thread Peter Keegan
#x27;s live node is removed from ZK (after session expiry). I think > most of these issues are because of the lack of timeouts. Just add them and > if there are more problems, we can discuss more. > > On Fri, Dec 12, 2014 at 8:14 PM, Peter Keegan > wrote: > > > No, I w

Re: Solr hangs on distributed updates

2014-12-12 Thread Peter Keegan
Btw, are the following timeouts still supported in solr.xml, and do they only apply to distributed search? ${socketTimeout:0} ${connTimeout:0} Thanks, Peter On Fri, Dec 12, 2014 at 3:14 PM, Peter Keegan wrote: > No, I wasn't aware of these. I will give that a try. If I

Re: Solr hangs on distributed updates

2014-12-12 Thread Peter Keegan
.xml? These are the timeouts used for > inter-shard update requests. > > On Fri, Dec 12, 2014 at 2:20 PM, Peter Keegan > wrote: > > > We are running SolrCloud in AWS and using their auto scaling groups to > spin > > up new Solr replicas when CPU utilization exceeds a t

Solr hangs on distributed updates

2014-12-12 Thread Peter Keegan
We are running SolrCloud in AWS and using their auto scaling groups to spin up new Solr replicas when CPU utilization exceeds a threshold for a period of time. All is well until the replicas are terminated when CPU utilization falls below another threshold. What happens is that index updates sent t

Re: Solr exceptions during batch indexing

2014-11-07 Thread Peter Keegan
failing packet one at a time and report it. > >> > >> I'd really like better reporting back, but we haven't gotten there yet. > >> > >> Best, > >> Erick > >> > >> On Fri, Nov 7, 2014 at 8:25 AM, Peter Keegan > wrote: > >>>

Solr exceptions during batch indexing

2014-11-07 Thread Peter Keegan
How are folks handling Solr exceptions that occur during batch indexing? Solr stops parsing the docs stream when an error occurs (e.g. a doc with a missing mandatory field), and stops indexing the batch. The bad document is not identified, so it would be hard for the client to recover by skipping o

Re: Ideas for debugging poor SolrCloud scalability

2014-10-31 Thread Peter Keegan
Erickson wrote: > Internally, the docs are batched up into smaller buckets (10 as I > remember) and forwarded to the correct shard leader. I suspect that's > what you're seeing. > > Erick > > On Fri, Oct 31, 2014 at 12:20 PM, Peter Keegan > wrote: > > Regarding ba

Re: Ideas for debugging poor SolrCloud scalability

2014-10-31 Thread Peter Keegan
Regarding batch indexing: When I send batches of 1000 docs to a standalone Solr server, the log file reports "(1000 adds)" in LogUpdateProcessor. But when I send them to the leader of a replicated index, the leader log file reports much smaller numbers, usually "(12 adds)". Why do the batches appea

Re: QParserPlugin question

2014-10-24 Thread Peter Keegan
Thanks for the advice. I've moved this query rewriting logic (not really business logic) to a SearchComponent and will leave the custom query parser to deal with the keyword (q=) related aspects of the query. In my case, the latter is mostly dealing with the presence of wildcard characters. Peter

Re: QParserPlugin question

2014-10-22 Thread Peter Keegan
I meant to say: If the keyword is *:* (MachAllDocsQuery)... On Wed, Oct 22, 2014 at 2:17 PM, Peter Keegan wrote: > It's for an optimization. If the keyword is 'match all docs', I want to > remove a custom PostFilter from the query and change the sort parameters > (so th

Re: QParserPlugin question

2014-10-22 Thread Peter Keegan
It's for an optimization. If the keyword is 'match all docs', I want to remove a custom PostFilter from the query and change the sort parameters (so the app doesn't have to do it). It looks like the responseHeader is displaying the 'originalParams', which are immutable. On Wed, Oct 22, 2014 at 2:

QParserPlugin question

2014-10-22 Thread Peter Keegan
I have a custom query parser that modifies the filter query list based on the keyword query. This works, but the 'fq' list in the responseHeader contains the original filter list. The debugQuery output does display the modified filter list. Is there a way to change the responseHeader? I could proba

Re: Does Solr support this?

2014-10-16 Thread Peter Keegan
I'm doing something similar with a custom search component. See SOLR-6502 On Thu, Oct 16, 2014 at 8:14 AM, Upayavira wrote: > Nope, not yet. > > Someone did propose a JavascriptRequestHandler or such, which would > allow you to code such things i

Re: Question about filter cache size

2014-10-03 Thread Peter Keegan
> it will be cached as hidden:true and then inverted Inverted at query time, so for best query performance use fq=hidden:false, right? On Fri, Oct 3, 2014 at 3:57 PM, Yonik Seeley wrote: > On Fri, Oct 3, 2014 at 3:42 PM, Peter Keegan > wrote: > > Say I have a boolean field name

Question about filter cache size

2014-10-03 Thread Peter Keegan
Say I have a boolean field named 'hidden', and less than 1% of the documents in the index have hidden=true. Do both these filter queries use the same docset cache size? : fq=hidden:false fq=!hidden:true Peter

Re: MaxScore

2014-09-17 Thread Peter Keegan
See if SOLR-5831 helps. Peter On Tue, Sep 16, 2014 at 11:32 PM, William Bell wrote: > What we need is a function like scale(field,min,max) but only operates on > the results that come back from the search results. > > scale() takes the min, max

Re: Edismax mm and efficiency

2014-09-10 Thread Peter Keegan
ith two > requests. Would you consider contributing the QueryComponent? > > wunder > Walter Underwood > wun...@wunderwood.org > http://observer.wunderwood.org/ > > > On Sep 10, 2014, at 3:47 AM, Peter Keegan wrote: > > > I implemented a custom QueryComponent th

Re: Edismax mm and efficiency

2014-09-10 Thread Peter Keegan
I implemented a custom QueryComponent that issues the edismax query with mm=100%, and if no results are found, it reissues the query with mm=1. This doubled our query throughput (compared to mm=1 always), as we do some expensive RankQuery processing. For your very long student queries, mm=100% woul

Re: ExternalFileFieldReloader and commit

2014-08-06 Thread Peter Keegan
; Did you try to use > org.apache.solr.search.function.FileFloatSource.ReloadCacheRequestHandler ? > Let's me know if you need help with it. > As a workaround you can reload the core via REST or click a button at > SolrAdmin, your questions are welcome. > > > > On Wed, Aug 6, 2014 at 4:02 PM, Peter

Re: ExternalFileFieldReloader and commit

2014-08-06 Thread Peter Keegan
- > http://soleami.com/blog/comparing-document-classification-functions-of- > lucene-and-mahout.html > > > (2014/08/05 22:34), Peter Keegan wrote: > >> When there are multiple 'external file field' files available, Solr will >> reload the last one (lexico

ExternalFileFieldReloader and commit

2014-08-05 Thread Peter Keegan
When there are multiple 'external file field' files available, Solr will reload the last one (lexicographically) with a commit, but only if changes were made to the index. Otherwise, it skips the reload and logs: "No uncommitted changes. Skipping IW.commit." Has anyone else noticed this? It seems

Re: Question about ReRankQuery

2014-07-23 Thread Peter Keegan
> Search Engineer at Heliosearch > > > On Wed, Jul 23, 2014 at 11:37 AM, Peter Keegan > > wrote: > > > See http://heliosearch.org/solrs-new-re-ranking-feature/ > > > > > > On Wed, Jul 23, 2014 at 11:27 AM, Erick Erickson < > erickerick...@gmail.com

Re: Question about ReRankQuery

2014-07-23 Thread Peter Keegan
n > AND > and using the original sort? > > At the end, you have your original list in it's original order, with > (potentially) some > documents removed that don't satisfy the secondary query. > > Or I'm missing the boat entirely. > > Best, > Erick >

Question about ReRankQuery

2014-07-23 Thread Peter Keegan
I'm looking at how 'ReRankQuery' works. If the main query has a Sort criteria, it is only used to sort the first pass results. The QueryScorer used in the second pass only reorders the ScoreDocs based on score and docid, but doesn't use the original Sort fields. If the Sort criteria is 'score desc,

Re: Question about solrcloud recovery process

2014-07-03 Thread Peter Keegan
Aha, you are right wrdrvf! The query is forwarded to any of the active shards (I saw the query alternate between both of mine). Nice feature. Also, looking at 'ClusterStateAwarePingRequestHandler' (which I downloaded from www.manning.com/SolrinAction), it is checking zookeeper to see if the logical

Re: Question about solrcloud recovery process

2014-07-03 Thread Peter Keegan
d. If this is the issue I think it is, it should only be an > issue when you directly query recovery node. > > The CloudSolrServer client works around this issue as well. > > -- > Mark Miller > about.me/markrmiller > > On July 3, 2014 at 8:42:48 AM, Peter Keegan (peterlkee

Question about solrcloud recovery process

2014-07-03 Thread Peter Keegan
I bring up a new Solr node with no index and watch the index being replicated from the leader. The index size is 12G and the replication takes about 6 minutes, according to the replica log (from 'Starting recovery process' to 'Finished recovery process). However, shortly after the replication begin

Custom QueryComponent to rewrite dismax query

2014-06-10 Thread Peter Keegan
We are using the 'edismax' query parser for its many benefits over the standard Lucene parser. For queries with more than 5 or 6 keywords (which is a lot for our typical user), the recall can be very high (sometimes matching 75% or more of the documents). This high recall, when coupled with some cu

Autoscaling Solr instances in AWS

2014-05-20 Thread Peter Keegan
We are running Solr 4.6.1 in AWS: - 2 Solr instances (1 shard, 1 leader, 1 replica) - 1 CloudSolrServer SolrJ client updating the index. - 3 Zookeepers The Solr instances are behind a load balanceer and also in an auto scaling group. The ScaleUpPolicy will add up to 9 additional instances (replica

Re: Distributed commits in CloudSolrServer

2014-04-16 Thread Peter Keegan
>Are distributed commits also done in parallel across shards? I meant 'sequentially' across shards. On Wed, Apr 16, 2014 at 9:08 AM, Peter Keegan wrote: > Are distributed commits also done in parallel across shards? > > Peter > > > On Tue, Apr 15, 2014 at 3:50 PM,

Re: Distributed commits in CloudSolrServer

2014-04-16 Thread Peter Keegan
Are distributed commits also done in parallel across shards? Peter On Tue, Apr 15, 2014 at 3:50 PM, Mark Miller wrote: > Inline responses below. > -- > Mark Miller > about.me/markrmiller > > On April 15, 2014 at 2:12:31 PM, Peter Keegan (peterlkee...@gmail.com) > wrote: &

Distributed commits in CloudSolrServer

2014-04-15 Thread Peter Keegan
I have a SolrCloud index, 1 shard, with a leader and one replica, and 3 ZKs. The Solr indexes are behind a load balancer. There is one CloudSolrServer client updating the indexes. The index schema includes 3 ExternalFileFields. When the CloudSolrServer client issues a hard commit, I observe that th

Re: Configurable collectors for custom ranking

2014-03-07 Thread Peter Keegan
o watch SOLR-5536 > to see an example of this. > > Joel > > > > > > > > > > > > > Joel Bernstein > Search Engineer at Heliosearch > > > On Mon, Dec 23, 2013 at 4:03 PM, Peter Keegan >wrote: > > > Hi Joel, > > > > Could

Getting index schema in SolrCloud mode

2014-02-03 Thread Peter Keegan
I'm indexing data with a SolrJ client via SolrServer. Currently, I parse the schema returned from a HttpGet on: localhost:8983/solr/collection/schema/fields What is the recommended way to read the schema with CloudSolrServer? Can it be done with a single HttpGet to a ZK server? Thanks, Peter

Re: How to override rollback behavior in DIH

2014-01-17 Thread Peter Keegan
nt Group > (615) 213-4311 > > > -Original Message----- > From: pkeegan01...@gmail.com [mailto:pkeegan01...@gmail.com] On Behalf Of > Peter Keegan > Sent: Friday, January 17, 2014 7:51 AM > To: solr-user@lucene.apache.org > Subject: Re: How to override rollback behavior in D

Re: How to override rollback behavior in DIH

2014-01-17 Thread Peter Keegan
s been some time since I looked at the code so I don't know if this > works with data source. Worth a try I guess. > > On Fri, Jan 17, 2014 at 7:20 PM, Peter Keegan > wrote: > > Following up on this a bit - my main index is updated by a SolrJ client > in > > another

Re: How to override rollback behavior in DIH

2014-01-17 Thread Peter Keegan
it a bit harder to detect the failure via the admin interface. Thanks, Peter On Tue, Jan 14, 2014 at 11:12 AM, Peter Keegan wrote: > I have a custom data import handler that creates an ExternalFileField from > a source that is different from the main index. If the import fails (in m

Re: leading wildcard characters

2014-01-14 Thread Peter Keegan
one is somehow old. Since its description is stale, do you want > to open a new one? > > Ahmet > > > On Friday, January 10, 2014 6:12 PM, Peter Keegan > wrote: > Removing ReversedWildcardFilterFactory had no effect. > > > > On Fri, Jan 10, 2014

How to override rollback behavior in DIH

2014-01-14 Thread Peter Keegan
I have a custom data import handler that creates an ExternalFileField from a source that is different from the main index. If the import fails (in my case, a connection refused in URLDataSource), I don't want to roll back any uncommitted changes to the main index. However, this seems to be the defa

Re: leading wildcard characters

2014-01-10 Thread Peter Keegan
Removing ReversedWildcardFilterFactory had no effect. On Fri, Jan 10, 2014 at 10:48 AM, Ahmet Arslan wrote: > Hi Peter, > > Can you remove any occurrence of ReversedWildcardFilterFactory in > schema.xml? (even if you don't use it) > > Ahmet > > > > On Frid

leading wildcard characters

2014-01-10 Thread Peter Keegan
How do you disable leading wildcards in 4.X? The setAllowLeadingWildcard method is there in the parser, but nothing references the getter. Also, the Edismax parser always enables it and provides no way to override. Thanks, Peter

Re: Zookeeper as Service

2014-01-09 Thread Peter Keegan
There's also: http://www.tanukisoftware.com/ On Thu, Jan 9, 2014 at 11:18 AM, Nazik Huq wrote: > > > From your email I gather your main concern is starting zookeeper on server > startups. > > You may want to look at these non-native service oriented options too: > Create a script( cmd or bat)

Re: Function query matching

2014-01-06 Thread Peter Keegan
: The bottom line for Peter is still the same: using scale() wrapped arround : a function/query does involve a computing hte results for every document, : and that is going to scale linearly as the size of hte index grows -- but : it it is *only* because of the scale function. Another problem with

Re: how to include result ordinal in response

2014-01-04 Thread Peter Keegan
n response > > > On Fri, Jan 3, 2014, at 10:00 PM, Peter Keegan wrote: > >> Is there a simple way to output the result number (ordinal) with each >> returned document using the 'fl' parameter? This would be useful when >> visually comparing the results from 2 q

how to include result ordinal in response

2014-01-03 Thread Peter Keegan
Is there a simple way to output the result number (ordinal) with each returned document using the 'fl' parameter? This would be useful when visually comparing the results from 2 queries. Thanks, Peter

Re: Configurable collectors for custom ranking

2013-12-26 Thread Peter Keegan
t; between a PostFilter and value source. So you may want to watch SOLR-5536 > to see an example of this. > > Joel > > > > > > > > > > > > > Joel Bernstein > Search Engineer at Heliosearch > > > On Mon, Dec 23, 2013 at 4:03 PM, Peter Keegan

Re: Configurable collectors for custom ranking

2013-12-23 Thread Peter Keegan
he value source a > handle called "score" a compound function call will look like this: > sum(score(), field(x)) > > Joel > > > > > > > > > > > On Thu, Dec 12, 2013 at 9:58 AM, Peter Keegan >wrote: > > > Regarding my original goal, whi

Re: Configurable collectors for custom ranking

2013-12-19 Thread Peter Keegan
, Peter On Thu, Dec 19, 2013 at 9:51 AM, Peter Keegan wrote: > In order to size the PriorityQueue, the result window size for the query > is needed. This has been computed in the SolrIndexSearcher and available > in: QueryCommand.getSupersetMaxDoc(), but doesn't seem to be avail

Re: Configurable collectors for custom ranking

2013-12-19 Thread Peter Keegan
ould be in Solr 4.7. > > > > > > > > > > On Thu, Dec 12, 2013 at 12:54 PM, Peter Keegan >wrote: > > > This is pretty cool, and worthy of adding to Solr in Action (v2) and the > > other books. With function queries, flexible filter processing and > c

Re: Configurable collectors for custom ranking

2013-12-12 Thread Peter Keegan
; sum(score(), field(x)) > > Joel > > > > > > > > > > > On Thu, Dec 12, 2013 at 9:58 AM, Peter Keegan >wrote: > > > Regarding my original goal, which is to perform a math function using the > > scaled score and a field value, and sort on t

Re: Configurable collectors for custom ranking

2013-12-12 Thread Peter Keegan
elieve. They might apply to 4.3. >> I think as long you have the finish method that's all you'll need. If you >> can get this working it would be excellent if you could donate back the >> Scale PostFilter. >> >> >> On Wed, Dec 11, 2013 at 3:36 PM, Peter

Re: Configurable collectors for custom ranking

2013-12-11 Thread Peter Keegan
you have the finish method that's all you'll need. If you > can get this working it would be excellent if you could donate back the > Scale PostFilter. > > > On Wed, Dec 11, 2013 at 3:36 PM, Peter Keegan >wrote: > > > This is what I was looking for, but the Delegati

Re: Configurable collectors for custom ranking

2013-12-11 Thread Peter Keegan
ng and > scaling the score from the score map. If the document is not in the score > map then send down 0. > > You'll have setup a dummy scorer to feed to lower collectors. The > CollapsingQParserPlugin has an example of how to do this. > > > > > On Wed, Dec 1

Re: Configurable collectors for custom ranking

2013-12-11 Thread Peter Keegan
ield eff = (ExternalFileField)field.getType(); fieldValues = eff.getFileFloatSource(field, dataDir); And then read the values in 'setNextReader' Peter On Wed, Dec 11, 2013 at 2:05 PM, Peter Keegan wrote: > Hi Joel, > > I thought about using a PostFilter, but the problem is that the 'scal

Re: Configurable collectors for custom ranking

2013-12-11 Thread Peter Keegan
lter rather > then extending the TopDocsCollector. Is there a reason why a PostFilter > won't work for you? > > Joel > > > On Tue, Dec 10, 2013 at 3:24 PM, Peter Keegan >wrote: > > > Quick question: > > In the context of a custom collector, how does one get

Re: Configurable collectors for custom ranking

2013-12-10 Thread Peter Keegan
Quick question: In the context of a custom collector, how does one get the values of a field of type 'ExternalFileField'? Thanks, Peter On Tue, Dec 10, 2013 at 1:18 PM, Peter Keegan wrote: > Hi Joel, > > This is related to another thread on function query matching ( > h

Re: Configurable collectors for custom ranking

2013-12-10 Thread Peter Keegan
ote: > Hi Peter, > > I've been meaning to revisit configurable ranking collectors, but I haven't > yet had a chance. It's on the shortlist of things I'd like to tackle > though. > > > > On Fri, Dec 6, 2013 at 4:17 PM, Peter Keegan > wrote: >

Re: Function query matching

2013-12-07 Thread Peter Keegan
>But for your specific goal Peter: Yes, if the whole point of a function >you have is to wrap generated a "scaled" score of your base $qq, ... Thanks for the confirmation, Chris. So, to do this efficiently, I think I need to implement a custom Collector that performs the scaling (and other mat

Configurable collectors for custom ranking

2013-12-06 Thread Peter Keegan
I looked at SOLR-4465 and SOLR-5045, where it appears that there is a goal to be able to do custom sorting and ranking in a PostFilter. So far, it looks like only custom aggregation can be implemented in PostFilter (5045). Custom sorting/ranking can be done in a pluggable collector (4465), but this

Re: Function query matching

2013-12-06 Thread Peter Keegan
In my previous posting, I said: "Subsequent calls to ScaleFloatFuntion.getValues bypassed 'createScaleInfo and added ~0 time." These subsequent calls are for the remaining segments in the index reader (21 segments). Peter On Fri, Dec 6, 2013 at 2:10 PM, Peter Keegan wr

Re: Function query matching

2013-12-06 Thread Peter Keegan
this possible PostFilter solution, I can see how to collect > > all the hits and recompute the scores in a PostFilter, after all the hits > > have been collected (for scaling). Now, I can't see how to get the custom > > doc/score values back into the main query's H

Re: Function query matching

2013-12-02 Thread Peter Keegan
advice? Thanks, Peter On Fri, Nov 29, 2013 at 9:18 AM, Peter Keegan wrote: > Instead of using a function query, could I use the edismax query (plus > some low cost filters not shown in the example) and implement the > scale/sum/product computation in a PostFilter? Is the query'

Re: Function query matching

2013-11-29 Thread Peter Keegan
Instead of using a function query, could I use the edismax query (plus some low cost filters not shown in the example) and implement the scale/sum/product computation in a PostFilter? Is the query's maxScore available there? Thanks, Peter On Wed, Nov 27, 2013 at 1:58 PM, Peter Keegan

Re: Function query matching

2013-11-27 Thread Peter Keegan
Although the 'scale' is a big part of it, here's a closer breakdown. Here are 4 queries with increasing functions, and theei response times (caching turned off in solrconfig): 100 msec: select?q={!edismax v='news' qf='title^2 body'} 135 msec: select?qq={!edismax v='news' qf='title^2 body'}q={!fun

Re: Function query matching

2013-11-27 Thread Peter Keegan
field(myfield)))&fq={!query v=$qq} Is there any way to speed this up? Would writing a custom function query that compiled all the function queries together be any faster? Thanks, Peter On Mon, Nov 11, 2013 at 1:31 PM, Peter Keegan wrote: > Thanks > > > On Mon, Nov 11, 2013 at

Re: Function query matching

2013-11-11 Thread Peter Keegan
Thanks On Mon, Nov 11, 2013 at 11:46 AM, Yonik Seeley wrote: > On Mon, Nov 11, 2013 at 11:39 AM, Peter Keegan > wrote: > > fq=$qq > > > > What is the proper syntax? > > fq={!query v=$qq} > > -Yonik > http://heliosearch.com -- making solr shine >

Re: Function query matching

2013-11-11 Thread Peter Keegan
func}sum(product(0.75,$scaledQ),product(0.25,field(myfield)))&fq=$qq What is the proper syntax? Thanks, Peter On Thu, Nov 7, 2013 at 2:16 PM, Peter Keegan wrote: > I'm trying to used a normalized score in a query as I described in a > recent thread titled "Re: How to get

Re: Function query matching

2013-11-07 Thread Peter Keegan
7, 2013, at 7:26 AM, Erik Hatcher wrote: > > > Function queries score (all) documents, but don't filter them. All > documents effectively match a function query. > > > > Erik > > > > On Nov 7, 2013, at 1:48 PM, Peter Keegan wrote: > > > >> Why does this function query return docs that don't match the embedded > >> query? > >> select?qq=text:news&q={!func}sum(query($qq),0) > > > >

Function query matching

2013-11-07 Thread Peter Keegan
Why does this function query return docs that don't match the embedded query? select?qq=text:news&q={!func}sum(query($qq),0)

Re: Data Import Handler

2013-11-06 Thread Peter Keegan
I've done this by adding an attribute to the entity element (e.g. myconfig="myconfig.xml"), and reading it in the 'init' method with context.getResolvedEntityAttribute("myconfig"). Peter On Wed, Nov 6, 2013 at 8:25 AM, Ramesh wrote: > Hi Folks, > > > > Can anyone suggest me how can customize d

Re: How to get similarity score between 0 and 1 not relative score

2013-11-01 Thread Peter Keegan
There's another use case for scaling the score. Suppose I want to compute a custom score based on the weighted sum of: - product(0.75, relevance score) - product(0.25, value from another field) For this to work, both fields must have values between 0-1, for example. Toby's example using the scale

How to reinitialize a solrcloud replica

2013-10-25 Thread Peter Keegan
I'm running 4.3 in solrcloud mode and trying to test index recovery, but it's failing. I have one shard, 2 replicas: Leader: 10.159.8.105 Replica: 10.159.6.73 To test, I stopped the replica, deleted the 'data' directory and restarted solr. Here is the replica's logging: INFO - 2013-10-25 12:19:4

Re: Solr timeout after reboot

2013-10-21 Thread Peter Keegan
I found this warming to be especially necessary after starting an instance of those m3.xlarge servers, else the response times for the first minutes was terrible. Peter On Mon, Oct 21, 2013 at 8:39 AM, François Schiettecatte < fschietteca...@gmail.com> wrote: > To put the file data into file sy

Re: Solr timeout after reboot

2013-10-21 Thread Peter Keegan
Have you tried this old trick to warm the FS cache? cat ...//data/index/* >/dev/null Peter On Mon, Oct 21, 2013 at 5:31 AM, michael.boom wrote: > Thank you, Otis! > > I've integrated the SPM on my Solr instances and now I have access to > monitoring data. > Could you give me some hints on whic

Re: limiting deep pagination

2013-10-17 Thread Peter Keegan
arihinde Michael Sokolov < > msoko...@safaribooksonline.com> adlı kullanıcı şöyle yazdı: > > On 10/8/13 6:51 PM, Peter Keegan wrote: > >> > >> Is there a way to configure Solr 'defaults/appends/invariants' such that > >> the product of the 'st

limiting deep pagination

2013-10-08 Thread Peter Keegan
Is there a way to configure Solr 'defaults/appends/invariants' such that the product of the 'start' and 'rows' parameters doesn't exceed a given value? This would be to prevent deep pagination. Or would this require a custom requestHandler? Peter

Re: How to get values of external file field(s) in Solr query?

2013-10-03 Thread Peter Keegan
In 4.3, frange query using an external file works for both q and fq. The Solr wiki and SIA both state that ExternalFileField does not support searching. Was the search/filter capability added recently, or is it not supported? Thanks, Peter On Wed, Jun 26, 2013 at 4:59 PM, Upayavira wrote: > T

Re: Cross index join query performance

2013-09-30 Thread Peter Keegan
ades as > you are seeing. > > Is there a way you can reduce the number of values returned by the inner > query? > > As Joel mentions, those other joins are attempts to find other ways to > work with this limitation. > > Upayavira > > On Fri, Sep 27, 2013, at 09:44 PM

Re: Cross index join query performance

2013-09-27 Thread Peter Keegan
in the fromIndex the > standard join will be faster. > > > On Wed, Sep 25, 2013 at 3:39 PM, Peter Keegan >wrote: > > > I forgot to mention - this is Solr 4.3 > > > > Peter > > > > > > > > On Wed, Sep 25, 2013 at 3:38 PM, Peter Keegan >

Re: Cross index join query performance

2013-09-25 Thread Peter Keegan
I forgot to mention - this is Solr 4.3 Peter On Wed, Sep 25, 2013 at 3:38 PM, Peter Keegan wrote: > I'm doing a cross-core join query and the join query is 30X slower than > each of the 2 individual queries. Here are the queries: > > Main query: http://localhost:8983/solr/m

Cross index join query performance

2013-09-25 Thread Peter Keegan
I'm doing a cross-core join query and the join query is 30X slower than each of the 2 individual queries. Here are the queries: Main query: http://localhost:8983/solr/mainindex/select?q=title:java QTime: 5 msec hit count: 1000 Sub query: http://localhost:8983/solr/subindex/select?q=+fld1:[0.1 TO

Re: A question about attaching shards to load balancers

2013-01-30 Thread Peter Keegan
Aren't you concerned about having a single point of failure with this setup? On Wed, Jan 30, 2013 at 10:38 AM, Michael Ryan wrote: > From a performance point of view, I can't imagine it mattering. In our > setup, we have a dedicated Solr server that is not a shard that takes > incoming requests

Re: Improving performance for use-case where large (200) number of phrase queries are used?

2012-10-25 Thread Peter Keegan
ing the same thing? > > On Wed, Oct 24, 2012 at 1:20 PM, Peter Keegan > wrote: > > Could you index your 'phrase tags' as single tokens? Then your phrase > > queries become simple TermQuerys. > > >> > >> 5) *This is my current favorite*: stop toke

Re: Improving performance for use-case where large (200) number of phrase queries are used?

2012-10-24 Thread Peter Keegan
Could you index your 'phrase tags' as single tokens? Then your phrase queries become simple TermQuerys. On Wed, Oct 24, 2012 at 12:26 PM, Robert Muir wrote: > On Wed, Oct 24, 2012 at 11:09 AM, Aaron Daubman wrote: > > Greetings, > > > > We have a solr instance in use that gets some perhaps atyp

Re: Anyone using mmseg analyzer in solr multi core?

2012-10-09 Thread Peter Keegan
We're using MMSeg with Lucene, but not Solr. Since each SolrCore is independent, I'm not sure how you can avoid each having a copy of the dictionary, unless you modified MMSeg to use shared memory. Or, maybe I missing something. On Mon, Oct 8, 2012 at 3:37 AM, liyun wrote: > Hi all, > Is anybody

Re: How to plug a new ANTLR grammar

2011-09-14 Thread Peter Keegan
er, at which stage do you use lucene analyzers > on the query? After it was parsed into the tree, or before we start > processing the query string? > > Thanks! > > Roman > > > > > > On Tue, Sep 13, 2011 at 10:14 PM, Peter Keegan > wrote: > > Roman, > >

Re: How to plug a new ANTLR grammar

2011-09-13 Thread Peter Keegan
Roman, I'm not familiar with the contrib, but you can write your own Java code to create Query objects from the tree produced by your lexer and parser something like this: StandardLuceneGrammarLexer lexer = new ANTLRReaderStream(new StringReader(queryString)); CommonTokenStream tokens = new Commo

Re: performance crossover between single index and sharding

2011-08-04 Thread Peter Keegan
We have 16 shards on 4 physical servers. Shard size was determined by measuring query response times as a function of doc count. Multiple shards per server provides parallelism. In a VM environment, I would lean towards 1 shard per VM (with 1/4 the RAM). We implemented our own distributed search (p

Re: Localized alphabetical order

2011-04-22 Thread Peter Keegan
On Fri, Apr 22, 2011 at 12:33 PM, Ben Preece wrote: > As someone who's new to Solr/Lucene, I'm having trouble finding information > on sorting results in localized alphabetical order. I've ineffectively > searched the wiki and the mail archives. > > I'm thinking for example about Hawai'ian, where

Re: Info about Debugging SOLR in Eclipse

2011-03-17 Thread Peter Keegan
The instructions refer to the 'Run configuration' menu. Did you try 'Debug configurations'? On Thu, Mar 17, 2011 at 3:27 PM, Peter Keegan wrote: > Can you use jetty? > > > http://www.lucidimagination.com/developers/articles/setting-up-apache-solr-in-eclipse >

Re: Info about Debugging SOLR in Eclipse

2011-03-17 Thread Peter Keegan
Can you use jetty? http://www.lucidimagination.com/developers/articles/setting-up-apache-solr-in-eclipse On Thu, Mar 17, 2011 at 12:17 PM, Geeta Subramanian < gsubraman...@commvault.com> wrote: > Hi, > > Can some please let me know the steps on how can I debug the solr code in > my eclipse? > > I

CapitalizationFilter

2010-12-29 Thread Peter Keegan
I was looking at 'CapitalizationFilter' and noticed that the 'incrementToken' method splits words at ' ' (space) and '.' (period). I'm curious as to why the period is treated as a word separator? This could cause unexpected results, for example: "Hello There My Name Is Dr. Watson" ---> "Hello Ther

Re: Does anyone notice this site?

2010-10-25 Thread Peter Keegan
fwiw, our proxy server has blocked this site for malicious content. Peter On Mon, Oct 25, 2010 at 1:25 PM, Grant Ingersoll wrote: > > On Oct 25, 2010, at 12:54 PM, scott chu wrote: > > > I happen to bump into this site: http://www.solr.biz/ > > > > They said they are also developing a search eng

LuceneRevolution - NoSQL: A comparison

2010-10-11 Thread Peter Keegan
I listened with great interest to Grant's presentation of the NoSQL comparisons/alternatives to Solr/Lucene. It sounds like the jury is still out on much of this. Here's a use case that might favor using a NoSQL alternative for storing 'stored fields' outside of Lucene. When Solr does a distribute

Re: Range queries

2009-06-16 Thread Peter Keegan
How about this: x:[5 TO 8] AND x:{0 TO 8} On Tue, Jun 16, 2009 at 1:16 PM, Otis Gospodnetic < otis_gospodne...@yahoo.com> wrote: > > Hi, > > I think the square brackets/curly braces need to be balanced, so this is > currently not doable with existing query parsers. > > Otis > -- > Sematext -- ht

  1   2   >