Re: not equals query in solr

2011-08-24 Thread Ranveer Kumar
any help... On Wed, Aug 24, 2011 at 12:58 PM, Ranveer Kumar wrote: > Hi, > > is this the right way to do it: > q=(state:[* TO *] AND city:[* TO *]) > > regards > Ranveer > > > On Wed, Aug 24, 2011 at 12:54 PM, Ranveer Kumar wrote: > >> Hi All, >> >> How to do negative query in solr. Following are the cri

Re: Optimize requires 50% more disk space when there are exactly 20 segments

2011-08-24 Thread Lance Norskog
Which Solr version do you have? In 3.x and trunk, Tiered and BalancedSegment are there for exactly this reason. In Solr 1.4, your only trick is to do a partial optimize with maxSegments. This lets you say "optimize until there are 15 segments, then stop". Do this with smaller and smaller numbers.
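
The "smaller and smaller numbers" approach above can be sketched as a list of update requests, one per pass. This is only an illustration: the base URL, the starting and final segment counts, and the step size are assumptions, and the helper name `partial_optimize_urls` is hypothetical. It builds the URLs but does not send them.

```python
from urllib.parse import urlencode


def partial_optimize_urls(base_url, start_segments, final_segments, step):
    """Return one update URL per partial-optimize pass, largest target first."""
    if step <= 0:
        raise ValueError("step must be positive")
    urls = []
    target = start_segments
    while target >= final_segments:
        # optimize=true with maxSegments=N asks Solr to merge down to at most N segments.
        params = urlencode({"optimize": "true", "maxSegments": target})
        urls.append(f"{base_url}/update?{params}")
        target -= step
    return urls


# Hypothetical host and numbers, chosen only for illustration.
urls = partial_optimize_urls("http://localhost:8983/solr", 15, 5, 5)
for url in urls:
    print(url)
```

Each pass merges fewer segments than a full optimize would, so the temporary disk overhead per pass should be smaller; how much smaller depends on the segment sizes.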

Optimize requires 50% more disk space when there are exactly 20 segments

2011-08-24 Thread Michael Ryan
I'm using Solr 3.2 with a mergeFactor of 10 and no merge policy configured, thus using the default LogByteSizeMergePolicy. Before I do an optimize, typically the largest segment will be about 90% of the total index size. When I do an optimize, the total disk space required is usually about 2x t

Re: Query vs Filter Query Usage

2011-08-24 Thread Shawn Heisey
On 8/24/2011 6:07 PM, Joshua Harness wrote: Shawn - Thanks for your reply. Given that my application is mainly used as faceted search, would the following types of queries make sense or are there other pitfalls to consider? q=*:*&fq=someField:someValue&fq=anotherField:anotherValue I'm n

Re: Getting DIH status with SolrJ

2011-08-24 Thread Shawn Heisey
On 8/24/2011 4:15 PM, Shawn Heisey wrote: It might not be the prettiest code, but I'll take it. Thank you. I paraphrased quite a bit and have ended up with the following: I put all this into a somewhat generic method. Hopefully it will prove useful to someone else on the list. There are so

Re: Query vs Filter Query Usage

2011-08-24 Thread Joshua Harness
Shawn - Thanks for your reply. Given that my application is mainly used as faceted search, would the following types of queries make sense or are there other pitfalls to consider? q=*:*&fq=someField:someValue&fq=anotherField:anotherValue Thanks! Josh On Wed, Aug 24, 2011 at 4:48 PM, Sha
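
A minimal sketch of how a client might assemble the query string quoted above, with a match-all `q` and one `fq` per facet selection. The helper name `faceted_query` is hypothetical, and the sketch assumes field values need no Solr query-syntax escaping.

```python
from urllib.parse import urlencode, parse_qs


def faceted_query(filters):
    """Build a query string with q=*:* and one fq parameter per (field, value) pair."""
    params = [("q", "*:*")]
    # A list of tuples lets the fq key repeat; a dict would keep only one fq.
    params += [("fq", f"{field}:{value}") for field, value in filters]
    return urlencode(params)


query_string = faceted_query(
    [("someField", "someValue"), ("anotherField", "anotherValue")]
)
print(query_string)
# Decoding shows the parameters as Solr would receive them.
print(parse_qs(query_string))
```

Keeping each facet selection in its own `fq` means each filter can be cached separately, which is the usual argument for this shape of request.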

Best way to anchor solr searches?

2011-08-24 Thread arian487
If I'm searching for users based on last login time, and I search once, then go to the second page with a new offset, I could potentially see the same users on page 2 if the index has changed. What is the best way to anchor it so I avoid this?

Re: commitWithin + SolrJ

2011-08-24 Thread Chris Hostetter
: I ended up doing this with request.process(server) on an UpdateRequest : class. right ... if you peek under the covers of SolrServer most of its methods are just convenience methods for constructing a Request, setting some attributes/streams on it, and then processing it via that Server

Re: Where the heck do you put maxAnalyzedChars?

2011-08-24 Thread Koji Sekiguchi
(11/08/25 5:29), Daniel Skiles wrote: I have a very large field in my index that I need to highlight. Where in the config file do I set the maxAnalyzedChars in order to make this work? Has anyone successfully done this? Placing it in your requestHandler should work. For example: 1000
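
The configuration example in the reply above did not survive in this archive. As a rough equivalent, the same setting can be passed per request as the `hl.maxAnalyzedChars` parameter. The sketch below only builds such a query string; the field name `body`, the query text, and the helper name `highlight_params` are assumptions for illustration.

```python
from urllib.parse import urlencode


def highlight_params(query, field, max_chars):
    """Build request parameters that enable highlighting with an analysis limit."""
    return urlencode(
        {
            "q": query,
            "hl": "true",
            "hl.fl": field,  # field to highlight (hypothetical name below)
            "hl.maxAnalyzedChars": max_chars,
        }
    )


params = highlight_params("solr", "body", 1000)
print(params)
```

Parameters placed in a requestHandler's defaults behave like these request parameters, so either location should have the same effect.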

Re: Getting DIH status with SolrJ

2011-08-24 Thread Shawn Heisey
On 8/24/2011 3:24 PM, Dyer, James wrote: Shawn, I do not know of an easy or a good way to do this. It would be nice if there were a non-frail, programmatic way to get back DIH status but I don't think there is one. I have a (monstrous) program that polls a running DIH handler every so ofte

Re: Newbie Question, can I store structured sub elements?

2011-08-24 Thread dan whelan
You could change starttime and channelname to multiValued=true and use these fields to store all the values for those fields. showing.movie_id and showing.id probably aren't needed in a Solr record. On 8/24/11 7:53 AM, Zac Tolley wrote: I have a very scenario in which I have a film and showin

RE: query

2011-08-24 Thread Jaeger, Jay - DOT
One way I had thought of doing this kind of thing: include in the index an "ACL" of some sort. The problem I see in your case is that the list of "friends" can presumably change over time. So, given that, one way would be to have a little application in between. The request goes to the appli

RE: how to deal with URLDatasource which needs authorization?

2011-08-24 Thread Jaeger, Jay - DOT
You could run the HTML import from Tika (see the Solr tutorial on the Solr website). The job that ran Tika would need the user/password of the site to be indexed, but Solr would not. (You might have to write a little script to get the HTML page using curl or wget or Nutch). Users could then s

Re: automatically dealing with out of memory exceptions

2011-08-24 Thread Daniel Skiles
I've gotten around that by using the Java Service Wrapper from Tanuki Soft to restart the entire container. On Wed, Aug 24, 2011 at 5:28 PM, Jason Toy wrote: > After running a combination of different queries, my solr server eventually > is unable to complete ce

automatically dealing with out of memory exceptions

2011-08-24 Thread Jason Toy
After running a combination of different queries, my solr server eventually is unable to complete certain requests because it runs out of memory, which means I need to restart the server as it's basically useless with some queries working and not others. I am moving to a distributed setting soon, bu

RE: Getting DIH status with SolrJ

2011-08-24 Thread Dyer, James
Shawn, I do not know of an easy or a good way to do this. It would be nice if there were a non-frail, programmatic way to get back DIH status but I don't think there is one. I have a (monstrous) program that polls a running DIH handler every so often to get its status. The crux is something

Getting DIH status with SolrJ

2011-08-24 Thread Shawn Heisey
I can't figure out how to get the particular information I need out of a Solr response with SolrJ. I see that QueryResponse has a number of methods for getting specific information out, but as far as I can see, none of them have anything at all to do with the DIH. I've started out with the fo

Re: Query vs Filter Query Usage

2011-08-24 Thread Shawn Heisey
On 8/24/2011 2:02 PM, Joshua Harness wrote: I've done some basic query performance testing on my SOLR instance, which allows users to search via a faceted search interface. As such, document relevancy is less important to me since I am performing exact match searching. Comparing using filte

Where the heck do you put maxAnalyzedChars?

2011-08-24 Thread Daniel Skiles
I have a very large field in my index that I need to highlight. Where in the config file do I set the maxAnalyzedChars in order to make this work? Has anyone successfully done this?

Re: commitWithin + SolrJ

2011-08-24 Thread Daniel Skiles
I ended up doing this with request.process(server) on an UpdateRequest class. On Wed, Aug 24, 2011 at 2:07 PM, Daniel Skiles wrote: > What is the cleanest way to use the commitWithin directive with SolrJ? > AbstractUpdateRequest has a setCommitWithin() method, but I don't see how to > hook that i

Query vs Filter Query Usage

2011-08-24 Thread Joshua Harness
All - I apologize if this question has been asked before - I couldn't seem to find a straightforward answer by researching it on google and stackoverflow. I am trying to understand when I should use filter queries vs plain vanilla queries. Here's what I understand: * Filter queries can be m

Solr support for stored procedures

2011-08-24 Thread Maria Vazquez
Does Solr support calling stored procedures in the data-config.xml? Thanks! Maria

commitWithin + SolrJ

2011-08-24 Thread Daniel Skiles
What is the cleanest way to use the commitWithin directive with SolrJ? AbstractUpdateRequest has a setCommitWithin() method, but I don't see how to hook that into SolrServer.add(SolrInputDocument doc). Do I need to use SolrServer.request(), or do I need to use some other method? Thanks.

Re: Text Analysis and copyField

2011-08-24 Thread Erick Erickson
Have you considered having two dictionaries and using ajax to query them both and intermingling the results in your suggestions? It'd be some work, but I think it might accomplish what you want. Best Erick On Tue, Aug 23, 2011 at 1:48 PM, Herman Kiefus wrote: > To close, I found this article fro

Re: csv responsewriter and numfound

2011-08-24 Thread Jon Hoffman
I took a look at the source and agree that it would be a bit hairy to bubble up header settings from the response writers. Alternatively, and I'll admit that this is a somewhat hacky proposal, an optional parameter "csv.numfound=true" could be added to the request which would cause the first line

Replication: solrconfig.xml

2011-08-24 Thread alexander
Dear Solr users, I am getting a bit confused here: On Solr 1.4, multicore setup, all config files listed get replicated properly, the only exception being "solrconfig.xml". I have tried these two variations on the master's side: 1.) schema.xml,protwords.txt,stopwords.txt,synonyms.txt,elevate.xm

Re: Field type change / copy field

2011-08-24 Thread Alexei Martchenko
Have you tried something like this in your facet_year index analyzer? This can theoretically do the trick 2011/8/24 Oliver Schihin > Hello list > > My documents come with a field holding a date, always a year: > 2008 > In the schema, this content is taken for a field as an integer, and > it

Re: hierarchical faceting in Solr?

2011-08-24 Thread Alexei Martchenko
Cheers, very good, congratulations 2011/8/23 Naomi Dushay > Chris Beer just did a revamp of the wiki page at: > > > http://wiki.apache.org/solr/**HierarchicalFaceting > > Yay Chris! > > - Naomi > (" ... and I helped!") > > > On Aug 22, 2011, at

Re: Full sentence spellcheck

2011-08-24 Thread Valentin
I've run some tests, and I found that this error occurs when I add a spellcheck component to a handler and try to use spellcheck.q. So spellcheck.q works with this kind of use: http://localhost:8983/solr/db/suggest_full?q=american%20israel&spellcheck.q=american%20israel&qt=spellchecker (with

Newbie Question, can I store structured sub elements?

2011-08-24 Thread Zac Tolley
I have a very scenario in which I have a film and showings, each film has multiple showings at set times on set channels, so I have: Movie - id title description duration Showing - id movie_id starttime channelname I want to know can I store this in solr so that I keep this structure?

Re: csv responsewriter and numfound

2011-08-24 Thread Erik Hatcher
Good idea. However response writers can't control HTTP response headers currently... Only the content type returned. Erik On Aug 24, 2011, at 8:52, Jon Hoffman wrote: > What about the HTTP response header? > > >>> Great question. But how would that get returned in the response? >>>

Re: Property "undefined" in Schema Browser (Solr Admin)

2011-08-24 Thread Chantal Ackermann
Hi Stefan, I'm using Firefox 3.6.20 and Chromium 12.0.742.112 (90304) Ubuntu 10.10. The "undefined" appears with both of them. Chantal On Wed, 2011-08-24 at 14:09 +0200, Stefan Matheis wrote: > Hi Chantal, > > On Wed, Aug 24, 2011 at 1:43 PM, Chantal Ackermann > wrote: > > There is a capit

Re: Spatial Search problems

2011-08-24 Thread Smiley, David W.
Well that's your problem :-P You need to be using the same version of Lucene for reading & writing. Create your index with Lucene 3.3. FYI I tried indexing the point you said you had trouble with, and with a 300km radius, and it found it. On Aug 24, 2011, at 4:39 AM, Javier Heras wrote: > An

can you help on this?

2011-08-24 Thread abhijit bashetti
SEVERE: java.lang.InternalError: a fault occurred in a recent unsafe memory access operation in compiled Java code at org.apache.lucene.store.DataInput.readVInt(DataInput.java:108) at org.apache.lucene.index.TermBuffer.read(TermBuffer.java:64) at org.apache.lucene.index.SegmentTermEnum.next(Segment

Re: csv responsewriter and numfound

2011-08-24 Thread Jon Hoffman
What about the HTTP response header? > > Great question. But how would that get returned in the response? > > > > It is a drag that the header is lost when results are written in CSV, but > there really isn't an obvious spot for that information to be returned. > > I guess a comment would be one

Re: Property "undefined" in Schema Browser (Solr Admin)

2011-08-24 Thread Stefan Matheis
Hi Chantal, On Wed, Aug 24, 2011 at 1:43 PM, Chantal Ackermann wrote: > There is a capital F which is not listed as key? But this is also the > case in your example so probably I'm confusing something. There's a quick hack in place, which tries: the character, the lowercase character & the upper

Re: Property "undefined" in Schema Browser (Solr Admin)

2011-08-24 Thread Chantal Ackermann
Hi Stefan, thanks for your time! There is a capital F which is not listed as key? But this is also the case in your example so probably I'm confusing something. Anyway, the respective output of: /admin/luke?fl=title is: string I-SM---OF---l I-SO 16697 8476 − ... − ... − − Index

Field type change / copy field

2011-08-24 Thread Oliver Schihin
Hello list My documents come with a field holding a date, always a year: 2008 In the schema, this content is taken for a field as an integer, and it will be searchable. Through a copyfield-instruction I move the to a -field, you guess, to use it for faceting and make range queries possible.

query

2011-08-24 Thread directorscott
Hello, In my application, users prepare documents and they can share them with friends. Friendships between users are persisted in database. When a user issues a search, system should bring only those documents which are prepared by friends of the user who makes the search. Can anyone please show

Re: Property "undefined" in Schema Browser (Solr Admin)

2011-08-24 Thread Stefan Matheis
Hi Chantal, what does your luke output look like? What the Schema-Browser does is, it takes the schema- & index-element: > I-SOF---l > I-SO and does a lookup for every mentioned character in the key-hash: > > Indexed > Tokenized > Stored > Multivalued > TermVector Stored > Store Offs

how to deal with URLDatasource which needs authorization?

2011-08-24 Thread deniz
Hi all, I am trying to index a page which basically returns an XML file, but I don't want it to be accessible to anyone else. The page will check for authorization such as a username and password. E.g. the page which returns it is: www.blablabla.com/xyz I would like to index the data

Property "undefined" in Schema Browser (Solr Admin)

2011-08-24 Thread Chantal Ackermann
Hi all, the Schema Browser in the SOLR Admin shows me the following information: """ Field: title Field Type: string Properties: Indexed, Stored, Multivalued, Omit Norms, undefined, Sort Missing Last Schema: Indexed, Stored, Multivalued, Omit Norms, undefined, Sort Missing Last Index: Indexe

Re: Spatial Search problems

2011-08-24 Thread Javier Heras
And one more thing... should I create the index with the same version of Solr that I use to open the index for reading? I created my index with Lucene 2.9, and the Solr version where I'm trying spatial search is 3.3. Thank you very much David

Re: Copying cores with solrj?

2011-08-24 Thread Manish Bafna
Use replication On Wed, Aug 24, 2011 at 1:26 PM, Michael Szalay wrote: > Hi all > > We have a setup with two cores, a "current" and a "work". The current is > used > by the application and the work is used by the updating program. > > I'm implementing a partial index updater. To do that, I want t

Re: Problem using stop words

2011-08-24 Thread Lance Norskog
A note: in the first schema, you had the stopwords after the stemmer. This would not work, since the stopwords are not stemmed. On Wed, Aug 24, 2011 at 12:59 AM, _snake_ wrote: > I forgot to say that my stopwords file is in the same location as the schema > file and the solrconfig file. > > -- >

Re: Problem using stop words

2011-08-24 Thread _snake_
I forgot to say that my stopwords file is in the same location as the schema file and the solrconfig file.

Copying cores with solrj?

2011-08-24 Thread Michael Szalay
Hi all We have a setup with two cores, a "current" and a "work". The current is used by the application and the work is used by the updating program. I'm implementing a partial index updater. To do that, I want to copy the "current" core to the "work" so that I can have the current state to star

Re: Problem using stop words

2011-08-24 Thread _snake_
Thanks everybody for your help!! I changed the stopwords file, and I only use one word per line, without leading/trailing spaces, and without comments. I changed it to UTF-8. I am using the TermsComponent to suggest words to the user (JQuery UI Autocomplete). So, the stopwords are still shown here...

Re: not equals query in solr

2011-08-24 Thread Ranveer Kumar
Hi, is this the right way to do it: q=(state:[* TO *] AND city:[* TO *]) regards Ranveer On Wed, Aug 24, 2011 at 12:54 PM, Ranveer Kumar wrote: > Hi All, > > How to do negative query in solr. Following are the criteria : > I have state and city field where I want to filter only those state and > city w
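
For reference, the open-ended range clause in the query above (`field:[* TO *]`) matches documents that have some indexed value in the field. A minimal sketch of composing it for several fields follows; the helper names are hypothetical, and it assumes blank values are simply not indexed, since a field indexed as an empty string may still match.

```python
from urllib.parse import urlencode


def has_value(field):
    """Range clause matching documents with any indexed value in `field`."""
    return f"{field}:[* TO *]"


def require_fields(*fields):
    """AND together one has-value clause per field."""
    return " AND ".join(has_value(field) for field in fields)


q = require_fields("state", "city")
print(q)
# URL-encoded form, ready to append to a select request.
print(urlencode({"q": q}))
```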

Re: Sorting results by Range

2011-08-24 Thread Sowmya V.B.
Hi Chris Thanks for the clarification. My Doubt:* You said: so if the range of legal values is 0-100, and you care about 10-20 sort=map(map(myNumField,0,10,0),20,100,0) desc, score desc sort=map(map(myNumField,0,10,100),20,100,100) asc, score desc * By doing the first one, I got results with "my
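
To make the quoted sort expression easier to follow, here is a small plain-Python emulation of Solr's `map(x,min,max,target)` function, which replaces values in the inclusive range min..max with target and leaves other values unchanged. The sample values are made up for illustration, and the emulation covers only the first (descending) variant.

```python
def solr_map(x, lo, hi, target):
    """Emulate Solr's map(): values in [lo, hi] become target, others pass through."""
    return target if lo <= x <= hi else x


def sort_key_desc(value):
    """Emulate map(map(myNumField,0,10,0),20,100,0) for a single field value."""
    return solr_map(solr_map(value, 0, 10, 0), 20, 100, 0)


# Hypothetical field values; those strictly between 10 and 20 keep their value.
values = [5, 12, 18, 40, 95]
ranked = sorted(values, key=sort_key_desc, reverse=True)
print(ranked)
```

Because both ranges are inclusive, the boundary values 10 and 20 are also mapped to 0, so only values strictly between them sort ahead of the rest; everything mapped to 0 ties and falls through to the secondary `score desc` sort.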

not equals query in solr

2011-08-24 Thread Ranveer Kumar
Hi All, How to do negative query in solr. Following are the criteria : I have state and city field where I want to filter only those state and city which is not blank. something like: state NOT "" AND city NOT "". I tried -state:"" but it's not working. Or suggest me to do this in better way.. r

Re: Spatial Search problems

2011-08-24 Thread Javier Heras
Hi David, the thing is that all indexed points are in Spain, so distances should always be lower than 1300 km. And yes, I'm using the Solr tutorial schema.xml, for which the tutorial indexes work fine. When I change to my index, it only works when distances are over 4510 km. The query I run is: http://l