Re: Features not present in Solr

2010-03-23 Thread Andrzej Bialecki
On 2010-03-23 06:25, David Smiley @MITRE.org wrote: I use Endeca and Solr. A few notable things in Endeca but not in Solr: 1. Real-time search. 2. "related record navigation" (RRN) is what they call it. This is the ability to join in other records, something Lucene/Solr definitely can't do

Re: Question about query

2010-03-23 Thread Armando Ota
Hey ... thanks for your reply ... unfortunately this is not a case for me .. I have canceled the feature which needs this ... Kind regards Armando Erick Erickson wrote: One thing I've seen suggested is to add the number of values to a separate field, say topic_count. Then, in your situation abov

resetting stats

2010-03-23 Thread Andre Parodi
Hi, Is there a way to reset the stats counters? For example in the Query handler avgTimePerRequest is not much use after a while as it is an avg since the server started. When putting the data into a monitoring system like nagios it would be useful to be able to sample the data and reset it
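A possible client-side workaround, sketched under assumptions: if the monitoring system keeps two snapshots of cumulative counters, the average for just that interval can be derived without resetting anything on the server. The key names `requests` and `total_time_ms` are hypothetical stand-ins for whatever cumulative values the stats output exposes.

```python
from typing import Optional


def interval_avg_time(prev: dict, curr: dict) -> Optional[float]:
    """Average time per request between two cumulative snapshots.

    `requests` and `total_time_ms` are hypothetical key names standing in
    for counters that accumulate since server start.
    """
    delta_requests = curr["requests"] - prev["requests"]
    delta_time = curr["total_time_ms"] - prev["total_time_ms"]
    if delta_requests <= 0:
        # No new requests, or the counters went backwards (e.g. a restart).
        return None
    return delta_time / delta_requests


earlier = {"requests": 1000, "total_time_ms": 50000.0}
later = {"requests": 1100, "total_time_ms": 58000.0}
print(interval_avg_time(earlier, later))
```

The monitoring side would store the previous snapshot and call this on each poll; a `None` result means the interval had nothing to average.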

Configuring multiple SOLR apps to play nice with MBeans / JMX

2010-03-23 Thread Constantijn Visinescu
Hi, I'm having a problem trying to get multiple solr applications to run in the same servlet container because they all try to claim "solr" as a name/category to put their mbeans under and that causes exceptions/crashes for all the applications after the first. I've read http://wiki.apache.org/so

Re: Configuring multiple SOLR apps to play nice with MBeans / JMX

2010-03-23 Thread Charl Mert
Hi Constantijn, I'm not too sure about the JMX monitoring side of things but having looked at the Solr's MultiCore feature it seems really simple to create multiple solr cores that could all be configured to point to one MBean server. When creating a core y

feature request for invalid data formats

2010-03-23 Thread Király Péter
Hi, I don't know whether this is the right place to ask, or whether there is a special tool for issue requests. If I set a field to int, but the input contains a string, Solr reports an error like this: 2010.03.23. 13:27:23 org.apache.solr.common.SolrException log SEVERE: java.lang.NumberForma

Re: Configuring multiple SOLR apps to play nice with MBeans / JMX

2010-03-23 Thread Constantijn Visinescu
Hi, Multicore lets me have multiple cores in a single instance. However, since I have 3 different webapps with embedded Solr, that means I have 3 different instances of Solr (and they're all trying to park their JMX MBeans under the same name, namely solr). Constantijn On Tue, Mar 23, 20

Re: SOLR-1316 How To Implement this autosuggest component ???

2010-03-23 Thread stocki
can nobody help me ? =( stocki wrote: > > i patch an nightly build from solr. > patch runs, classes are in the correct folder, but when i replace > spellcheck with this spellchecl like in the comments, solr cannot find the > classes =( > > > > suggest >name="classname">org

Perfect Match

2010-03-23 Thread Nair, Manas
Hello Experts, I need help on one of my issues with perfect matching of terms. I have a collection of artists which are stored in the index against the field name artist_t which is a text type field. This field consists of values like ["dora", "Dora The Explorer", "Princess Dora The explorer"]

Re: Perfect Match

2010-03-23 Thread Ahmet Arslan
> I need help on one of my issues with perfect matching of > terms. > > I have a collection of artists which are stored in the > index against the field name artist_t which is a text type > field. This field consists of values like ["dora", "Dora The > Explorer", "Princess Dora The explorer"] acr

RE: Perfect Match

2010-03-23 Thread Nair, Manas
Thank you Ahmet. You were right. artist_s:Dora is bringing results. But I need artist_s:Dora the explorer to bring only those results which contain "Dora the explorer". I tried to give artist_s:"Dora the explorer" (phrase search).. that is working. But artist_s:Dora the explorer is not working.

Re: Perfect Match

2010-03-23 Thread Erick Erickson
What Ahmet was getting at was that you need parentheses to ensure that all your terms go against the artist_s field. Something like artist_s:(Dora The Explorer). But watch capitalization. Adding debugQuery=on to your query will show you a lot about what's going on. HTH Erick On Tue, Mar 23, 2010
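To make the parentheses point concrete, here is a minimal sketch of building the grouped query string and the request parameters on the client side. The helper names `field_group_query` and `select_params` are made up for illustration; only the `artist_s:(...)` syntax and the `debugQuery=on` parameter come from the thread.

```python
from urllib.parse import urlencode


def field_group_query(field: str, terms: list) -> str:
    """Scope every term to one field: field:(t1 t2 t3)."""
    return f"{field}:({' '.join(terms)})"


def select_params(query: str, debug: bool = True) -> str:
    """URL-encode the q parameter, optionally adding debugQuery=on."""
    params = {"q": query}
    if debug:
        params["debugQuery"] = "on"
    return urlencode(params)


q = field_group_query("artist_s", ["Dora", "The", "Explorer"])
print(q)
print(select_params(q))
```

Without the parentheses, only the first term is tied to artist_s and the remaining terms go to the default search field, which is the kind of thing the debugQuery output should make visible.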

Re: SOLR-1316 How To Implement this autosuggest component ???

2010-03-23 Thread Alexey Serba
> Error loading class 'org.apache.solr.spelling.suggest.Suggester' Are you sure you applied the patch correctly? See http://wiki.apache.org/solr/HowToContribute#Working_With_Patches Checkout Solr trunk source code ( http://svn.apache.org/repos/asf/lucene/solr/trunk ), apply patch, verify that ever

Re: SOLR-1316 How To Implement this autosuggest component ???

2010-03-23 Thread stocki
Okay, I did this, but one file was not updated correctly: Index: trunk/src/java/org/apache/solr/util/HighFrequencyDictionary.java (from the suggest.patch). I checked it out from Eclipse, applied the patch, and made a new solr.war ... is that the right way? i thought that is making a war i didnt need to make

Re: use termscomponent like spellComponent ?!

2010-03-23 Thread Grant Ingersoll
On Mar 22, 2010, at 12:09 PM, stocki wrote: > > thx. > > it try to patch solr with 1316 but it not works =( > > do i need to checkout from svn Nightly ? > http://svn.apache.org/repos/asf/lucene/solr/ Yes, you will need to work from trunk. > > when i create a patch and then create the WAR

Re: Features not present in Solr

2010-03-23 Thread Grant Ingersoll
On Mar 23, 2010, at 4:17 AM, Andrzej Bialecki wrote: > On 2010-03-23 06:25, David Smiley @MITRE.org wrote: >> >> I use Endeca and Solr. >> >> A few notable things in Endeca but not in Solr: >> 1. Real-time search. > > >> 2. "related record navigation" (RRN) is what they call it. This is the

Re: Features not present in Solr

2010-03-23 Thread Grant Ingersoll
On Mar 22, 2010, at 3:27 PM, Israel Ekpo wrote: >> >> There are probably a lot of features already available in Solr out of the box > that most of those other "enterprise level" applications do not have yet. > > You would also be surprised to learn that a lot of them use Lucene under the > covers

RE: PDFBox/Tika Performance Issues

2010-03-23 Thread Giovanni Fernandez-Kincade
Sorry for the late reply - been out of town for a couple of days. From my solrconfig: ignored_ text -Original Message- From: Grant Ingersoll [mailto:gsi...@gmail.com] On Behalf Of Grant Ingersoll Sent: Saturday, March 20, 2010 8:43 AM To: solr-user@lucene.apa

Re: PDFBox/Tika Performance Issues

2010-03-23 Thread Mattmann, Chris A (388J)
Hi Giovanni, The error that you're showing in your logs below indicates that this message signature: org.apache.solr.handler.ContentStreamLoader.load(Lorg/apache/solr/request/SolrQueryRequest;Lorg/apache/solr/response/SolrQueryResponse;Lorg/apache/solr/common/util/ContentStream;) doesn't match

Spatial queries

2010-03-23 Thread Jean-Sebastien Vachon
Hi All, I am using the package from JTeam to perform spatial searches on my index. I'd like to know if it is possible to build a query that uses multiple clauses. Here is an example: q={!spatial lat=123 long=456 radius=10} OR {!spatial lat=111 long=222 radius=20}title:java Basically that would

Issue w/ highlighting a String field

2010-03-23 Thread Saïd Radhouani
I have trouble highlighting a field of type "string". It looks like highlighting only works with tokenized fields; for instance, it worked with text and another type I defined. Is this true, or am I making a mistake that prevents highlighting from working on string? Thanks for

Re: Issue w/ highlighting a String field

2010-03-23 Thread Markus Jelsma
Hello, Check out the wiki [1] on what options to use for highlighting and other components. [1]: http://wiki.apache.org/solr/FieldOptionsByUseCase Cheers, On Tuesday 23 March 2010 17:11:42 Saïd Radhouani wrote: > I have trouble with highlighting field of type "string". It looks like > hig

Re: Issue w/ highlighting a String field

2010-03-23 Thread Saïd Radhouani
Thanks Markus. It says that a tokenizer must be defined for the field. Here is the fieldType I'm using and the field I want to highlight on. As you can see, I defined a tokenizer, but it's still not working. Any idea? In the schema:

Re: Issue w/ highlighting a String field

2010-03-23 Thread Erick Erickson
Did you restart solr and reindex? just changing the field definition won't help you without reindexing... One thing worries me about your fragment, you call it text_Sort. If you really intend to sort by this field, it may NOT be tokenized, you'll probably have to use copyField HTH Erick On T

Re: Issue w/ highlighting a String field

2010-03-23 Thread Saïd Radhouani
Thanks Erick. Actually, I restarted and reindexed a number of times, but it's still not working. RE: your question, I intend to use this field for automatic PHRASED boosting; is that ok?: title_sort Thanks. 2010/3/23 Erick Erickson > Did you restart solr and reindex? just changing the field

Cannot fetch urls with "target=_blank"

2010-03-23 Thread Stefano Cherchi
As in subject: when I try to fetch a page whose link should open in new window, Nutch fails. I know it is not a Solr issue, actually, but I beg for a hint. S -- "Anyone proposing to run Windows on servers should be prepared to explain what they know about ser

lowercasing for sorting

2010-03-23 Thread Nagelberg, Kallin
I'm trying to perform a case-insensitive sort on a field in my index that contains values like aaa bbb AA BB And I get them sorted like: aaa bbb AA BB When I would like them: aa aaa bb bbb To do this I'm trying to set up a fieldType whose sole purpose is to lowercase a value on query and ind

DIH - Deleting documents

2010-03-23 Thread André Maldonado
Hi all. How can I delete documents when using DataImportHandler on a delta import? Thanks "Then those who were in the boat came and worshiped him, saying: Truly you are the Son of God." (Matthew 14:33)

Re: DIH - Deleting documents

2010-03-23 Thread Mauricio Scheffer
Take a look at the DIH special commands: http://wiki.apache.org/solr/DataImportHandler#Special_Commands Some other options: http://stackoverflow.com/questions/1555610/solr-dih-how-to-handle-deleted-documents Cheers, Mauricio 2010/3/2

Performing "Starts with searches"

2010-03-23 Thread Vladimir Sutskever
How do I perform a "starts with" search in Lucene/Solr. Ex: I need all results that start with "Bill" - NOT just contain "Bill" somewhere in the search string. Thank You -Vladimir

RE: lowercasing for sorting

2010-03-23 Thread Binkley, Peter
Solr makes this easy: You can populate this field from another field using copyField, if you also need to be able to search or display the original values. Just out of curiosity, can you tell us anything about what the Globe and Mail is using Solr for? (assuming the question is work-related)
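As a rough illustration of the idea (not the schema itself), the copyField approach amounts to keeping the original value for display while ordering on a lowercased copy. The function names below are hypothetical, and the values are the ones from the question.

```python
def sort_keys(values):
    """Pair each original value with a lowercased copy used only for ordering."""
    return [(v.lower(), v) for v in values]


def case_insensitive_sort(values):
    """Order by the lowercased copy, but return the original values."""
    return [original for _key, original in sorted(sort_keys(values))]


print(case_insensitive_sort(["aaa", "bbb", "AA", "BB"]))
```

The lowercased copy plays the role of the dedicated sort field, and the untouched original is what would still be searched or displayed.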

RE: lowercasing for sorting

2010-03-23 Thread Nagelberg, Kallin
Thanks, and my cover is apparently blown :P We're looking at solr for a number of applications, from taking the load off the database, to user searching etc. I don't think I'll get fired for saying that :P Thanks, Kallin Nagelberg -Original Message- From: Binkley, Peter [mailto:peter.b

RE: PDFBox/Tika Performance Issues

2010-03-23 Thread Giovanni Fernandez-Kincade
I don't think so. I'm using Tomcat on my servers, but I set up my local machine with the Eclipse-Jetty plugin from that Lucid article and I'm getting the same error. These are the libraries referenced in my Eclipse project: apache-solr-core-1.5-dev.jar apache-solr-dataimporthandler-1.5-dev.jar

Re: Performing "Starts with searches"

2010-03-23 Thread Ahmet Arslan
> How do I perform a "starts with" > search in Lucene/Solr. > > Ex: I need all results that start with > "Bill"   - NOT just contain "Bill" somewhere > in the search string. In Lucene with SpanFirstQuery [1]. In Solr you can copy your field into a string typed field. And use prefix query: stri
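The difference between the two behaviours can be sketched in plain Python, assuming a whitespace split as a stand-in for a tokenized text field; the sample names are invented for illustration.

```python
def starts_with(value: str, prefix: str) -> bool:
    """Prefix match against the whole, untokenized value (string-field style)."""
    return value.startswith(prefix)


def contains_token(value: str, term: str) -> bool:
    """Token match, roughly how a whitespace-tokenized text field behaves."""
    return term in value.split()


names = ["Bill Gates", "Kill Bill", "Billboard Charts"]
print([n for n in names if starts_with(n, "Bill")])
print([n for n in names if contains_token(n, "Bill")])
```

Note that a pure prefix match also accepts values such as "Billboard Charts", which may or may not be wanted.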

Re: Issue w/ highlighting a String field

2010-03-23 Thread Ahmet Arslan
> Thanks Erick. Actually, I restarted > and reindexed a number of times, but it's still > not working. Highlighting on string typed fields works perfectly. See the output of: http://localhost:8983/solr/select/?q=id%3ASOLR1000&version=2.2&start=0&rows=10&indent=on&hl=true&hl.fl=id But there must be a ma

Re: DIH - Deleting documents

2010-03-23 Thread blargy
Are there any examples out there for using these special commands? I'm not quite sure of the syntax. Any simple example will suffice. Thanks mausch wrote: > > Take a look at the DIH special commands: > http://wiki.apache.org/solr/DataImportHandler#Special_Commands >

RE: Perfect Match

2010-03-23 Thread Ahmet Arslan
> Thankyou Ahmet. You were right. > artist_s:Dora is bringing results. > But I need artist_s:Dora the explorer to bring only those > results which contain "Dora the explorer". > > I tried to give artist_s:"Dora the explorer" (phrase > search).. that is working. But artist_s:Dora the explorer is >

Re: DIH - Deleting documents

2010-03-23 Thread André Maldonado
In my case I will solve the problem with postImportDeleteQuery Thanks "Then those who were in the boat came and worshiped him, saying: Truly you are the Son of God." (Matthew 14:33) On Tue, Mar 23, 2010 at 15:29, blargy wrote: > > Are there any examples out there for using these sp

Impossible Boost Query?

2010-03-23 Thread blargy
I was wondering if this is even possible. I'll try to explain what I'm trying to do to the best of my ability. Ok, so our site has a bunch of products that are sold by any number of sellers. Currently when I search for some product I get back all products matching that search term but the proble

Out of Memory

2010-03-23 Thread Neil Chaudhuri
I am using the DataImportHandler to index literally millions of documents in an Oracle database. Not surprisingly, I got the following after a few hours: java.sql.SQLException: ORA-04030: out of process memory when trying to allocate 4032 bytes (kolaGetRfcHeap,kghsseg: kolaslCreateCtx) Has anyo

Solr Self-Join Query

2010-03-23 Thread Vladimir Sutskever
Hi Guys/Gals, I have columns like so in my index: client_id, client_name, client_parent_id. Does SOLR support self-join queries? Example: client_name:wallmart AND (client_parent_id!=client_id) I need all entries that match "wallmart" and do NOT have client_parent_id==client_id Thank you

RE: Out of Memory

2010-03-23 Thread Craig Christman
Is this on Oracle 10.2.0.4? Looking at the Oracle support site there's a memory leak using some of the XML functions that can be fixed by upgrading to 10.2.0.5, 11.2, or by using 10.2.0.4 Patch 2 in Windows 32-bit. -Original Message- From: Neil Chaudhuri [mailto:nchaudh...@potomacfusion

RE: Out of Memory

2010-03-23 Thread Dennis Gearon
Now THAT's real open source help! Nice job Craig. Dennis Gearon Signature Warning EARTH has a Right To Life, otherwise we all die. Read 'Hot, Flat, and Crowded' Laugh at http://www.yert.com/film.php --- On Tue, 3/23/10, Craig Christman wrote: > From: Craig Christman > Subj

Re: Solr Self-Join Query

2010-03-23 Thread Otis Gospodnetic
Vladimir, Think of Solr/Lucene index as a single, flat, denormalized table, where the "columns" are called "fields". client_id:walmart client_name:Walmart client_parent_id:walmart The query that I think you are looking for then becomes: +client_id:walmart -client_parent_id:walmart Otis
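A small in-memory sketch of what the required (+) and prohibited (-) clauses in that query select, treating each document as a flat dict. The sample documents and the `matches` helper are made up for illustration.

```python
def matches(doc: dict, required: dict, prohibited: dict) -> bool:
    """True if doc has every required field value and none of the prohibited ones."""
    has_required = all(doc.get(f) == v for f, v in required.items())
    has_prohibited = any(doc.get(f) == v for f, v in prohibited.items())
    return has_required and not has_prohibited


# Hypothetical flat documents, one "row" per indexed record.
docs = [
    {"client_id": "walmart", "client_name": "Walmart", "client_parent_id": "walmart"},
    {"client_id": "walmart", "client_name": "Walmart", "client_parent_id": "holding"},
    {"client_id": "target", "client_name": "Target", "client_parent_id": "target"},
]

# Mirrors: +client_id:walmart -client_parent_id:walmart
hits = [d for d in docs
        if matches(d, {"client_id": "walmart"}, {"client_parent_id": "walmart"})]
print(hits)
```

In this sketch the value is written out literally in both clauses; nothing here compares one field against another within the same document.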

Re: Impossible Boost Query?

2010-03-23 Thread Otis Gospodnetic
Would Field Collapsing from SOLR-236 do the job for you? Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Hadoop ecosystem search :: http://search-hadoop.com/ - Original Message > From: blargy > To: solr-user@lucene.apache.org > Sent: Tue, March 23, 2010 2:39:48 PM

Re: Cannot fetch urls with "target=_blank"

2010-03-23 Thread Otis Gospodnetic
hi Stefano, nutch-user@ is a much better place to ask this question really. You'll also want to include more info about "Nutch fails". Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Hadoop ecosystem search :: http://search-hadoop.com/ - Original Message > From:

Re: Features not present in Solr

2010-03-23 Thread David Smiley @MITRE.org
Interesting. Do you have a reference (e.g. a patch, post, ...) to people actually doing this? The FieldCache seems like cheating because it's in-memory and there is a limited amount of memory, so for large data sets I have to wonder. Grant Ingersoll-6 wrote: > > > On Mar 23, 2010, at 4:17 AM

Re: Configuring multiple SOLR apps to play nice with MBeans / JMX

2010-03-23 Thread Otis Gospodnetic
Wow, this sounds interesting. I never looked at JMX with multiple cores Solr instances. I wonder if this calls for a new JIRA issue Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Hadoop ecosystem search :: http://search-hadoop.com/ - Original Message > From:

Re: [POLL] Users of abortOnConfigurationError ?

2010-03-23 Thread Ryan McKinley
The 'abortOnConfigurationError' option was added a long time ago... at the time, there were many errors that would just be written to the logs but startup would continue normally. I felt (and still do) that if there is a configuration error everything should fail loudly. The option in solrconfig.

Re: Impossible Boost Query?

2010-03-23 Thread blargy
Possibly. How can I install this as a contrib or do I need to actually perform the patch? Otis Gospodnetic wrote: > > Would Field Collapsing from SOLR-236 do the job for you? > > Otis > > Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch > Hadoop ecosystem search :: http://search-

Re: Impossible Boost Query?

2010-03-23 Thread blargy
Maybe a better question is... how can I install this and will it work with 1.4? Thanks blargy wrote: > > Possibly. How can I install this as a contrib or do I need to actually > perform the patch? > > > Otis Gospodnetic wrote: >> >> Would Field Collapsing from SOLR-236 do the job for you? >

Re: Configuring multiple SOLR apps to play nice with MBeans / JMX

2010-03-23 Thread Chris Hostetter
: I'm having a problem trying to get multiple solr applications to run in the : same servlet container because they all try to claim "solr" as a Hmmm... i think you're in new territory here. I don't know that anyone has ever mentioned doing this before. Honestly: I thought the hierarchical na

Re: use termscomponent like spellComponent ?!

2010-03-23 Thread Chris Hostetter
: so when I search for "nik" termsCom suggests "nikon". that's exactly : what I want. : but when I type "nikon on" I want solr to suggest "nikon one" , try using copyField to index an untokenized version of your field, so that "nikon one" is a single term, then "nikon on" as a pr

Re: How to get Facet results only on a range of search results documents

2010-03-23 Thread Chris Hostetter
: I would like to return Facet results only on the range of search results : (say 1-100) not on the whole set of search results. Any idea how can I do : it? That's pretty trivial to do in the client layer (fetch the first 100 results, iterate over them, and count per facet field) If you really
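The client-layer approach described here could look roughly like this, assuming the results have already been fetched and parsed into a list of dicts. The `facet_counts` helper, the `brand` field, and the sample documents are hypothetical.

```python
from collections import Counter


def facet_counts(docs, field, limit=100):
    """Count values of `field` over the first `limit` documents only."""
    counts = Counter()
    for doc in docs[:limit]:
        value = doc.get(field)
        if value is None:
            continue
        # Treat single-valued and multi-valued fields the same way.
        values = value if isinstance(value, list) else [value]
        counts.update(values)
    return counts


results = [
    {"id": 1, "brand": "nikon"},
    {"id": 2, "brand": "canon"},
    {"id": 3, "brand": "nikon"},
    {"id": 4, "brand": "sony"},
]
print(facet_counts(results, "brand", limit=3))
```

With `limit=3` the fourth document is ignored, so only the slice of results being displayed contributes to the counts.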

Re: Impossible Boost Query?

2010-03-23 Thread Otis Gospodnetic
You'd likely want to get the latest patch and trunk and try applying. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Hadoop ecosystem search :: http://search-hadoop.com/ - Original Message > From: blargy > To: solr-user@lucene.apache.org > Sent: Tue, March 23, 201

Re: dismax and q.op

2010-03-23 Thread Chris Hostetter
: *I haven't mentioned value for mm* ... : My result:- No results; but each of the terms individually gave me results! http://wiki.apache.org/solr/DisMaxRequestHandler#mm_.28Minimum_.27Should.27_Match.29 "The default value is 100% (all clauses must match)" : 2. Does the default

Re: Impossible Boost Query?

2010-03-23 Thread blargy
Thanks, but I'm not quite sure how to apply the patch. I just use the packaged solr-1.4.0.war in my deployment (no compiling, etc). Is there a way I can patch the war file? Any instructions would be greatly appreciated. Thanks Otis Gospodnetic wrote: > > You'd likely want to get the latest pa

Re: How to Combine Dismax Query Handler and Clustering Component

2010-03-23 Thread Chris Hostetter
: How do we combine clustering component and Dismax query handler? The dismax *handler* is now just the SearchHandler with defType=dismax ... so if you follow the examples for setting up the clustering component on an instance of SearchHandler, all you have to do is configure that instance to

Re: 64 bit integers (MySQL bigint) and SOLR

2010-03-23 Thread Chris Hostetter
: : The primary key for my database is a BIGINT, basically a 64 bit integer. The : value is well below the 32 bit maximum (about 230 million right now) but : someday in the future that might not be the case. In the schema, we have it : mapped to a "tint" field type as defined in the example sch

Re: [POLL] Users of abortOnConfigurationError ?

2010-03-23 Thread Chris Hostetter
: I felt (and still do) that if there is a configuration error : everything should fail loudly. The option in solrconfig.xml was added : as a back-compatible way to get both behaviors. Oh man ... i completely remembered that backwards ... i thought you were the one that was arguing in favor of

Re: [ANN] Zoie Solr Plugin - Zoie Solr Plugin enables real-time update functionality for Apache Solr 1.4+

2010-03-23 Thread brad anderson
I see, so when you do a commit it adds it to Zoie's ramdirectory. So, could you just commit after every document without having a performance impact and have real time search? Thanks, Brad On 20 March 2010 00:34, Janne Majaranta wrote: > To my understanding it adds a in-memory index which holds

Re: HTTP Status 500 - null java.lang.IllegalArgumentException at java.nio.Buffer.limit(Buffer.java:249)

2010-03-23 Thread Chris Hostetter
: I am doing a really simple query on my index (it's running in tomcat): : : http://host:8080/solr_er_07_09/select/?q=hash_id:123456 ... details please ... http://wiki.apache.org/solr/UsingMailingLists ... what version of solr? lucene? tomcat? : I built the index on a different m

Re: release schedule?

2010-03-23 Thread Chris Hostetter
: I'm new to this list, so please excuse me if I'm asking in the wrong : place. you're definitely in the right place. : - Are there any planned Solr releases for this year? : : - What are the planned release dates/contents, etc.? releases aren't really planned .. they happe

SOLR-236 patch with version 1.4

2010-03-23 Thread blargy
Is the field collapsing patch (236) not compatible with Solr 1.4? $ patch -p0 -i ~/Desktop/SOLR-236.patch patching file src/test/test-files/solr/conf/solrconfig-fieldcollapse.xml patching file src/java/org/apache/solr/search/fieldcollapse/collector/DocumentGroupCountCollapseCollectorFactory.java

Re: SOLR-1316 How To Implement this autosuggest component ???

2010-03-23 Thread Lance Norskog
You need 'ant' to do builds. At the top level, do: ant clean ant example These will build everything and set up the example/ directory. After that, run: ant test-core to run all of the unit tests and make sure that the build works. If the autosuggest patch has a test, this will check that the pa

Re: Impossible Boost Query?

2010-03-23 Thread Lance Norskog
At this point (and for almost 3 years :) field collapsing is a source patch. You have to check out the Solr trunk from the Apache subversion server, apply the patch with the 'patch' command, and build the new Solr with 'ant'. On Tue, Mar 23, 2010 at 4:13 PM, blargy wrote: > > Thanks but Im not qu

Re: Impossible Boost Query?

2010-03-23 Thread Lance Norskog
Also, there is a 'random' type which generates random numbers. This might help you also. On Tue, Mar 23, 2010 at 7:18 PM, Lance Norskog wrote: > At this point (and for almost 3 years :) field collapsing is a source > patch. You have to check out the Solr trunk from the Apache subversion > server,

Re: HTTP Status 500 - null java.lang.IllegalArgumentException at java.nio.Buffer.limit(Buffer.java:249)

2010-03-23 Thread Lance Norskog
That area of the Lucene code throws NullPEs and ArrayIndex bugs, but they are all caused by corrupt indexes. They should be caught and wrapped. On Tue, Mar 23, 2010 at 4:33 PM, Chris Hostetter wrote: > > : I am doing a really simple query on my index (it's running in tomcat): > : > : http://host:

Re: dismax and q.op

2010-03-23 Thread Mark Fletcher
Hi Hoss, Thank you so much for your time. Regarding the last one I myself got confused when I posed the question. I got it after your reply. I think I was actually looking for something like the debugQuery="on" option, which I found later. Best Regards, Mark. On Tue, Mar 23, 2010 at 6:56 PM, Ch

phrase segmentation plugin in component, analyzer, filter or parser?

2010-03-23 Thread Tommy Chheng
I'm writing an experimental phrase segmentation plugin for solr. My current plan is to write it as a SearchComponent by overriding the queryString with the new grouped query. ex. (university of california irvine 2009) will be re-written to "university of california irvine" "2009" Is the SearchC
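Only the rewriting step is sketched below, assuming some segmenter has already split the input into phrases. The `quote_segments` helper and the hard-coded segment list are hypothetical; the segmentation logic itself is not shown.

```python
def quote_segments(segments):
    """Wrap each segment in double quotes so it is parsed as a phrase."""
    quoted = []
    for segment in segments:
        # Escape embedded double quotes so they cannot end the phrase early.
        escaped = segment.replace('"', '\\"')
        quoted.append(f'"{escaped}"')
    return " ".join(quoted)


# Hypothetical output of the segmenter for: university of california irvine 2009
segments = ["university of california irvine", "2009"]
print(quote_segments(segments))
```

Whichever extension point ends up hosting the plugin, a step like this is what would produce the grouped query string from the segmenter's output.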

Re: phrase segmentation plugin in component, analyzer, filter or parser?

2010-03-23 Thread Erik Hatcher
On Mar 24, 2010, at 1:35 AM, Tommy Chheng wrote: I'm writing an experimental phrase segmentation plugin for solr. My current plan is to write as a SearchComponent by overriding the queryString with the new grouped query. ex. (university of california irvine 2009) will be re-written to "univ