Re: Status of Spelt integration

2009-12-07 Thread freerk55
The standard spell checker of Thunderbird works in eGroupware. But not in Felamimail!!?? Why not? How can I get it working as it does in the rest of eGroupware? Freerk Jongsma Toby Cole-2 wrote: Hi Andrew, We ended up abandoning the spelt integration as the built in solr

Re: Status of Spelt integration

2009-12-07 Thread Toby Cole
I'm pretty sure this isn't a Solr related question. Have you tried asking on the eGroupware mailing lists? http://sourceforge.net/mail/?group_id=78745 Toby. On 7 Dec 2009, at 08:52, freerk55 wrote: The standard spell checker of Thunderbird works in eGroupware. But not in Felamimail!!?? Why

Re: Solr 1.4: StringIndexOutOfBoundsException in SpellCheckComponent with HTMLStripCharFilterFactory

2009-12-07 Thread Koji Sekiguchi
Robin Wojciki wrote: Koji, I was able to create a minimal replication. Attached zip has solr.xml, solrconf.xml and Main.java. I was able to replicate the issue by replacing the conf files in apache-solr-1.4.0/example/solr/conf and running the class Main. Could you please confirm if this replication

Re: Question about the message Indexing failed. Rolled back all changes.

2009-12-07 Thread yountod
That was it! Thank you for the tip. To clarify for other beginners: Create a blank file called dataimport.properties in your conf directory and don't forget to make sure the system has write access to it. Lance Norskog-2 wrote: This is definitely a bug. Please open a JIRA issue for
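
The tip above (a blank dataimport.properties that the system can write to) could be scripted roughly as follows. This is only a sketch: the conf directory location is an assumption that depends on your install, and the write check reflects the user running the script, which may not be the user your servlet container runs as.

```python
import os
from pathlib import Path


def ensure_dataimport_properties(conf_dir):
    """Create an empty dataimport.properties in conf_dir if it is missing.

    Returns the file path and whether the current process can write to it.
    """
    path = Path(conf_dir) / "dataimport.properties"
    path.touch(exist_ok=True)  # creates a blank file, leaves an existing one alone
    return path, os.access(path, os.W_OK)


if __name__ == "__main__":
    import tempfile

    # A temporary directory stands in for the real Solr conf directory.
    with tempfile.TemporaryDirectory() as conf_dir:
        path, writable = ensure_dataimport_properties(conf_dir)
        print(path.name, "writable:", writable)
```

If the check reports that the file is not writable, fixing ownership or permissions for the account that runs Solr is the next step.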

Multiindexing

2009-12-07 Thread Jörg Agatz
Hi users, I need help with multi-indexing in Solr. I want one core and 3 to 5 different indexes, so that I can search all or some of them simultaneously. I found the help in the wiki, but it doesn't help. http://wiki.apache.org/solr/MultipleIndexes?highlight=%28multi%29 It says nothing about

DIH Updating

2009-12-07 Thread Lee Smith
Hello all, sorry, newbie question. I'm looking at using the Data Import Handler to add my data to Solr, but I am a little confused about how to go about updating the index. I understand there is no update operation, just a delete and replace, but how will Solr know what to remove and add? Also hope someone

Solr Search in stemmed and non stemmed mode

2009-12-07 Thread khalid y
Hi! I'm looking for a way to have two indexes in Solr, one stemmed and one non-stemmed. Why? It's simple :-) My users can query for: - banking marketing = returns all documents matching bank*** and market*** - "banking" marketing = returns all documents matching banking and market***

Re: Solr 1.4: StringIndexOutOfBoundsException in SpellCheckComponent with HTMLStripCharFilterFactory

2009-12-07 Thread Robin Wojciki
Koji, In the sample I sent, the exception comes only if the HTMLStripCharFilter is there. However, your test case seems to capture the essence. Sorry if I sent you on a wild goose chase. Thanks for taking the time! I will log a ticket. Robin On Mon, Dec 7, 2009 at 5:09 PM, Koji Sekiguchi

RE: search on tomcat server

2009-12-07 Thread Jill Han
In fact, I just followed the instructions titled as Tomcat On Windows. Here are the updates on my computer 1. -Dsolr.solr.home=C:\solr\example 2. change dataDir to <dataDir>C:\solr\example\data</dataDir> in solrconfig.xml at C:\solr\example\conf 3. created solr.xml at C:\Tomcat

Re: question about schemas

2009-12-07 Thread solr-user
Lance Norskog-2 wrote: You can make a separate facet field which contains a range of buckets: 10, 20, 50, or 100 means that the field has a value 0-10, 11-20, 21-50, or 51-100. You could use a separate filter query with values for these buckets. Filter queries are very fast in Solr 1.4 and
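
The bucket scheme described above can be sketched as a small function that maps a value to its bucket label at index time. The field name price_bucket is hypothetical, and rejecting values above 100 is an assumption the original message does not spell out.

```python
from bisect import bisect_left

# Upper bounds of the buckets: 0-10, 11-20, 21-50, 51-100.
BUCKET_BOUNDS = [10, 20, 50, 100]


def bucket_for(value):
    """Return the bucket label (its upper bound) for a value in 0..100."""
    if not 0 <= value <= BUCKET_BOUNDS[-1]:
        raise ValueError(f"value {value} is outside 0..{BUCKET_BOUNDS[-1]}")
    return BUCKET_BOUNDS[bisect_left(BUCKET_BOUNDS, value)]


def bucket_filter_query(value, field="price_bucket"):
    """Build a filter query string for the bucket containing value.

    The field name is a placeholder; use whatever facet field your schema defines.
    """
    return f"{field}:{bucket_for(value)}"


if __name__ == "__main__":
    for v in (0, 10, 11, 35, 100):
        print(v, "->", bucket_filter_query(v))
```

Storing the label in its own field keeps the facet and the filter query on a handful of distinct values.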

Re: DIH Updating

2009-12-07 Thread Erick Erickson
The short form is that you must provide and identify a unique key (analogous to a database PK). See http://wiki.apache.org/solr/UniqueKey There's an example here: http://wiki.apache.org/solr/DataImportHandler But

Re: Solr 1.4: StringIndexOutOfBoundsException in SpellCheckComponent with HTMLStripCharFilterFactory

2009-12-07 Thread Robin Wojciki
Logged a ticket for Solr: https://issues.apache.org/jira/browse/SOLR-1630 Thanks, Robin On Mon, Dec 7, 2009 at 9:36 PM, Robin Wojciki robin.wojc...@gmail.com wrote: Koji, In the sample I sent, the exception comes only if the HTMLStripCharFilter is there. However, your test case seems to

Re: Solr Search in stemmed and non stemmed mode

2009-12-07 Thread Erick Erickson
Try searching the mail archive for stemmer exact match or similar, this has been discussed multiple times and you'll get more complete discussions way faster. One suggestion is to use two fields, one for the stemmed version and one for the original, then use whichever field you need to via

Re: Solr Search in stemmed and non stemmed mode

2009-12-07 Thread khalid y
Thanks, I'll read the mail archive. Your suggestion is like mine but without the DisMax handler. I'm going to read up on what this handler is. I have one field text and another text_unstemmed where I copy all other fields. I'm writing my custom query handler that checks whether quotes exist and switches
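
The idea of switching fields depending on quotes might look roughly like this on the client side. It is a sketch, not the poster's handler: the field names text and text_unstemmed come from the message, while the function name and the rule "quoted term goes to the unstemmed field" are assumptions.

```python
import re

# Matches either a "quoted phrase" or a bare whitespace-delimited term.
TOKEN = re.compile(r'"([^"]+)"|(\S+)')


def route_terms(user_query, stemmed_field="text", exact_field="text_unstemmed"):
    """Send quoted terms to the unstemmed field and bare terms to the stemmed one."""
    clauses = []
    for quoted, bare in TOKEN.findall(user_query):
        if quoted:
            clauses.append(f'{exact_field}:"{quoted}"')
        else:
            clauses.append(f"{stemmed_field}:{bare}")
    return " ".join(clauses)


if __name__ == "__main__":
    print(route_terms('banking marketing'))
    print(route_terms('"banking" marketing'))
```

An unbalanced quote is treated as part of a bare term here; a real handler would need to decide how to report or repair that.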

Re: question about schemas (and SOLR-1131?)

2009-12-07 Thread solr-user
wojtekpia wrote: Could this be solved with a multi-valued custom field type (including a custom comparator)? The OP's situation deals with multi-valuing products for each customer. If products contain strictly numeric fields then it seems like a custom field implementation (or extension of

RE: Embedded for write, HTTP for read - cache aging

2009-12-07 Thread Peter 4U
Hi Erik, Thanks for your answer. Yes, I've done an /update to the http server, which certainly works as far as the 'reading' goes. This sends the update to the back-end index though, which essentially defeats the purpose of having the embedded instance do the write (as writes are

Re: Multiple Solr Instances - Multiple Jetty Instances

2009-12-07 Thread Smiley, David W.
If you have many documents (say 10M documents, probably a larger threshold) then you will benefit from sharding your index, i.e. splitting your index up into multiple cores and using distributed searches. You could use one VM and multiple cores just fine, assuming you have multiple CPUs. If
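
A distributed search across several cores is requested by listing them in the shards parameter of an ordinary select request. The sketch below only builds such a URL; the host names and core names are placeholders, not values from this thread.

```python
from urllib.parse import urlencode


def distributed_select_url(coordinator, shards, query):
    """Build a select URL that asks one core to fan the query out to all shards.

    coordinator and each shard are given as host:port/path, without a scheme.
    """
    params = urlencode({"q": query, "shards": ",".join(shards)})
    return f"http://{coordinator}/select?{params}"


if __name__ == "__main__":
    # Hypothetical two-core layout on one machine.
    shards = ["localhost:8983/solr/core0", "localhost:8983/solr/core1"]
    print(distributed_select_url(shards[0], shards, "*:*"))
```

Any one of the cores can act as the coordinator; it merges the per-shard results before responding.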

RE: search on tomcat server

2009-12-07 Thread Sascha Szott
Hi Jill, just to make sure your index contains at least one document, what is the output of http://localhost:8080/solr/select?q=*:*&debugQuery=true&echoParams=all Best, Sascha Jill Han wrote: In fact, I just followed the instructions titled as Tomcat On Windows. Here are the updates on my

Re: Multiple Solr Instances - Multiple Jetty Instances

2009-12-07 Thread Amit Nithian
David thanks for your response. With that having been said, is there a general ratio of the number of Tomcat/Jetty HTTP threads to allocate relative to the number of CPU cores you have on your machine? Is the default in Tomcat/Jetty acceptable? Thanks again Amit On Mon, Dec 7, 2009 at 10:00 AM,

Re: comparing index-time boost and sort in the case of a date field

2009-12-07 Thread Chris Hostetter
: : I have a requirement where I need to display records with more recent values : for approval_dt to come first when a query is made. I thought of approaching : this in 2 different ways:- ... : 2. INDEX-TIME boosting. : I sorted the query from database itself in asc order of approval_dt

# in query

2009-12-07 Thread Joel Nylund
Hi, How can I put a # sign in a query, do I need to escape it? For example, I want to query books with titles that contain #. None of these have worked so far: http://localhost:8983/solr/select?q=textTitle:#; http://localhost:8983/solr/select?q=textTitle:# http://localhost:8983/solr/select?q=textTitle:\#; Getting

RE: SolrPlugin Guidance

2009-12-07 Thread Chris Hostetter
: e.g. For the following query that looks for a file in a directory: : q=+directory_name:myDirectory +file_name:myFile : : We'd need to decompose the query into the following two queries: : 1. q=+directory_name:myDirectory&fl=directory_id : 2. q=+file_name:myFile +directory_id:(results from query

Re: # in query

2009-12-07 Thread Paul Libbrecht
Sure, you have to escape it as %23! Otherwise the browser treats it as the separator between the URL for the server (on the left) and the fragment identifier (on the right), which is not sent to the server. You might want to read about URL-encoding, escaping with backslash is a shell-thing, not
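
To see both halves of that explanation, the sketch below first splits the unescaped URL from the question to show that everything from # onward becomes the fragment, then builds the same request with the query value percent-encoded. The helper name select_url is made up for illustration.

```python
from urllib.parse import urlencode, urlsplit

BASE = "http://localhost:8983/solr/select"


def select_url(query):
    """Build a select URL with the q parameter percent-encoded."""
    return f"{BASE}?{urlencode({'q': query})}"


if __name__ == "__main__":
    unescaped = urlsplit(BASE + "?q=textTitle:#")
    print(repr(unescaped.query))     # 'q=textTitle:'  (the # never reaches the server)
    print(select_url("textTitle:#"))  # http://localhost:8983/solr/select?q=textTitle%3A%23
```

Encoding fixes the transport problem only; whether the index contains a # term at all depends on how the field is analyzed.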

Re: # in query

2009-12-07 Thread Joel Nylund
ok thanks, sorry my brain wasn't working, but even when I url encode it, I don't get any results, is there something special I have to do for solr? thanks Joel On Dec 7, 2009, at 3:20 PM, Paul Libbrecht wrote: Sure you have to escape it! %23 otherwise the browser considers it as a

Re: # in query

2009-12-07 Thread Erick Erickson
Well, the very first thing I would do is examine the field definition in your schema file. I suspect that the tokenizers and/or filters you're using for indexing and/or querying are doing something to the # symbol. Most likely stripping it. If you're just searching for the single-letter term #, I
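
A toy illustration of how analysis can strip the symbol: the two functions below are stand-ins written for this example, not Solr's tokenizers, and which behavior your field actually has is something only the schema file can tell you.

```python
import re


def toy_word_tokenizer(text):
    """Keep only runs of letters, digits and underscore; punctuation is dropped."""
    return [token.lower() for token in re.findall(r"\w+", text)]


def toy_whitespace_tokenizer(text):
    """Split on whitespace only, so punctuation stays inside the tokens."""
    return text.lower().split()


if __name__ == "__main__":
    title = "Book #1"
    print(toy_word_tokenizer(title))        # the # is gone
    print(toy_whitespace_tokenizer(title))  # the # survives as part of a token
    print(toy_word_tokenizer("#"))          # nothing left to search for
```

If the indexed tokens never contain #, no query for it can match, however it is escaped.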

Re: Question regarding scoring/boosting

2009-12-07 Thread Chris Hostetter
Unfortunately understanding how Lucene/Scoring works isn't much of a beginner level topic -- the short answer to your question is that adding a function to the bf param of dismax causes that function to be evaluated for every doc that matches your main query and the scores are boosted in

Re: Solr plugin or something else for custom work?

2009-12-07 Thread Chris Hostetter
What you are describing corresponds pretty closely to some work currently in progress to make the DataImportHandler integrate with the ExtractingRequestHandler/Tika ... https://issues.apache.org/jira/browse/SOLR-1358 ...in the meantime, your options are either to extract all the

Exception encountered during replication on slave....Any clues?

2009-12-07 Thread William Pierce
Folks: I am seeing this exception in my logs that is causing my replication to fail. I start with a clean slate (empty data directory). I index the data on the postingsmaster using the dataimport handler and it succeeds. When the replication slave attempts to replicate it encounters this

Re: Exception encountered during replication on slave....Any clues?

2009-12-07 Thread TCK
are you missing the port number in the master's url? -tck On Mon, Dec 7, 2009 at 4:44 PM, William Pierce evalsi...@hotmail.com wrote: Folks: I am seeing this exception in my logs that is causing my replication to fail. I start with a clean slate (empty data directory). I index the

Oddly slow replication

2009-12-07 Thread Simon Wistow
I have a Master server with two Slaves populated via Solr 1.4 native replication. Slave1 syncs at a respectable speed i.e around 100MB/s but Slave2 runs much, much slower - the peak I've seen is 56KB/s. Both are running off the same hardware with the same config - compression is set to

Re: Exception encountered during replication on slave....Any clues?

2009-12-07 Thread William Pierce
tck, thanks for your quick response. I am running on the default port (8080). If I copy that exact string given in the masterUrl and execute it in the browser I get a response from solr: <?xml version="1.0" encoding="UTF-8"?> <response> <lst name="responseHeader"> <int name="status">0</int> <int

Re: Response writer configs

2009-12-07 Thread Chris Hostetter
: I guess we should remove these commented-out response writers from the : example solrconfig. They add no value. The comment tried to make it clear that it was showing what writers were enabled by default. But I changed it to be more in line with what we have for search components. -Hoss

Re: Exception encountered during replication on slave....Any clues?

2009-12-07 Thread William Pierce
Just to make doubly sure, per tck's suggestion, I went in and explicitly added in the port in the masterurl so that it now reads: http://localhost:8080/postingsmaster/replication Still getting the same exception... I am running solr 1.4, on Ubuntu karmic, using tomcat 6 and Java 1.6.

how to set CORE when using Apache Solr extension?

2009-12-07 Thread regany
Hello, Can anyone tell me how you set which Solr CORE to use when using the Apache Solr extension? (Using Solr with multicores) http://www.php.net/manual/en/book.solr.php thanks, regan -- View this message in context:

Re: Windows 7 / Java 64bit / solr 1.4 - solr.solr.home problem

2009-12-07 Thread Chris Hostetter
: I tried it on Vista 32 Java 1.6.0_17-b04 and it works without a problem. : Actually on all other computers in the office there is no problem - I am the : only one using Windows 7 : : I did try with full path and it didn't work as well. Here's the result: by the looks of it, your problem is

Re: Stopping Starting

2009-12-07 Thread regany
Lee Smith-6 wrote: So how can I stop and restart the service ? Hope you can help get me going again. Thank you Lee I found this shell script which works well for me... #!/bin/sh -e # Starts, stops, and restarts solr SOLR_DIR=/usr/local/solr/example JAVA_OPTIONS=-Xmx1024m

Re: edismax using bigrams instead of phrases?

2009-12-07 Thread Chris Hostetter
: I've started trying edismax, and have noticed that my relevancy ranking is : messed up with edismax because, according to the debug output, it's using : bigrams instead of phrases and inexplicably ignoring a couple of the pf I noticed that as well while testing edismax on the train the

Re: edismax using bigrams instead of phrases?

2009-12-07 Thread Chris Hostetter
: I see that edismax already defines pf (bigrams) and pf3 (trigrams) -- how : would folks think about just calling them pf / pf1 (aliases for each : other?), pf2, and pf3? The pf would then behave exactly as it does in : dismax. changing edismax's current parsing logic to be applied to a pf2

Re: latency in solr response is observed after index is updated

2009-12-07 Thread Chris Hostetter
: We are observing latency (sometimes huge latency, up to 10-20 secs) in solr : response after the index is updated. What's the reason for this latency and how : can it be minimized? : Note: our index size is pretty large. Please read the following wiki pages...

Re: Facet query with special characters

2009-12-07 Thread Chris Hostetter
: When performing a facet query where part of the value portion has a : special character (a minus sign in this case), the query returns zero : results unless I put a wildcard (*) at the end. check your analysis configuration for this fieldtype, in particular look at what debugQuery produces

Re: NullPointerException thrown during updates to index

2009-12-07 Thread Chris Hostetter
: Hi, : I'm running a distributed solr index (3 nodes) and have noticed frequent : exceptions thrown during updates. The exception (see below for full trace) what do you mean "during updates"? ... QueryComponent isn't used at all when updating the index, so there may be a misunderstanding here.

why no results?

2009-12-07 Thread regany
hi all - newbie solr question - I've indexed some documents and can search / receive results using the following schema - BUT ONLY when searching on the id field. If I try searching on the title, subtitle, body or text field I receive NO results. Very confused. Can anyone see anything

Re: why no results?

2009-12-07 Thread Tom Hill
Hi - That's a common one to get bit by. The string On Mon, Dec 7, 2009 at 7:44 PM, regany re...@newzealand.co.nz wrote: hi all - newbie solr question - I've indexed some documents and can search / receive results using the following schema - BUT ONLY when searching on the id field. If I

Re: why no results?

2009-12-07 Thread regany
Tom Hill-7 wrote: That's a common one to get bit by. The string You lost me Tom? I think your message got cut off. I'm guessing something to do with the string type?? -- View this message in context: http://old.nabble.com/why-no-results--tp26688249p26688295.html Sent from the Solr -

Re: why no results?

2009-12-07 Thread Tom Hill
Sorry, just discovered a keyboard shortcut for send. :-) That's a common one to get bit by. The fieldtype StrField indexes the entire field as one item. So you can only find it if your search term is everything in the field. That is, fox will not find The Quick Brown Fox, because it's not the
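
The difference described above can be mimicked with two toy matchers. They are a simplified model for illustration only: real text fields match according to whatever analyzer the schema configures, and the lowercasing and whitespace splitting here are assumptions.

```python
def string_field_matches(term, stored_value):
    """Model of an untokenized field: the whole value is a single indexed item."""
    return term == stored_value


def text_field_matches(term, stored_value):
    """Model of a tokenized field: the value is split into lowercased words."""
    return term.lower() in stored_value.lower().split()


if __name__ == "__main__":
    title = "The Quick Brown Fox"
    print(string_field_matches("fox", title))                  # False
    print(string_field_matches("The Quick Brown Fox", title))  # True
    print(text_field_matches("fox", title))                    # True
```

This is why searching on the id field worked in the original question: an id is normally queried with its complete value.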

Re: why no results?

2009-12-07 Thread regany
Tom Hill-7 wrote: Try solr.TextField instead. Thanks Tom, I've replaced the types section above with... <types> <fieldtype name="string" class="solr.TextField" sortMissingLast="true" omitNorms="true" /> </types> deleted my index, restarted Solr and re-indexed my documents - but the search

Re: Exception encountered during replication on slave....Any clues?

2009-12-07 Thread Noble Paul നോബിള്‍ नोब्ळ्
Are you able to hit http://localhost:8080/postingsmaster/replication using a browser from the slave box? If you are able to hit it, what do you see? On Tue, Dec 8, 2009 at 3:42 AM, William Pierce evalsi...@hotmail.com wrote: Just to make doubly sure, per tck's suggestion, I went in and
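
The same check can be run from a script on the slave box instead of a browser. This sketch assumes the master URL quoted in the thread and uses the replication handler's indexversion command; the helper names are invented for the example, and fetch is defined but not called, so nothing is sent until you call it yourself.

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def replication_command_url(master_url, command="indexversion"):
    """Append a replication handler command to the configured master URL."""
    return f"{master_url}?{urlencode({'command': command})}"


def fetch(url, timeout=5):
    """Return (HTTP status, body text); raises URLError if the master is unreachable."""
    with urlopen(url, timeout=timeout) as response:
        return response.status, response.read().decode("utf-8", "replace")


if __name__ == "__main__":
    url = replication_command_url("http://localhost:8080/postingsmaster/replication")
    print(url)
    # To actually query the master from the slave box:
    # print(fetch(url))
```

A connection error here points at networking or the URL, while a normal response suggests looking at the slave's own configuration and logs.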

Re: how to set CORE when using Apache Solr extension?

2009-12-07 Thread Noble Paul നോബിള്‍ नोब्ळ्
the core is a part of the uri http://host:port/solr-app/core-name/select say if the core name is core1 and solr app name is solr deployed at port 8983 then it would look like http://host:8983/solr/core1/select On Tue, Dec 8, 2009 at 3:44 AM, regany re...@newzealand.co.nz wrote: Hello, Can
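
The URL layout above can be captured in a one-line helper. The function name is made up for illustration; the pieces map directly onto host, port, solr-app and core-name from the message.

```python
def core_select_url(host, port, app, core):
    """Build the select URL for one core: http://host:port/solr-app/core-name/select."""
    return f"http://{host}:{port}/{app}/{core}/select"


if __name__ == "__main__":
    print(core_select_url("host", 8983, "solr", "core1"))
    # http://host:8983/solr/core1/select
```

With a client library, the usual equivalent is to include the core name in whatever path or base URL setting the client exposes.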

Re: Oddly slow replication

2009-12-07 Thread Noble Paul നോബിള്‍ नोब्ळ्
This has to be a network problem. We have never encountered such vastly different speeds in the same LAN. On Tue, Dec 8, 2009 at 3:22 AM, Simon Wistow si...@thegestalt.org wrote: I have a Master server with two Slaves populated via Solr 1.4 native replication. Slave1 syncs at a respectable

Replicating multiple cores

2009-12-07 Thread Jason Rutherglen
If I've got multiple cores on a server, I guess I need multiple rsyncd's running (if using the shell scripts)?

Re: Replicating multiple cores

2009-12-07 Thread Shalin Shekhar Mangar
On Tue, Dec 8, 2009 at 11:48 AM, Jason Rutherglen jason.rutherg...@gmail.com wrote: If I've got multiple cores on a server, I guess I need multiple rsyncd's running (if using the shell scripts)? Yes. I'd highly recommend using the Java replication though. -- Regards, Shalin Shekhar Mangar.