C++ type of analysis issues

2007-11-22 Thread Yu-Hui Jin
Hi, there, I haven't found any existing filter/tokenizer that can deal with "C++" type of search keywords. I'm using WordDelimiterFilter which removes the "++". One way I am thinking of right now is to use synonym filter before the WordDelimiterFilter to replace "c++" (after low-cased it) with s

Re: uniqueKey type

2007-11-06 Thread Yu-Hui Jin
Got it. Thanks, Mike. That explains. regards, -Hui On 11/6/07, Mike Klaas <[EMAIL PROTECTED]> wrote: > > On 6-Nov-07, at 2:31 PM, Yu-Hui Jin wrote: > > > We used a custom fieldType that segments the values of a field A into > > tokens. We then define uniqueKey

uniqueKey type

2007-11-06 Thread Yu-Hui Jin
We used a custom fieldType that segments the values of a field A into tokens. We then define uniqueKey as that field A. It seems when we update a document by solr, documents that have different values (one contains the other as a substring) gets collided into one so the result is the new documen

Re: customer request handler doesn't envok the query tokenization chain

2007-11-05 Thread Yu-Hui Jin
1:abc and field2:xyz , or it connects the tokens from "abc" and "xyz" for their respective field? thanks, -Hui On 11/4/07, Yonik Seeley <[EMAIL PROTECTED]> wrote: > > On 11/4/07, Yu-Hui Jin <[EMAIL PROTECTED]> wrote: > > Let's say we def

customer request handler doesn't envok the query tokenization chain

2007-11-04 Thread Yu-Hui Jin
hi, there, Here's a question regarding the relationship of a customer request handler and a customer field type. Let's say we defined a customer filed type that when querying and indexing, the solr.LowerCaseFilterFactory is used as the last filter to low-case all letters. In the Analysis UI, we

Re: get all query not working

2007-10-28 Thread Yu-Hui Jin
Please ignore this message. It's working now. Just found tomcat hasn't updated my webapps directory correctly; it still has the old 1.1. jars under the WEB-INF/lib.I deleted the webapp directory and restarted tomcat. It then worked. thanks, -Hui On 10/28/07, Yu-Hui J

get all query not working

2007-10-28 Thread Yu-Hui Jin
Hi, there, Is there an easy to find out which version of solr is running. I installed solr 1.2 and set up an instance using Tomcat. It was successful before. Today I set up on a different host. Somehow running the query "*:*" doesn't get me the correct results. Instead it showed parser error. (

Re: schema for response

2007-10-02 Thread Yu-Hui Jin
Got it. Thanks, Ryan. -Hui On 10/2/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: > > Yu-Hui Jin wrote: > > Hi, there, > > > > Given that there's some questions on the updated XML schema for the > response > > in Solr 1.2. Can someone points me to th

schema for response

2007-10-02 Thread Yu-Hui Jin
Hi, there, Given that there's some questions on the updated XML schema for the response in Solr 1.2. Can someone points me to the XML schema? Is it documented somewhere? I'm particularly interested in the different status code we would have in the response for either update or select. -- Rega

Re: rsync start and enable for multiple solr instances within one tomcat

2007-09-20 Thread Yu-Hui Jin
Bill, Thanks for the explanation. That helps my understanding on rsync and the replication in general. regards, -Hui On 9/20/07, Bill Au <[EMAIL PROTECTED]> wrote: > > The "solr" that you are referring to in your third question in the > name of the rsync area which is map to the solr data dire

Re: a bug in commit script?

2007-09-20 Thread Yu-Hui Jin
Got it. So what's the easiest way to get this patch? Sorry i'm new to this. regards, -Hui On 9/20/07, Bill Au <[EMAIL PROTECTED]> wrote: > > That would be my bad. I noticed the problem while fixing SOLR-282 > which is not related. I fixed both problems in stead of opening a > different bug fo

Re: rsync start and enable for multiple solr instances within one tomcat

2007-09-20 Thread Yu-Hui Jin
ok. Hoss. I think I'll believe you since nobody raised any issue running the script. And I'm about to try it out shortly with different solr home names. So just to help my knowledge, where does this virtual setting of this "solr" string happen? Should it be in some config file or sth? thanks,

Re: rsync start and enable for multiple solr instances within one tomcat

2007-09-20 Thread Yu-Hui Jin
Thanks, Hoss. For the last question, yes I understand now it's referring to whatever solr home we have named. However, there's still the last part of my question that feels suspicious why the "solr" string is directly coded in the script (unlike other cases they usually use ${solr_root} to get to

Re: rsync start and enable for multiple solr instances within one tomcat

2007-09-20 Thread Yu-Hui Jin
stand.) Can someone confirm my understanding? Does the #3 question suggests a hard-coded "solr" that shouldn't be? Thanks, -Hui On 9/19/07, Yu-Hui Jin <[EMAIL PROTECTED]> wrote: > > Hi, there, > > So we are using the Tomcat's JNDI method to set up multiple so

a bug in commit script?

2007-09-20 Thread Yu-Hui Jin
Hi, guys, It seems there's a small bug in the bin/commit script for solr 1.2. I was able to run snapinstaller successfully to install the index and open a new searcher. (This is verified by querying the new docs through the web admin UI.) However, the snapinstaller script failed due to the commit

Re: setting absolute path for snapshooter in solrconfig.xml doesn't work

2007-09-19 Thread Yu-Hui Jin
, if you're not setting environment variables either, > remove the following line as well: > > MYVAR=val1 > > Once you alter / remove those two lines, snapshooter should function as > expected. > > cheers, > Piete > > > > On 20/09/2007, Yu-Hui Jin <[

Re: setting absolute path for snapshooter in solrconfig.xml doesn't work

2007-09-19 Thread Yu-Hui Jin
r/bin/snapshooter > . > > In order to get the snapshooter working correctly. > > cheers, > Piete > > > > On 20/09/2007, Yu-Hui Jin <[EMAIL PROTECTED]> wrote: > > > > Hi, there, > > > > I used an absolute path for the "dir" par

setting absolute path for snapshooter in solrconfig.xml doesn't work

2007-09-19 Thread Yu-Hui Jin
Hi, there, I used an absolute path for the "dir" param in the solrconfig.xml as below: snapshooter /var/SolrHome/solr/bin true arg1 arg2 MYVAR=val1 However, I got "snapshooter: not found" exception thrown in catalina.out. I don't see why this doesn't

rsync start and enable for multiple solr instances within one tomcat

2007-09-19 Thread Yu-Hui Jin
Hi, there, So we are using the Tomcat's JNDI method to set up multiple solr instances within a tomcat server. Each instance has a solr home directory. Now we want to set up collection distribution for all these solr home indexes. My understanding is: 1. we only need to run rsync-start once use

Re: multiple solr home directories

2007-08-30 Thread Yu-Hui Jin
Thanks, Hoss, >> you still use a separate lib directory for each solr home and symlink each jar ... Just to make sure. you mean we can create a directory containing the shared jars, and each solr home/lib will symlink to the jar files in that directory. Right? Thanks, -Hui On 8/30/07, Chri

multiple solr home directories

2007-08-28 Thread Yu-Hui Jin
Hi, there, I have a few basic questions on setting up Solr home directories. * can we set up multiple Solr home directories within the same Solr instance? (I want to use the same Tomcat Solr instance to support indexing and searching over multiple independent indexes.) * If so, say I have some

Re: How to realize index spaces

2007-08-25 Thread Yu-Hui Jin
I'm interested in the answer as well. Here's what I inferred from the discussion (without probing into solr's source code): 1. there can be only one schema.xml in a "Solr Home" directory; 2. one Solr instance can only have one "Solr Home" directory. Is that the correct understanding? I really

Re: Embedded about 50% faster for indexing

2007-08-24 Thread Yu-Hui Jin
Sorry i'm new to the topic, can you point me to the Embedded approach? thanks, -Hui On 8/24/07, Sundling, Paul <[EMAIL PROTECTED]> wrote: > > Created two indexer implementations to test HTTP Post versus Embedded > and the performance was 54.6% faster on embedded. > > Thought others might find t

Re: how to retrieve all the documents in an index?

2007-08-19 Thread Yu-Hui Jin
BTW, Hoss, is there a default order for the documents returned by running this query? thanks, -Hui On 8/16/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: > > > : Any of you know whether the new "q:*.*" query performs better than the > : get-around solutions like using a ranged query? I would

Re: Solr 1.1. vs. 1.2.

2007-08-17 Thread Yu-Hui Jin
Thanks, Hoss! I would recommend go with 1.2. regards, -Hui On 8/17/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: > > > : I wonder what are the production experience of Solr 1.2 vs. 1.1. We are > : thinking of using 1.2 as opposed to 1.1 to support a mission critical > : application, but am no

Solr 1.1. vs. 1.2.

2007-08-17 Thread Yu-Hui Jin
Hi, there, I wonder what are the production experience of Solr 1.2 vs. 1.1. We are thinking of using 1.2 as opposed to 1.1 to support a mission critical application, but am not sure whether 1.2. is stable enough ( afraid of things like more features may introduce new bugs). However, 1.1. librar

Re: synchronizing slave indexes in distributing collections

2007-08-17 Thread Yu-Hui Jin
Thanks, guys. Glad to know the scripts work very well in your experience. (well, indeed they are quite simple.) So that's how I imagine we should do it except that you guys added a very good point -- that the monitoring system can invoke a script to take the slave out of the load balancer. I'd li

Re: how to retrieve all the documents in an index?

2007-08-17 Thread Yu-Hui Jin
Got it. Thanks Hoss! regards, -Hui On 8/16/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: > > > : Any of you know whether the new "q:*.*" query performs better than the > : get-around solutions like using a ranged query? I would guess so, but I > : haven't looked into the Lucene implementation.

synchronizing slave indexes in distributing collections

2007-08-16 Thread Yu-Hui Jin
Hi, there, We want to use Solr's Collection Distribution. Here's the question regarding recovery of failures of the scripts. To my understanding: * if the snapuller fails on a slave, we can possibly implement something like the master would examine the status messages from all slaves and notify

Re: how to retrieve all the documents in an index?

2007-08-15 Thread Yu-Hui Jin
Great. Thanks, guys. That's how what I thought. Any of you know whether the new "q:*.*" query performs better than the get-around solutions like using a ranged query? I would guess so, but I haven't looked into the Lucene implementation. regards, -Hui On 8/15/07, Yonik Seeley <[EMAIL PROTECTED

Re: how to retrieve all the documents in an index?

2007-08-13 Thread Yu-Hui Jin
y using q=*:* to match all documents in the index. > > Piete > > > > On 13/08/07, Yu-Hui Jin <[EMAIL PROTECTED]> wrote: > > > > Hi, there, > > > > I found the following post on the web. Is this still the simplest > > get-around > > to retriev

how to retrieve all the documents in an index?

2007-08-12 Thread Yu-Hui Jin
Hi, there, I found the following post on the web. Is this still the simplest get-around to retrieve all documents in an index? (I'm asking just in case I don't know there's a more standard way to do that now.) thanks, -Hui >From "Fuad Efendi" < [EMAIL PROTECTED]> Subject RE: MatchAl

Re: retrieving range of fields for the results

2007-08-08 Thread Yu-Hui Jin
Sounds good. Many thanks, guys! regards, -Hui On 8/8/07, Pieter Berkel <[EMAIL PROTECTED]> wrote: > > On 09/08/07, Mike Klaas <[EMAIL PROTECTED]> wrote: > > > > Faceting ignores pagenation/startat/maxresults/etc. > > > > > This is correct, the facet information returned is based on the entire >

Re: retrieving range of fields for the results

2007-08-08 Thread Yu-Hui Jin
no option to simply return the > minimum > / maximum facet values but it might be and idea for a future enhancement. > > cheers, > Piete > > > On 08/08/2007, Yu-Hui Jin <[EMAIL PROTECTED]> wrote: > > > > Piete, > > > > Thanks for the pointers an

Re: retrieving range of fields for the results

2007-08-07 Thread Yu-Hui Jin
elds such as publish_date, take a look at > https://issues.apache.org/jira/browse/SOLR-258 which was recently commited > to the svn trunk (although not much documentation on that yet). > > Just a note about your query, specifying the sort order in the q parameter > is depricated sy

retrieving range of fields for the results

2007-08-07 Thread Yu-Hui Jin
Hi, there, We used Solr to index a set of documents and provide free-text search functionality with sorting options on a number of fields. We have a requirement that along with each search result we want to obtain the ranges of a few fields for the resulting documents. Here's an example: Let's sa

Re: Admin page went down

2006-11-03 Thread Yu-Hui Jin
cool, I actually hope it's just my boxes. let me know. -Hui On 11/3/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: : I hope someone could give it a try to reproduce what I have seen. so that we : can proove it's not a problem for all. FYI: I've had the "java -jar start.jar" Jetty instance

Re: Admin page went down

2006-11-01 Thread Yu-Hui Jin
Thanks, Hoss. I actually know Jetty is quite a solid container. However, the admin page problem is simply wired. I hope someone could give it a try to reproduce what I have seen. so that we can proove it's not a problem for all. regards, -Hui On 11/1/06, Chris Hostetter <[EMAIL PROTECTED]> w

Re: Admin page went down

2006-11-01 Thread Yu-Hui Jin
No, there's no logs printed out at all. I have just tried it again. Posting queries through the URL still works. Have any of you tried keeping jetty running with the example solr on your box for a few days? If not, maybe you can try it. This is really wired to me. Because it happens to two of

Re: replication test problems

2006-11-01 Thread Yu-Hui Jin
Yap, Bill. The backslash-escaping one works for my zsh as well. And I'm sure you checked it's working for other major shells. So I would say backslash seems to be a good solution since you don't have to worry about double-single quotes. Thanks! regards, -Hui On 11/1/06, Bill Au <[EMAIL P

Re: replication test problems

2006-10-31 Thread Yu-Hui Jin
I found that if I add single quotes to the pattern as below, it worked on the command line of my zsh env, so I'll try add the quotes in the snappuller script and test it again. find /home/yjin/apps/solr-nightly/example/solr/data/ -name 'snapshot.*' -print /home/yjin/apps/solr-nightly/example/solr

Re: Admin page went down

2006-10-31 Thread Yu-Hui Jin
Hi, Hoss, Sorry my bad, I have two boxes, one working as master the other slave. Both have this admin down problems. I tried to send a couple queries just now to the master box and they worked. But the logs I sent you is from the slave box's screen. (I have restarted the master after I found i

Re: Admin page went down

2006-10-31 Thread Yu-Hui Jin
Hi Hoss, Here's all the logs printed on the screen after I started jetty solr server. I don't see anything suspicious. Is there any other logs that I should check? Thanks! integ-node-03456.sea3% java -jar start.jar java.net.BindException: Address already in use WARN: Not listening on monitor po

Admin page went down

2006-10-31 Thread Yu-Hui Jin
Hi, Solr folks, I kept the solr jetty server running on my box for a couple of days. Today I found I can no longer access the admin page. It gives the following error page: HTTP ERROR: 404 Not Found RequestURI=/solr/admin *Powered by Jetty:// * ---

Re: replication test problems

2006-10-30 Thread Yu-Hui Jin
Hi, Hoss, Thanks for the reply! For #2, I think I just need to setup the passwordless SSH with empty passphase. right? For #1: I'm using the following Enterprise version: Linux version 2.4.21-37a6 (gcc version 3.2.3 20030502 (Red Hat Linux 3.2.3-47)) I tried to run the find command find /hom

replication test problems

2006-10-30 Thread Yu-Hui Jin
Hi, solr folks, I followed the tutorial and everything worked well. I then read the collection and distribtution twiki: http://wiki.apache.org/solr/CollectionDistribution and tried to test replication using the scripts. What I did are the following: - downloaded solr-nightly binary to two diffe