Re: [POLL] - A new logo for Solr

2008-08-01 Thread Erik Hatcher
On Aug 1, 2008, at 8:41 PM, Nick Jenkin wrote: Is there an option to keep the current one? Good point. Here's my Solr Logo design: TO SOLR LOOP [ FORAGE ]

Re: Multicore DataDir

2008-08-01 Thread Ryan McKinley
Check: https://issues.apache.org/jira/browse/SOLR-646 hopefully that will solve your problems... On Aug 1, 2008, at 4:35 PM, CameronL wrote: The dataDir parameter specified in the element in multicore.xml does not seem to point to the correct directory. I commented out the element from

Re: fastest way to load documents

2008-08-01 Thread Otis Gospodnetic
Increase maxBufferedDocs and don't go above 50 with mergeFactor. What exactly the best numbers are depends on things I can't see from here. -Xmx5000m is good if you care for fast indexing, but not the best choice when searching. How fast you can index also depends on the size of your document

Re: [POLL] - A new logo for Solr

2008-08-01 Thread Nick Jenkin
Is there an option to keep the current one? -Nick On 8/2/08, Shalin Shekhar Mangar <[EMAIL PROTECTED]> wrote: > On Fri, Aug 1, 2008 at 8:39 PM, Yonik Seeley <[EMAIL PROTECTED]> wrote: > > > On Fri, Aug 1, 2008 at 10:44 AM, Shalin Shekhar Mangar > > <[EMAIL PROTECTED]> wrote: > > > The design wit

Re: Dynamic Boosting

2008-08-01 Thread Erik Hatcher
Are you using the dismax parser? (I try not to make assumptions that folks have /select mapped just like the example - which is mapped to Lucene/Solr query parser syntax) Add &defType=dismax And don't forget about good ol' &debugQuery=true ;) Erik On Aug 1, 2008, at 6:06 PM,

Multicore DataDir

2008-08-01 Thread CameronL
The dataDir parameter specified in the element in multicore.xml does not seem to point to the correct directory. I commented out the element from solrconfig.xml in all of my cores. I could explicitly set the for each core in all of the solrconfig.xml files, but I would rather only have to dea

Re: Dynamic Boosting

2008-08-01 Thread CameronL
Thanks Erik. That's a great idea, and it seems like this is exactly what the bq parameter was made for. Now the problem I seem to be having is that Solr seems to be ignoring the bq parameter altogether. My query looks something like: ...select?q=name:something&bq=Tier:1^100 Tier:2^75 Tier:3^50 E

Re: fastest way to load documents

2008-08-01 Thread Ian Connor
I am on fedora and just running with jetty (I guess that means it will not just use as much RAM as I have and I need to specify it when I load java). So, if I have 8GB RAM are you suggesting that I set the -Xmx 5000M or something large and then set merge to: 1 should I also increase any of t

Re: fastest way to load documents

2008-08-01 Thread Otis Gospodnetic
Configure Solr to use as much RAM as you can afford and not merge too often via mergeFactor. It's not clear (to me) from your explanation when you see 3000 docs/second and when only 100 docs/second. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message

fastest way to load documents

2008-08-01 Thread Ian Connor
I have a number of documents in files 1.xml 2.xml ... 17M.xml I have been using cat to join them all together: cat 1.xml 2.xml ... 1000.xml | grep -v '<\/add>' > /tmp/post.xml and posting them with curl: curl -d @/tmp/post.xml 'http://localhost:8983/solr/update' -H 'Content-Type: tex

Re: High load when updating many cores

2008-08-01 Thread Otis Gospodnetic
Doug, (old email) My initial guesses would be: * GC * Lucene segment merging happening on multiple indices during the same time (check IO before/after/during high load) * too frequent auto-commit (though this sounds least likely, as if this were the cause you'd see high load all the time) Maybe

Re: Multiple Update servers

2008-08-01 Thread Otis Gospodnetic
SAN (or NAS if you can live with its performance) is one way. Oh course, the SAN would have to have redundant hw, too - no use in having multiple and redundant disks if you only have 1 controller, for example. Another options (that I have not tried) is DRBD. Whatever the case, you need to hav

Re: Individual query limits for each search field value

2008-08-01 Thread Otis Gospodnetic
Hi, The fact that you need the cheapest price (personal nitpick: prices are not cheap - they can be low, which makes items cheap) I think you'd want to sort by the price field. As for showing N items for each city, you'd either want to grab a lot of docs in 1 query and then run another query wi

RE: Solr* != solr*

2008-08-01 Thread George Aroush
Hi Erik and all, I'm still trying to solve this issue and I like to know how others might have solved it in their client. I can't modify Solr / Lucene code and I'm using Solr 1.2. What I have done is simple. Given a user input, I break it into words and then analyze each word. Any word contain

Re: [POLL] - A new logo for Solr

2008-08-01 Thread Shalin Shekhar Mangar
On Fri, Aug 1, 2008 at 8:39 PM, Yonik Seeley <[EMAIL PROTECTED]> wrote: > On Fri, Aug 1, 2008 at 10:44 AM, Shalin Shekhar Mangar > <[EMAIL PROTECTED]> wrote: > > The design with the most number of total (first+second place) votes will > be > > accepted as the community's choice > > Thanks for sett

Re: [POLL] - A new logo for Solr

2008-08-01 Thread Yonik Seeley
On Fri, Aug 1, 2008 at 10:44 AM, Shalin Shekhar Mangar <[EMAIL PROTECTED]> wrote: > The design with the most number of total (first+second place) votes will be > accepted as the community's choice Thanks for setting this up Shalin. I'm not sure first+second votes should simply be added though (som

[POLL] - A new logo for Solr

2008-08-01 Thread Shalin Shekhar Mangar
Hello, As per discussions in http://www.nabble.com/Vote-on-a-new-solr-logo-td18560742.html three designs from the last poll have been shortlisted for another run-off, one each from the three font families (straight, normal and curvy). Please cast your vote at http://people.apache.org/~shalin/poll

Re: Vote on a new solr logo

2008-08-01 Thread Mark Miller
Lance Norskog wrote: One of Parkinson's Laws is that the most trivial item on the agenda receives the most attention. Don't doubt the law, don't doubt the logo is trivial, certainly doubt its receiving more attention than the code.