Re: Plans for a new Solr Python library

2008-03-24 Thread Christian Vogler
On Monday 24 March 2008 01:01:59 Leonardo Santagada wrote: I have done some modifications on the solr python client[1], and though we kept the same license and my work could be put back in solr I think if there are more people interested we could improve the module a lot. Have you taken a

Re: Plans for a new Solr Python library

2008-03-24 Thread Leonardo Santagada
On 24/03/2008, at 04:39, Christian Vogler wrote: On Monday 24 March 2008 01:01:59 Leonardo Santagada wrote: I have done some modifications on the solr python client[1], and though we kept the same license and my work could be put back in solr I think if there are more people interested we

Re: Help Requested

2008-03-24 Thread Norberto Meijome
On Thu, 20 Mar 2008 09:07:08 -0700 (PDT) Raghav Kapoor [EMAIL PROTECTED] wrote: [...] Any particular reason why need the server in this situation? pretty much everything you are doing can be done locally. Except, probably, cross linking between client's documents. I have no idea in

Re: Plans for a new Solr Python library

2008-03-24 Thread Leonardo Santagada
On 24/03/2008, at 09:11, Ed Summers wrote: On Mon, Mar 24, 2008 at 6:32 AM, Leonardo Santagada [EMAIL PROTECTED] wrote: Thanks, I think most of the stuff that I wanted to do is there... I will take a closer look and if there is omething missing I will add to that. Why is this on the issue

Re: off list: Plans for a new Solr Python library

2008-03-24 Thread Leonardo Santagada
On 24/03/2008, at 10:15, Christian Vogler wrote: Ok, so, let's get started. I made a few modifications to SOLR-216 that fix some unicode and timezone conversion issues, and I can upload them wherever we want to host the project. There is at an outstanding XML unicode bug that was discussed

Re: Plans for a new Solr Python library

2008-03-24 Thread Ed Summers
On Mon, Mar 24, 2008 at 12:13 PM, Yonik Seeley [EMAIL PROTECTED] wrote: AFAIK, no one has fixed the outstanding bugs and indicated it was ready to be committed. What is the preferred approach to fixing bugs in patches? Attaching new patches? //Ed

Re: Plans for a new Solr Python library

2008-03-24 Thread Yonik Seeley
On Mon, Mar 24, 2008 at 12:27 PM, Ed Summers [EMAIL PROTECTED] wrote: On Mon, Mar 24, 2008 at 12:13 PM, Yonik Seeley [EMAIL PROTECTED] wrote: AFAIK, no one has fixed the outstanding bugs and indicated it was ready to be committed. What is the preferred approach to fixing bugs in

missing content stream - simple tab file

2008-03-24 Thread tim robertson
Hi all, I am a newbie with SOLR, trying to index a very simple tab delimitted file (using a nightly build from a couple days ago). Any help would be greatly appreciated! My test tab file has only 3 lines: Passer domesticus 1787248 Passer domesticus (Linnaeus, 1758) 694 Passer domesticus

missing content stream - simple tab file

2008-03-24 Thread tim robertson
Hi all, I am a newbie with SOLR, trying to index a very simple tab delimitted file (using a nightly build from a couple days ago). Any help would be greatly appreciated! My test tab file has only 3 lines: Passer domesticus 1787248 Passer domesticus (Linnaeus, 1758) 694 Passer domesticus

Fwd: missing content stream - simple tab file

2008-03-24 Thread tim robertson
Ah, for some reason I am not receiving SOLR-user messages even though I am subscribed. If anyone has any ideas, can you please copy me in on the reply? Thanks -- Forwarded message -- From: tim robertson [EMAIL PROTECTED] Date: Mon, Mar 24, 2008 at 8:53 PM Subject: missing content

CJKTokenizer in Solr 1.3?

2008-03-24 Thread Vinci
Hi, I would like to ask, does any support of CJKTokenizer (org.apache.lucene.analysis.cjk.CJKTokenizer) available for Solr 1.3 now? If it is supported, which nightly build I can try and how can I turn it on? (I have nightly build up to 2008 Mar 8 on hand) If it is not supported, how can I use

Re: missing content stream - simple tab file

2008-03-24 Thread Chris Hostetter
Tim: double check that solr-user mail isn't showing up in your spam folder, you may need to whitelist it since it identifies itself as bulk mail. : And I am uploading using this command: : curl :

Re: missing content stream - simple tab file

2008-03-24 Thread tim robertson
Thanks, You are correct... ' ' around the URL solved it - schoolboy error thanks Tim On Mon, Mar 24, 2008 at 9:48 PM, Chris Hostetter [EMAIL PROTECTED] wrote: Tim: double check that solr-user mail isn't showing up in your spam folder, you may need to whitelist it since it identifies

Re: Converting lucene index into solr usable xml

2008-03-24 Thread Chris Hostetter
: How can we convert the lucene index file into format : that solr can understand. I have very little knowledge This seems to be the exact same question... http://www.nabble.com/Reusing-lucene-index-file-in-Solr-to16215877.html -Hoss

What are the limits? Billions of records anyone?

2008-03-24 Thread tim robertson
Hi all, I have just got a SOLR index working for the first time on a few 100,000 records from a custom database dump, and the results are very impressive, both in the speed it indexes (even on my macbook) and the response times. If I want to index what, where(grid based to 0.1 degree cells),

Re: Survey: How do you store your fields?

2008-03-24 Thread Chris Hostetter
: I'm curious: do you store everything in a database and just use Solr : for indexing/searching, or do you store everything in Solr so that : your search results come back with context? Or something in between? : (I know if you want highlighting you have to store those fields.) There are really

Re: Plans for a new Solr Python library

2008-03-24 Thread Leonardo Santagada
On 24/03/2008, at 15:34, Yonik Seeley wrote: On Mon, Mar 24, 2008 at 12:27 PM, Ed Summers [EMAIL PROTECTED] wrote: On Mon, Mar 24, 2008 at 12:13 PM, Yonik Seeley [EMAIL PROTECTED] wrote: AFAIK, no one has fixed the outstanding bugs and indicated it was ready to be committed. What is the

Re: What are the limits? Billions of records anyone?

2008-03-24 Thread Yonik Seeley
On Mon, Mar 24, 2008 at 5:30 PM, tim robertson [EMAIL PROTECTED] wrote: Is there any documentation on whether indexes can be partitioned easily, so scaling is somewhat linear? http://wiki.apache.org/solr/DistributedSearch It's very new, so you would need a recent nightly build. If you try

Re: Nightly build archives don't contain solrJ source?

2008-03-24 Thread Chris Hostetter
: : I download the zip for nightly Solr Source files for 03/18/2008. This doesnt : seem to contain : the corresponding client folder with the SolrJ files. Is there a way to get you are correct ... at teh moment none of the clients have their sources included in the builds ... i'll ope na bug so

Query Time Boosting

2008-03-24 Thread Amitha Talasila
Hi All, We have a requirement for our Project, there is a date field called start date in the schema. When a query is done Products whose start date is within 10 days from today's date need to be boosted by 100 points, and start date within 30 days should be boosted by 80 points and so.

Re: introduction and help!

2008-03-24 Thread Vinci
Hi David, Start Solr from jetty first, then check the following: 1. your war file is placed outside of the normal webapps 2. your content file, solr.xml is written properly. In the case you are not sure, use absolute path 3. copy and paste the trace after SEVERE, especially the last line. If you

Re: What are the limits? Billions of records anyone?

2008-03-24 Thread Vinci
Hi, 100,000 is not a big number in IR world. Lucene actually played some trick so people can do very big IR system with Lucene-based system. I have read blog post that people have much larger search document baseThe only concern you need to think of is to provide bigger heap size to JRE to