Re: Updating documents in index with some fields not stored

2006-05-11 Thread Andrzej Bialecki
Chun Wei Ho wrote: (2) I understand Luke is able to reconstruct the field so that the document can be re-inserted. Can someone give me a hint on how it's done and whether it's potentially too time-consuming for a large index (up to a million docs and too many terms to count)? Luke simply iterates over a

Re: Updating documents in index with some fields not stored

2006-05-11 Thread Andrzej Bialecki
Chun Wei Ho wrote: (2) I understand Luke is able to reconstruct the field so that the document can be re-inserted. Can someone give me a hint on how it's done and whether it's potentially too time-consuming for a large index (up to a million docs and too many terms to count)? Ah, I forgot to mention: wh

Redistribution Licensing

2006-05-11 Thread Gwyn Carwardine
Hi all, We're using dotLucene (vanilla) in our commercial product and we're redistributing dotLucene with our product. What do we need to do in terms of licensing - what notices do we need to include and where? Any help much appreciated Regards, Gwyn

Re: Redistribution Licensing

2006-05-11 Thread Erik Hatcher
Gwyn, You're asking in the wrong forum. dotLucene is not an Apache-hosted project. Lucene.Net (which is no longer based on anything but Java Lucene) is now in incubation here, but is not dotLucene. You need to check the license of the version of dotLucene you're using. Er

RE: Searching across spaces

2006-05-11 Thread Eric Isakson
You might consider using overlapping bi-gram tokenization with stripped out whitespace and a PhraseQuery. So your tokenized content, "spongebob squarepants", would look like: sp po on ng ge eb bo ob bs sq qu ua ar re ep pa an nt ts and your tokens for your query, "sponge bob", would look like
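
The tokenization described above can be sketched in plain Java without any Lucene classes. This is only an illustration under stated assumptions: the class name `BigramSketch` and both method names are hypothetical, lowercasing is an added assumption, and the contiguous-sublist check is a stand-in for what a PhraseQuery with no slop would test, not Lucene's own implementation. In a real index this logic would live in a custom analyzer and tokenizer.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

public class BigramSketch {
    // Strip all whitespace, then emit every overlapping two-character token.
    public static List<String> bigrams(String text) {
        String s = text.replaceAll("\\s+", "").toLowerCase();
        List<String> out = new ArrayList<>();
        for (int i = 0; i + 2 <= s.length(); i++) {
            out.add(s.substring(i, i + 2));
        }
        return out;
    }

    // Stand-in for an exact phrase match: the query tokens must appear
    // as one contiguous run inside the content tokens.
    public static boolean phraseMatches(String content, String query) {
        List<String> q = bigrams(query);
        return !q.isEmpty()
                && Collections.indexOfSubList(bigrams(content), q) >= 0;
    }

    public static void main(String[] args) {
        System.out.println(bigrams("spongebob squarepants"));
        System.out.println(bigrams("sponge bob"));
        System.out.println(phraseMatches("spongebob squarepants", "sponge bob"));
    }
}
```

Because whitespace is removed before the bi-grams are cut, "sponge bob" and "spongebob" produce the same token run, which is why the query can match across the space.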

Re: Searching across spaces

2006-05-11 Thread Maxym Mykhalchuk
Eric, IMHO the number of side-effects can be reduced by requiring "phrases": tokens for your query, "sponge bob", would look like "sp po on ng ge" eb "bo ob" Maxym == Maxym Mykhalchuk (+39) 320 8593170 PhD student at University of Trento, ITALY ==

RE: Redistribution Licensing

2006-05-11 Thread Gwyn Carwardine
Hi Erik. I'm sorry if I posted in the wrong place. I get very confused! However, dotLucene and the Highlighter are both distributed under the Apache version 2.0 licence (that's what it says in the licence.txt file that they come with). So if anyone on here understands the requirement of this Apache

Re: Redistribution Licensing

2006-05-11 Thread Erik Hatcher
IANAL, so I'll only refer you to the official ASF website on the license: particularly this one: On May 11, 2006, at 10:07 AM, Gwyn Carwardine wrote: Hi Er

Re: Searching across spaces

2006-05-11 Thread Rob Young
That sounds like just what I'm looking for. Do you know if this is covered in Lucene in Action, or where I can find more information about it? Eric Isakson wrote: You might consider using overlapping bi-gram tokenization with stripped out whitespace and a PhraseQuery. So your tokenized conten

RE: Searching across spaces

2006-05-11 Thread Eric Isakson
I think you will have to write a custom analyzer and tokenizer to produce the tokens you need, and you will have to arrange for whatever code creates your query to use that analyzer in the correct circumstances. I don't think I've seen anyone post about this particular use case be

Re: Searching across spaces

2006-05-11 Thread Otis Gospodnetic
Rob, look at the third hit: http://www.lucenebook.com/search?query=bi-grams Otis - Original Message From: Rob Young <[EMAIL PROTECTED]> > That sounds like just what I'm looking for. Do you know if this is > covered in Lucene in Action or where I can find more information about it. E

RE: wildcards in phrase searches

2006-05-11 Thread Lee_Gary
Thanks Erik, I was able to get a solution working for me using the classes you outlined below. -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Wednesday, May 10, 2006 12:11 PM To: java-user@lucene.apache.org Subject: Re: wildcards in phrase searches On May 10, 20

can't delete

2006-05-11 Thread John Powers
If I search on id:test bbb in Luke, I find some items. If I do ir.delete(new Term(IDENTITY, id)), the number of records changed is zero, and those records aren't deleted. If I print new Term(IDENTITY, id), it comes out as id:test bbb. What am I missing?

Can lucene do this?

2006-05-11 Thread Scott Smith
I'm building an application which has to provide "real-time" searching of emails as they come in. I have a number of search strings that I need to apply against each email as it comes in and then do something with the email based on which search string(s) get a hit. My initial thought was to

Re: Can lucene do this?

2006-05-11 Thread Erik Hatcher
Scott, Have a look at the MemoryIndex (in contrib/memory) - it is perfect for this sort of thing. You'd index each e-mail individually into a MemoryIndex and then run all the queries against it getting a hit or not. Erik On May 11, 2006, at 9:28 PM, Scott Smith wrote: I'm bui
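
The control flow suggested here (one small throwaway index per e-mail, every standing query run against it) can be sketched in plain Java. This is not MemoryIndex and does not use any Lucene API: the class `EmailMatchSketch`, its methods, and the example queries are all hypothetical, and each "query" is assumed to be a simple set of terms that must all be present. With Lucene, the term set would be replaced by a MemoryIndex and the containment check by real Query objects.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class EmailMatchSketch {
    // Hypothetical standing queries: name -> terms that must all be present.
    private final Map<String, Set<String>> queries = new LinkedHashMap<>();

    public void addQuery(String name, String... requiredTerms) {
        Set<String> terms = new HashSet<>();
        for (String t : requiredTerms) {
            terms.add(t.toLowerCase());
        }
        queries.put(name, terms);
    }

    // Build a throwaway "index" (here just a set of lowercased words) for
    // one e-mail, then test every standing query against it.
    public List<String> match(String emailText) {
        Set<String> index = new HashSet<>(
                Arrays.asList(emailText.toLowerCase().split("\\W+")));
        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, Set<String>> q : queries.entrySet()) {
            if (index.containsAll(q.getValue())) {
                hits.add(q.getKey());
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        EmailMatchSketch matcher = new EmailMatchSketch();
        matcher.addQuery("billing", "invoice", "overdue");
        matcher.addQuery("outage", "server", "down");
        System.out.println(matcher.match("Your invoice is overdue, please pay."));
    }
}
```

The per-e-mail index is discarded after the queries run, so nothing accumulates between messages; only the list of standing queries is kept.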

Re: Can lucene do this?

2006-05-11 Thread Chris Hostetter
: Have a look at the MemoryIndex (in contrib/memory) - it is perfect : for this sort of thing. You'd index each e-mail individually into a : MemoryIndex and then run all the queries against it getting a hit or alternately, if the "queries" you need to test each email against are fairly simple (i