Integrating Lucene with Cayenne

2006-04-18 Thread Andrus Adamchik
Hi, I've been thinking for some time about integrating Cayenne ORM engine with Lucene to provide database full text search capability that is db-agnostic and works off of the Cayenne metadata. Now with the Summer of Code approaching [1], this seems like a good idea of a student project.

using Plucene and Plucene::Simple

2006-04-18 Thread Dean Hoover
I have developed some simple programs in java form creating lucene indexes and then using them to search. I now have an interest in using perl to use the indexes for search purposes, but am having trouble doing even the simplest thing using Plucene and Plucene::Simple. If anyone here can help,

Re: Integrating Lucene with Cayenne

2006-04-18 Thread karl wettin
18 apr 2006 kl. 11.11 skrev Andrus Adamchik: I've been thinking for some time about integrating Cayenne ORM engine with Lucene to provide database full text search capability that is db-agnostic and works off of the Cayenne metadata. Now with the Summer of Code approaching [1], this

Re: Integrating Lucene with Cayenne

2006-04-18 Thread Andrus Adamchik
Thanks, interesting... Certainly writing it from scratch doesn't look trivial. Since Compass supports integration with many other ORMs, I guess we may concentrate on writing a Compass Device for Cayenne then. Andrus On Apr 18, 2006, at 1:24 PM, karl wettin wrote: 18 apr 2006 kl. 11.11

Re: using Plucene and Plucene::Simple

2006-04-18 Thread Marvin Humphrey
On Apr 18, 2006, at 2:20 AM, Dean Hoover wrote: I have developed some simple programs in java form creating lucene indexes and then using them to search. I now have an interest in using perl to use the indexes for search purposes, but am having trouble doing even the simplest thing using

NearSpan

2006-04-18 Thread karl wettin
I don't understand what it means when this exception is thrown. I'm quite certain that it is my skipTo(target)-code that causes it , but I don't understand why. The FIXME and error message makes me think it might be that I deliver the data ordered and it was not expected to be? private

Re: NearSpan

2006-04-18 Thread Paul Elschot
Karl, On Tuesday 18 April 2006 17:48, karl wettin wrote: I don't understand what it means when this exception is thrown. I'm quite certain that it is my skipTo(target)-code that causes it , but I don't understand why. The FIXME and error message makes me think it might be that I

Re: NearSpan

2006-04-18 Thread Michael Goddard
Karl, If you get this one figured out, I'd be very happy. Unless I replace the line reading throw new RuntimeException(Unexpected: ordered); with this one return false; I see plenty of these exceptions being thrown. I look forward to how this plays out and if you are able to come

how to match Documents from Hits with Documents from Query Spans?

2006-04-18 Thread Boris Galitsky
Hello I am using span queries to get hits (Documents) and occurrences (positions) of search terms within these documents. For some reason, there is a disagreement between the order the Documents are returned in hits, and the Documents are referenced (via order number, starting from 0) in

Re: how to match Documents from Hits with Documents from Query Spans?

2006-04-18 Thread Chris Hostetter
: For some reason, there is a disagreement between the order the : Documents are returned in hits, and the Documents are referenced (via : order number, starting from 0) in the Spans? When dealing with a Hits instance, documents are iterated over in results order -- which may be by score, or may

Re: using Plucene and Plucene::Simple

2006-04-18 Thread Chris Hostetter
As marvin mentioned, there are some UTF-8 incompatabilities between java lucene and Plucene. Incidently: your best bet for getting assistence with Plucene is the Plucene mailing lists, as identified at the bottom of perldoc Plucene ... http://kasei.com/mailman/listinfo/plucene ...perl

how to show snippets of search reult

2006-04-18 Thread wenjie zheng
I have this general question about how to generate snippets (small piece of information with search keywords in it) for any search results. Is it something can be done by Lucene? Or does it require other algorithms/packages? Thanks, Wenjie

Re: how to show snippets of search reult

2006-04-18 Thread Erik Hatcher
Have a look at the Highlighter contrib library. It is used for the search results here, for example: http://www.lucenebook.com/search?query=highlighter On Apr 18, 2006, at 1:50 PM, wenjie zheng wrote: I have this general question about how to generate snippets (small

How to get Document (or filename) from Span

2006-04-18 Thread Boris Galitsky
Thanks a lot Hoss The question is when I get Spans, I get start/end positions and a Document order (starting from 0), not the Document object itself from which I could get a filename. Since I believe there is no way to get a Document object from Spans, and there is no such thing as Document

Re: How to get Document (or filename) from Span

2006-04-18 Thread Chris Hostetter
: The question is when I get Spans, I get start/end positions and a : Document order (starting from 0), not the Document object itself from Are you sure about that? Spans.doc() should return you the internal document Identifier which you can pass to indexReader.doc(int) : which I could get a

Re: How to get Document (or filename) from Span

2006-04-18 Thread Grant Ingersoll
The doc() number can be given to IndexReader.document() to get the Document, I believe. Boris Galitsky wrote: Thanks a lot Hoss The question is when I get Spans, I get start/end positions and a Document order (starting from 0), not the Document object itself from which I could get a

Re: How to get Document (or filename) from Span

2006-04-18 Thread Boris Galitsky
I fully understand now. Thanks a lot Boris On Tue, 18 Apr 2006 11:10:20 -0700 (PDT) Chris Hostetter [EMAIL PROTECTED] wrote: : The question is when I get Spans, I get start/end positions and a : Document order (starting from 0), not the Document object itself from Are you sure about that?

Re: how to show snippets of search reult

2006-04-18 Thread wenjie zheng
That's what I need. Thanks On 4/18/06, Erik Hatcher [EMAIL PROTECTED] wrote: Have a look at the Highlighter contrib library. It is used for the search results here, for example: http://www.lucenebook.com/search?query=highlighter On Apr 18, 2006, at 1:50 PM, wenjie zheng wrote:

Re: Using Lucene for searching tokens, not storing them.

2006-04-18 Thread karl wettin
17 apr 2006 kl. 08.16 skrev karl wettin: The code contains lots of things that can be optimized for both memory and CPU. Pretty sure it can be cranked down to use a fraction of the ticks spent by a RAMDirectory. I aim at 1/3. I'm not sure if you people are as amazed as me by this, so I'll

Re: Using Lucene for searching tokens, not storing them.

2006-04-18 Thread Doug Cutting
karl wettin wrote: I'm not sure if you people are as amazed as me by this, so I'll just keep posting reports until someone tells me not to. :-) Keep it up! After adding a couple of binary searches in well needed places (and a couple of new bugs that in a few cases affects the results) I'm

Re: Using Lucene for searching tokens, not storing them.

2006-04-18 Thread karl wettin
18 apr 2006 kl. 22.18 skrev Doug Cutting: Will you be able to contribute this to Apache? Of course. I'll pop it in the Jira as soon it passes all tests. If someone wants to take a look right now, let me know. Right now it's more of a branch than a couple of diffs. I might be able to

[jira] Created: (LUCENENET-3) Index optimization problem

2006-04-18 Thread Steve Berteau (JIRA)
Index optimization problem -- Key: LUCENENET-3 URL: http://issues.apache.org/jira/browse/LUCENENET-3 Project: Lucene.NET Type: Bug Environment: Windows Server 2003 Reporter: Steve Berteau We are using Lucene .NET Beta 1.9 version 1.

[jira] Created: (LUCENENET-4) Index Optimization

2006-04-18 Thread Steve Berteau (JIRA)
Index Optimization --- Key: LUCENENET-4 URL: http://issues.apache.org/jira/browse/LUCENENET-4 Project: Lucene.NET Type: Bug Environment: Windows server 2003 Reporter: Steve Berteau We are using Lucene .NET Beta 1.9 version 1. We have been