SentenceHighlighter

2006-04-19 Thread Anton Feldmann
Hi, I am writing an application to find words in a sentence. I began with the Formatter: import org.apache.lucene.search.highlight.Formatter; import org.apache.lucene.search.highlight.TokenGroup; public class SimpleSentenceFormatter implements Formatter{ String tag; /**

Updating existing documents (for the thousandth time I'm sure).

2006-04-19 Thread Darren Vengroff
Hi, I know this discussion probably comes up all the time, but I need to tackle it. There is a closed JIRA issue at http://issues.apache.org/jira/browse/LUCENE-166?page=all that indicates that there may be some patches floating around to support updating individual fields. Does anyone know w

Re: SentenceHighlighter

2006-04-19 Thread mark harwood
If you want to select highlights from a document where only whole sentences are selected as fragments, you will need to implement a custom Fragmenter class. This will need to look for sentence boundaries, e.g. a "." followed only by whitespace, then a word with an uppercase first character. I
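
The boundary heuristic described in this reply (a "." followed only by whitespace, then an uppercase letter) can be sketched in plain Java. This is an illustrative sketch only, not the poster's code: the class name SentenceBoundaries and its methods are hypothetical, and how the offsets would be fed into a Lucene Fragmenter implementation is deliberately left out. Abbreviations followed by a capitalized word would still be split incorrectly.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: finds sentence starts using the heuristic
// "." + whitespace + uppercase letter. Not part of Lucene.
public class SentenceBoundaries {

    /** Returns the offsets where a sentence starts; offset 0 is included for non-empty text. */
    public static List<Integer> sentenceStarts(String text) {
        List<Integer> starts = new ArrayList<>();
        if (text.isEmpty()) {
            return starts;
        }
        starts.add(0);
        int i = 0;
        while (i < text.length()) {
            if (text.charAt(i) == '.') {
                // Skip the run of whitespace after the period.
                int j = i + 1;
                while (j < text.length() && Character.isWhitespace(text.charAt(j))) {
                    j++;
                }
                boolean sawWhitespace = j > i + 1;
                // A boundary needs at least one whitespace char and then an uppercase letter.
                if (sawWhitespace && j < text.length() && Character.isUpperCase(text.charAt(j))) {
                    starts.add(j);
                }
                i = j;
            } else {
                i++;
            }
        }
        return starts;
    }

    /** Cuts the text at the detected sentence starts and trims each piece. */
    public static List<String> split(String text) {
        List<Integer> starts = sentenceStarts(text);
        List<String> sentences = new ArrayList<>();
        for (int k = 0; k < starts.size(); k++) {
            int end = (k + 1 < starts.size()) ? starts.get(k + 1) : text.length();
            sentences.add(text.substring(starts.get(k), end).trim());
        }
        return sentences;
    }

    public static void main(String[] args) {
        for (String s : split("Lucene is fast. It indexes text, e.g. this case. Done.")) {
            System.out.println(s);
        }
    }
}
```

Because "e.g." is followed by a lowercase word in the sample text, it is not treated as a boundary, so the sample yields three sentences.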

[jira] Commented: (LUCENE-496) New tool for reseting the (length)norm of fields after changing Similarity

2006-04-19 Thread Hoss Man (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-496?page=comments#action_12375177 ] Hoss Man commented on LUCENE-496: - FYI: I picked the package to match the other classes in contrib/miscellaneous ... I don't think it matters very much what package it lives i

[jira] Created: (LUCENE-546) Index corruption when using RAMDirectory( Directory) constructor

2006-04-19 Thread peter royal (JIRA)
Index corruption when using RAMDirectory( Directory) constructor Key: LUCENE-546 URL: http://issues.apache.org/jira/browse/LUCENE-546 Project: Lucene - Java Type: Bug Versions: 1.9 Reporter

[jira] Updated: (LUCENE-546) Index corruption when using RAMDirectory( Directory) constructor

2006-04-19 Thread peter royal (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-546?page=all ] peter royal updated LUCENE-546: --- Attachment: RAMDirectory-truncate.diff > Index corruption when using RAMDirectory( Directory) constructor > --

deep index comparation

2006-04-19 Thread karl wettin
I want to pass two IndexReaders to a test that checks all data in depth: settings, values, vectors, positions, etc. Is there such a thing?

[jira] Created: (LUCENE-547) Directory implementation for Applets

2006-04-19 Thread Andre Schild (JIRA)
Directory implementation for Applets Key: LUCENE-547 URL: http://issues.apache.org/jira/browse/LUCENE-547 Project: Lucene - Java Type: Improvement Components: Store Versions: 1.9 Environment: Applets Reporter: An

[jira] Updated: (LUCENE-547) Directory implementation for Applets

2006-04-19 Thread Andre Schild (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-547?page=all ] Andre Schild updated LUCENE-547: Attachment: AppletDirectory.zip > Directory implementation for Applets > > > Key: LUCENE-547 > URL: http:

[jira] Commented: (LUCENE-547) Directory implementation for Applets

2006-04-19 Thread Andre Schild (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-547?page=comments#action_12375211 ] Andre Schild commented on LUCENE-547: - The classes are based on the RAMxxx classes, perhaps some of them could be reused instead of creating new ones. Currently we have t

[jira] Created: (LUCENE-548) Sort bug using ParallelMultiSearcher

2006-04-19 Thread dan (JIRA)
:61) at org.apache.lucene.search.MultiSearcherThread.run(ParallelMultiSearcher.java:271) >>>>> Input: - This only occurs when searching more than one index using ParallelMultiSearcher - I use the signature new Sort( "date", true) - The values in dates are strings in th

[jira] Created: (LUCENE-549) Sort bug with ParallelMultiSearcher

2006-04-19 Thread dan (JIRA)
:61) at org.apache.lucene.search.MultiSearcherThread.run(ParallelMultiSearcher.java:271) >>>>> Input: - This only occurs when searching more than one index using ParallelMultiSearcher - I use the signature new Sort( "date", true) - The values in dates are strings in th

Re: Using Lucene for searching tokens, not storing them.

2006-04-19 Thread karl wettin
On 18 Apr 2006 at 22:08, karl wettin wrote: After adding a couple of binary searches in much-needed places (and a couple of new bugs that in a few cases affect the results) I'm now down to 1/8th of the time compared to RAMDirectory. That is really fast if you ask me. After fixing the bugs,

Re: Using Lucene for searching tokens, not storing them.

2006-04-19 Thread karl wettin
On 20 Apr 2006 at 07:29, karl wettin wrote: On 18 Apr 2006 at 22:08, karl wettin wrote: After adding a couple of binary searches in much-needed places (and a couple of new bugs that in a few cases affect the results) I'm now down to 1/8th of the time compared to RAMDirectory. That is really

[jira] Created: (LUCENE-550) InstanciatedIndex - faster but memory consuming index

2006-04-19 Thread Karl Wettin (JIRA)
InstanciatedIndex - faster but memory consuming index - Key: LUCENE-550 URL: http://issues.apache.org/jira/browse/LUCENE-550 Project: Lucene - Java Type: New Feature Components: Store Versions: 1.9

[jira] Updated: (LUCENE-550) InstanciatedIndex - faster but memory consuming index

2006-04-19 Thread Karl Wettin (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-550?page=all ] Karl Wettin updated LUCENE-550: --- Attachment: InstanciatedIndex.java > InstanciatedIndex - faster but memory consuming index > - > > Ke

[jira] Updated: (LUCENE-550) InstanciatedIndex - faster but memory consuming index

2006-04-19 Thread Karl Wettin (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-550?page=all ] Karl Wettin updated LUCENE-550: --- Attachment: Term.java > InstanciatedIndex - faster but memory consuming index > - > > Key: LUCENE-550

[jira] Updated: (LUCENE-550) InstanciatedIndex - faster but memory consuming index

2006-04-19 Thread Karl Wettin (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-550?page=all ] Karl Wettin updated LUCENE-550: --- Attachment: Document.java > InstanciatedIndex - faster but memory consuming index > - > > Key: LUCENE