Re: Fuzzy search change

2009-06-19 Thread Varun Dhussa
Hi, I can port the code to java. I do not know the Lucene file structures etc. as of now. So if someone with experience on that to store trigrams and index them is can work on that part, I can port the rest of the code. Regards Varun Dhussa Product Architect CE InfoSystems (P) Ltd http://www.ma

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722145#action_12722145 ] Shai Erera commented on LUCENE-1703: Right ... forgot the merges to the uncommitted se

Re: javadoc language

2009-06-19 Thread Otis Gospodnetic
+1 for English as much as I like language variety. Otis - Original Message > From: Grant Ingersoll > To: java-dev@lucene.apache.org > Sent: Friday, June 19, 2009 7:33:05 PM > Subject: Re: javadoc language > > I think they should be in English. Keeping the Chinese would be fine as

Re: javadoc language

2009-06-19 Thread Grant Ingersoll
I think they should be in English. Keeping the Chinese would be fine as well, but seems kind of pointless given all the other javadocs are in English. On Jun 19, 2009, at 12:49 PM, Robert Muir wrote: While hunting down some strange behavior in SmartChineseAnalyzer, I noticed the javadocs

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722069#action_12722069 ] Jason Rutherglen commented on LUCENE-1703: -- Seems like a useful feature. > Add a

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722065#action_12722065 ] Uwe Schindler commented on LUCENE-1703: --- bq. I still don't understand how if autocom

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722060#action_12722060 ] Earwin Burrfoot commented on LUCENE-1701: - bq. Someday maybe I'll convince you to

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722057#action_12722057 ] Shai Erera commented on LUCENE-1703: {quote} one thread may call addDocument() (or may

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722040#action_12722040 ] Michael McCandless commented on LUCENE-1703: There are also "merges" that take

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722037#action_12722037 ] Tim Smith commented on LUCENE-1703: --- NOTE: I'm always using autoCommit=false (autoCommit

[jira] Commented: (LUCENE-1705) Add deleteAllDocuments() method to IndexWriter

2009-06-19 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722021#action_12722021 ] Tim Smith commented on LUCENE-1705: --- My use case is like so: * IndexReader opened again

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722019#action_12722019 ] Shai Erera commented on LUCENE-1703: {quote} MergeScheduler does not provide a sync()

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722017#action_12722017 ] Michael McCandless commented on LUCENE-1701: bq. That is what I was talking ab

[jira] Commented: (LUCENE-1705) Add deleteAllDocuments() method to IndexWriter

2009-06-19 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722006#action_12722006 ] Shai Erera commented on LUCENE-1705: My search app has such a scenario, and currently

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722002#action_12722002 ] Uwe Schindler commented on LUCENE-1701: --- That is what I was talking about all the ti

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722003#action_12722003 ] Tim Smith commented on LUCENE-1703: --- MergeScheduler does not provide a sync() method in

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721993#action_12721993 ] Michael McCandless commented on LUCENE-1701: {quote} bq. new NumericSortField

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721989#action_12721989 ] Michael McCandless commented on LUCENE-1701: bq. It's not sufficient for every

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721988#action_12721988 ] Uwe Schindler commented on LUCENE-1701: --- I aggree with Yonik, this is too much magic

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721987#action_12721987 ] Shai Erera commented on LUCENE-1703: I'm just playing the devil's advocate here. I'm

[jira] Commented: (LUCENE-961) RegexCapabilities is not Serializable

2009-06-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721985#action_12721985 ] Mark Miller commented on LUCENE-961: +1 Lets add a no arg constructor to the impls and

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721983#action_12721983 ] Yonik Seeley commented on LUCENE-1701: -- bq. new NumericSortField("price"); Magic. Ho

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721979#action_12721979 ] Michael McCandless commented on LUCENE-1701: I still think we should make Nume

[jira] Created: (LUCENE-1705) Add deleteAllDocuments() method to IndexWriter

2009-06-19 Thread Tim Smith (JIRA)
Add deleteAllDocuments() method to IndexWriter -- Key: LUCENE-1705 URL: https://issues.apache.org/jira/browse/LUCENE-1705 Project: Lucene - Java Issue Type: Wish Components: Index Aff

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721972#action_12721972 ] Yonik Seeley commented on LUCENE-1701: -- bq. Because we've decided that this is ou

[jira] Updated: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Smith updated LUCENE-1703: -- Attachment: IndexWriter.java.diff Very minor change moved assert on mergingSegments.size() into waitFo

Re: caching an indexreader

2009-06-19 Thread Jason Rutherglen
On the topic of RAM consumption, it seems like field caches could return estimated RAM usage (given they're arrays of standard Java types)? There's methods of calculating per platform (I believe relatively accurately). On Fri, Jun 19, 2009 at 12:11 PM, Michael McCandless < luc...@mikemccandless.co

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721961#action_12721961 ] Michael McCandless commented on LUCENE-1701: bq. Static factories are cool (th

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721956#action_12721956 ] Michael McCandless commented on LUCENE-1701: bq. Here is a first draft of Nume

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721954#action_12721954 ] Tim Smith commented on LUCENE-1703: --- My primary use case for this is to stabilize an ind

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721951#action_12721951 ] Shai Erera commented on LUCENE-1703: May I ask what's the use case for this? I looked

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721949#action_12721949 ] Michael McCandless commented on LUCENE-1703: Patch looks good! I don't think

Shouldn't IndexWriter.commit(Map) accept Properties instead?

2009-06-19 Thread Shai Erera
It really assumes a String, String map ... Is it just because Properties is synced? If so, then when moving to 1.5 we should declare the Map with Map because currently if anyone will pass anything other than Strings, the code will fail with a ClassCastException in ChecksumIndexOutput.writeStringSt

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721946#action_12721946 ] Uwe Schindler commented on LUCENE-1701: --- bq. ut any new field cache API should be po

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721938#action_12721938 ] Tim Smith commented on LUCENE-1703: --- I'm finding it a bit tricky to create a proper unit

[jira] Updated: (LUCENE-1704) org.apache.lucene.ant.HtmlDocument added Tidy config file passthrough availability

2009-06-19 Thread Keith Sprochi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Keith Sprochi updated LUCENE-1704: -- Description: Parsing HTML documents using the org.apache.lucene.ant.HtmlDocument.Document met

[jira] Updated: (LUCENE-1704) org.apache.lucene.ant.HtmlDocument added Tidy config file passthrough availability

2009-06-19 Thread Keith Sprochi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Keith Sprochi updated LUCENE-1704: -- Description: Parsing HTML documents using the org.apache.lucene.ant.HtmlDocument.Document met

[jira] Commented: (LUCENE-1313) Near Realtime Search

2009-06-19 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721933#action_12721933 ] Jason Rutherglen commented on LUCENE-1313: -- On second thought, the previous idea

[jira] Created: (LUCENE-1704) org.apache.lucene.ant.HtmlDocument added Tidy config file passthrough availability

2009-06-19 Thread Keith Sprochi (JIRA)
org.apache.lucene.ant.HtmlDocument added Tidy config file passthrough availability -- Key: LUCENE-1704 URL: https://issues.apache.org/jira/browse/LUCENE-1704 Project: Luc

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721929#action_12721929 ] Yonik Seeley commented on LUCENE-1701: -- The exception certainly is a hack - but any n

[jira] Commented: (LUCENE-1313) Near Realtime Search

2009-06-19 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721927#action_12721927 ] Jason Rutherglen commented on LUCENE-1313: -- Using a single segmentInfos in IW see

[jira] Updated: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Smith updated LUCENE-1703: -- Attachment: IndexWriter.java.diff Here's a diff for IndexWriter.java moved code from else block in fi

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721920#action_12721920 ] Uwe Schindler commented on LUCENE-1701: --- When this comes (payloads, CSF,...) we will

[jira] Resolved: (LUCENE-1405) Support for new Resources model in ant 1.7 in Lucene ant task.

2009-06-19 Thread Erik Hatcher (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erik Hatcher resolved LUCENE-1405. -- Resolution: Fixed Przemyslaw - apologies for the delay in addressing this valuable patch. It'

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721915#action_12721915 ] Yonik Seeley commented on LUCENE-1701: -- Regardless of the fact that plain_int parser

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721914#action_12721914 ] Uwe Schindler commented on LUCENE-1701: --- Yonik, I will explain my intention: The rea

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721913#action_12721913 ] Tim Smith commented on LUCENE-1703: --- I'm not super familiar with internals of IndexWrite

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721911#action_12721911 ] Michael McCandless commented on LUCENE-1703: bq. ideally, the IndexWriter woul

[jira] Commented: (LUCENE-1692) Contrib analyzers need tests

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721910#action_12721910 ] Michael McCandless commented on LUCENE-1692: OK I committed them. Thanks for

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721908#action_12721908 ] Tim Smith commented on LUCENE-1703: --- thought maybe that method would do it, however that

[jira] Commented: (LUCENE-1692) Contrib analyzers need tests

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721906#action_12721906 ] Michael McCandless commented on LUCENE-1692: Duh, I forgot to svn add them! S

[jira] Commented: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721903#action_12721903 ] Michael McCandless commented on LUCENE-1703: You can use ConcurrentMergeSchedu

[jira] Created: (LUCENE-1703) Add a waitForMerges() method to IndexWriter

2009-06-19 Thread Tim Smith (JIRA)
Add a waitForMerges() method to IndexWriter --- Key: LUCENE-1703 URL: https://issues.apache.org/jira/browse/LUCENE-1703 Project: Lucene - Java Issue Type: Improvement Components: Index Af

[jira] Commented: (LUCENE-1692) Contrib analyzers need tests

2009-06-19 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721888#action_12721888 ] Robert Muir commented on LUCENE-1692: - michael, I updated my svn and I think you might

[jira] Commented: (LUCENE-1702) Thai token type() bug

2009-06-19 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721884#action_12721884 ] Robert Muir commented on LUCENE-1702: - Steven, thanks for the information, and the ran

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721880#action_12721880 ] Yonik Seeley commented on LUCENE-1701: -- Having the trie parsers public is good (or pu

[jira] Commented: (LUCENE-1639) intermittent failure in TestIndexWriter. testRandomIWReader

2009-06-19 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721881#action_12721881 ] Jason Rutherglen commented on LUCENE-1639: -- Great work Mike! I wonder if I was s

[jira] Commented: (LUCENE-1702) Thai token type() bug

2009-06-19 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721877#action_12721877 ] Steven Rowe commented on LUCENE-1702: - bq. I think for this issue it would be best to

javadoc language

2009-06-19 Thread Robert Muir
While hunting down some strange behavior in SmartChineseAnalyzer, I noticed the javadocs are in Chinese. some of these do not reflect the method params, which makes it a little harder to work with. /** * 设计上是SentenceTokenizer的下一处理层。将SentenceTokenizer的句子读出, * 利用HHMMSegment主程序将句子分词,然后将分词结果返

[jira] Updated: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1701: -- Attachment: NumericField.java Here is a first draft of NumericField with the same handling as

[jira] Commented: (LUCENE-1702) Thai token type() bug

2009-06-19 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721867#action_12721867 ] Robert Muir commented on LUCENE-1702: - Steven, even without >BMP support, 1.5 branch w

[jira] Commented: (LUCENE-1702) Thai token type() bug

2009-06-19 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721859#action_12721859 ] Steven Rowe commented on LUCENE-1702: - bq. Steven I have been watching that jflex 1.5

[jira] Commented: (LUCENE-1583) SpanOrQuery skipTo() doesn't always move forwards

2009-06-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721858#action_12721858 ] Mark Miller commented on LUCENE-1583: - That change would always call next once before

[jira] Commented: (LUCENE-1702) Thai token type() bug

2009-06-19 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721840#action_12721840 ] Robert Muir commented on LUCENE-1702: - Steven I have been watching that jflex 1.5 bran

[jira] Commented: (LUCENE-1702) Thai token type() bug

2009-06-19 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721833#action_12721833 ] Steven Rowe commented on LUCENE-1702: - +1 (I was involved in perpetuating the Thai gra

[jira] Resolved: (LUCENE-1692) Contrib analyzers need tests

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1692. Resolution: Fixed Thanks Robert! > Contrib analyzers need tests > ---

[jira] Issue Comment Edited: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721830#action_12721830 ] Earwin Burrfoot edited comment on LUCENE-1701 at 6/19/09 8:50 AM: --

[jira] Issue Comment Edited: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721827#action_12721827 ] Uwe Schindler edited comment on LUCENE-1701 at 6/19/09 8:50 AM:

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721830#action_12721830 ] Earwin Burrfoot commented on LUCENE-1701: - Mike, I very much agree with everything

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721827#action_12721827 ] Uwe Schindler commented on LUCENE-1701: --- But the same problem like with NumericToken

[jira] Commented: (LUCENE-1692) Contrib analyzers need tests

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721824#action_12721824 ] Michael McCandless commented on LUCENE-1692: OK I will commit this soon. Than

[jira] Resolved: (LUCENE-1700) LogMergePolicy.findMergesToExpungeDeletes need to get deletes from the SegmentReader

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1700. Resolution: Fixed Thanks Jason! > LogMergePolicy.findMergesToExpungeDeletes need

[jira] Created: (LUCENE-1702) Thai token type() bug

2009-06-19 Thread Robert Muir (JIRA)
Thai token type() bug - Key: LUCENE-1702 URL: https://issues.apache.org/jira/browse/LUCENE-1702 Project: Lucene - Java Issue Type: Bug Components: contrib/analyzers Reporter: Robert Muir

[jira] Resolved: (LUCENE-1639) intermittent failure in TestIndexWriter. testRandomIWReader

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1639. Resolution: Fixed > intermittent failure in TestIndexWriter. testRandomIWReader >

[jira] Updated: (LUCENE-1639) intermittent failure in TestIndexWriter. testRandomIWReader

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1639: --- Attachment: LUCENE-1639.patch OK I tracked this one down... in certain cases, IndexW

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721810#action_12721810 ] Michael McCandless commented on LUCENE-1701: Uwe can you also open an issue fo

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721795#action_12721795 ] Uwe Schindler commented on LUCENE-1693: --- During my tests I found a small problem wit

[jira] Commented: (LUCENE-1692) Contrib analyzers need tests

2009-06-19 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721790#action_12721790 ] Robert Muir commented on LUCENE-1692: - michael, yes the only issue... i'll open anothe

[jira] Commented: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721787#action_12721787 ] Earwin Burrfoot commented on LUCENE-1701: - I vote for factories - escaping back-co

[jira] Created: (LUCENE-1701) Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache

2009-06-19 Thread Uwe Schindler (JIRA)
Add NumericField and NumericSortField, make plain text numeric parsers public in FieldCache, move trie parsers to FieldCache Key: LUCENE-1701

Deleting old javadoc files on Hudson

2009-06-19 Thread Uwe Schindler
Hallo, In December the javadocs build system was updated to generate the javadocs for all in the /all/ subdir, core in /core/ and so on. As Hudson in its nightly build does not delete the old javadocs before publishing the new ones, there are still a lot of outdated html files around. Even the ro

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: LUCENE-1693.patch After committing TrieRange to core, here some updates to the pat

[jira] Commented: (LUCENE-1466) CharFilter - normalize characters before tokenizer

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721761#action_12721761 ] Michael McCandless commented on LUCENE-1466: Thanks for the update Koji! The

[jira] Closed: (LUCENE-1673) Move TrieRange to core

2009-06-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler closed LUCENE-1673. - Resolution: Fixed - Committed addition to core in revision 786470 - Committed remove from contri

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-19 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721722#action_12721722 ] Michael Busch commented on LUCENE-1693: --- Sorry, Uwe, I was really busy today. I'll t

Re: Some thoughts around the use of reader.isDeleted and hasDeletions

2009-06-19 Thread Michael McCandless
On Thu, Jun 18, 2009 at 11:18 PM, Earwin Burrfoot wrote: > Runtime change. Hard to imagine people relying on failing document() call. +1 Mike - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional comman

[jira] Commented: (LUCENE-1692) Contrib analyzers need tests

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721711#action_12721711 ] Michael McCandless commented on LUCENE-1692: Latest patch looks good Robert, t

[jira] Commented: (LUCENE-1539) Improve Benchmark

2009-06-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721705#action_12721705 ] Michael McCandless commented on LUCENE-1539: Where are we assuming/requiring t

Re: madvise(ptr, len, MADV_SEQUENTIAL)

2009-06-19 Thread Michael McCandless
On Fri, Jun 19, 2009 at 4:15 AM, Uwe Schindler wrote: > But then we also need to map, when writing to files, which is hard to do, > because you do not know how large the mapping area will be (unknown filesize). It may not that important on writing to avoid the IO cache, in that at least we are w

RE: madvise(ptr, len, MADV_SEQUENTIAL)

2009-06-19 Thread Uwe Schindler
But then we also need to map, when writing to files, which is hard to do, because you do not know how large the mapping area will be (unknown filesize). As Earwin suggested, we could change MMapDirectory to also mmap on writing, but maps the buffers in something we call "pages". Filesizes of writte