[jira] Updated: (LUCENE-2142) FieldCache.getStringIndex should not throw exception if term count exceeds doc count

2010-06-24 Thread Uwe Schindler (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe Schindler updated LUCENE-2142:
--

Fix Version/s: 2.9.4
   3.0.3
   (was: 3.0.2)
   (was: 2.9.3)

 FieldCache.getStringIndex should not throw exception if term count exceeds 
 doc count
 

 Key: LUCENE-2142
 URL: https://issues.apache.org/jira/browse/LUCENE-2142
 Project: Lucene - Java
  Issue Type: Bug
  Components: Search
Reporter: Michael McCandless
Assignee: Michael McCandless
Priority: Minor
 Fix For: 2.9.4, 3.0.3, 3.1, 4.0

 Attachments: LUCENE-2142-fix-3x.patch, LUCENE-2142-fix-trunk.patch


 Spinoff of LUCENE-2133/LUCENE-831.
 Currently FieldCache cannot handle more than one value per field.
 We may someday want to fix that... but until that day:
 FieldCache.getStringIndex currently does a simplistic check to try to
 catch when you've accidentally allowed more than one term per field,
 by testing if the number of unique terms exceeds the number of
 documents.
 The problem is, this is not a perfect check, in that it allows false
 negatives (you could have more than one term per field for some docs
 and the check won't catch you).
 Further, the exception thrown is the unchecked RuntimeException.
 So this means... you could happily think all is good, until some day,
 well into production, once you've updated enough docs, suddenly the
 check will catch you and throw an unhandled exception, stopping all
 searches [that need to sort by this string field] in their tracks.
 It's not gracefully degrading.
 I think we should simply remove the test, ie, if you have more terms
 than docs then the terms simply overwrite one another.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (LUCENE-2142) FieldCache.getStringIndex should not throw exception if term count exceeds doc count

2010-06-19 Thread Uwe Schindler (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe Schindler updated LUCENE-2142:
--

Attachment: LUCENE-2142-fix.patch

After a coffee i have seen the problem, too - stupoid :(

Here is the fix for 3.x (also 3.0 and 2.9) - in trunk the fix is not needed, as 
there are growable arrays. Maybe we should add a simple test to all branches!



 FieldCache.getStringIndex should not throw exception if term count exceeds 
 doc count
 

 Key: LUCENE-2142
 URL: https://issues.apache.org/jira/browse/LUCENE-2142
 Project: Lucene - Java
  Issue Type: Bug
  Components: Search
Reporter: Michael McCandless
Assignee: Michael McCandless
Priority: Minor
 Fix For: 2.9.3, 3.0.2, 3.1, 4.0

 Attachments: LUCENE-2142-fix.patch


 Spinoff of LUCENE-2133/LUCENE-831.
 Currently FieldCache cannot handle more than one value per field.
 We may someday want to fix that... but until that day:
 FieldCache.getStringIndex currently does a simplistic check to try to
 catch when you've accidentally allowed more than one term per field,
 by testing if the number of unique terms exceeds the number of
 documents.
 The problem is, this is not a perfect check, in that it allows false
 negatives (you could have more than one term per field for some docs
 and the check won't catch you).
 Further, the exception thrown is the unchecked RuntimeException.
 So this means... you could happily think all is good, until some day,
 well into production, once you've updated enough docs, suddenly the
 check will catch you and throw an unhandled exception, stopping all
 searches [that need to sort by this string field] in their tracks.
 It's not gracefully degrading.
 I think we should simply remove the test, ie, if you have more terms
 than docs then the terms simply overwrite one another.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (LUCENE-2142) FieldCache.getStringIndex should not throw exception if term count exceeds doc count

2010-06-19 Thread Uwe Schindler (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe Schindler updated LUCENE-2142:
--

Attachment: (was: LUCENE-2142-fix.patch)

 FieldCache.getStringIndex should not throw exception if term count exceeds 
 doc count
 

 Key: LUCENE-2142
 URL: https://issues.apache.org/jira/browse/LUCENE-2142
 Project: Lucene - Java
  Issue Type: Bug
  Components: Search
Reporter: Michael McCandless
Assignee: Michael McCandless
Priority: Minor
 Fix For: 2.9.3, 3.0.2, 3.1, 4.0

 Attachments: LUCENE-2142-fix-3x.patch, LUCENE-2142-fix-trunk.patch


 Spinoff of LUCENE-2133/LUCENE-831.
 Currently FieldCache cannot handle more than one value per field.
 We may someday want to fix that... but until that day:
 FieldCache.getStringIndex currently does a simplistic check to try to
 catch when you've accidentally allowed more than one term per field,
 by testing if the number of unique terms exceeds the number of
 documents.
 The problem is, this is not a perfect check, in that it allows false
 negatives (you could have more than one term per field for some docs
 and the check won't catch you).
 Further, the exception thrown is the unchecked RuntimeException.
 So this means... you could happily think all is good, until some day,
 well into production, once you've updated enough docs, suddenly the
 check will catch you and throw an unhandled exception, stopping all
 searches [that need to sort by this string field] in their tracks.
 It's not gracefully degrading.
 I think we should simply remove the test, ie, if you have more terms
 than docs then the terms simply overwrite one another.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (LUCENE-2142) FieldCache.getStringIndex should not throw exception if term count exceeds doc count

2010-06-19 Thread Uwe Schindler (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe Schindler updated LUCENE-2142:
--

Attachment: LUCENE-2142-fix-3x.patch
LUCENE-2142-fix-trunk.patch

Here patch with test for 3.x and before. Trunk patch only contains test, which 
passes.

 FieldCache.getStringIndex should not throw exception if term count exceeds 
 doc count
 

 Key: LUCENE-2142
 URL: https://issues.apache.org/jira/browse/LUCENE-2142
 Project: Lucene - Java
  Issue Type: Bug
  Components: Search
Reporter: Michael McCandless
Assignee: Michael McCandless
Priority: Minor
 Fix For: 2.9.3, 3.0.2, 3.1, 4.0

 Attachments: LUCENE-2142-fix-3x.patch, LUCENE-2142-fix-trunk.patch


 Spinoff of LUCENE-2133/LUCENE-831.
 Currently FieldCache cannot handle more than one value per field.
 We may someday want to fix that... but until that day:
 FieldCache.getStringIndex currently does a simplistic check to try to
 catch when you've accidentally allowed more than one term per field,
 by testing if the number of unique terms exceeds the number of
 documents.
 The problem is, this is not a perfect check, in that it allows false
 negatives (you could have more than one term per field for some docs
 and the check won't catch you).
 Further, the exception thrown is the unchecked RuntimeException.
 So this means... you could happily think all is good, until some day,
 well into production, once you've updated enough docs, suddenly the
 check will catch you and throw an unhandled exception, stopping all
 searches [that need to sort by this string field] in their tracks.
 It's not gracefully degrading.
 I think we should simply remove the test, ie, if you have more terms
 than docs then the terms simply overwrite one another.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (LUCENE-2142) FieldCache.getStringIndex should not throw exception if term count exceeds doc count

2010-05-30 Thread Michael McCandless (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael McCandless updated LUCENE-2142:
---

Fix Version/s: 2.9.3

 FieldCache.getStringIndex should not throw exception if term count exceeds 
 doc count
 

 Key: LUCENE-2142
 URL: https://issues.apache.org/jira/browse/LUCENE-2142
 Project: Lucene - Java
  Issue Type: Bug
  Components: Search
Reporter: Michael McCandless
Assignee: Michael McCandless
Priority: Minor
 Fix For: 2.9.3, 4.0


 Spinoff of LUCENE-2133/LUCENE-831.
 Currently FieldCache cannot handle more than one value per field.
 We may someday want to fix that... but until that day:
 FieldCache.getStringIndex currently does a simplistic check to try to
 catch when you've accidentally allowed more than one term per field,
 by testing if the number of unique terms exceeds the number of
 documents.
 The problem is, this is not a perfect check, in that it allows false
 negatives (you could have more than one term per field for some docs
 and the check won't catch you).
 Further, the exception thrown is the unchecked RuntimeException.
 So this means... you could happily think all is good, until some day,
 well into production, once you've updated enough docs, suddenly the
 check will catch you and throw an unhandled exception, stopping all
 searches [that need to sort by this string field] in their tracks.
 It's not gracefully degrading.
 I think we should simply remove the test, ie, if you have more terms
 than docs then the terms simply overwrite one another.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org