[jira] [Updated] (SOLR-17189) DockMakerTest.testRealisticUnicode fails from whitespace assumption

2024-04-10 Thread David Smiley (Jira)


 [ 
https://issues.apache.org/jira/browse/SOLR-17189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

David Smiley updated SOLR-17189:

Fix Version/s: 9.6.0

> DockMakerTest.testRealisticUnicode fails from whitespace assumption
> ---
>
> Key: SOLR-17189
> URL: https://issues.apache.org/jira/browse/SOLR-17189
> Project: Solr
>  Issue Type: Test
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: benchmarks
>Reporter: David Smiley
>Assignee: David Smiley
>Priority: Major
> Fix For: 9.6.0
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> DockMakerTest.testRealisticUnicode fails 1-2% of the time -- 
> [link|https://ge.apache.org/scans/tests?search.timeZoneId=America%2FNew_York&tests.container=org.apache.solr.bench.DockMakerTest&tests.test=testRealisticUnicode].
> {quote}java.lang.AssertionError: expected:<6> but was:<7> 
> at __randomizedtesting.SeedInfo.seed([C5136F274AFF3ADD:95FFEBF499446D74]:0)   
> •••
> at 
> org.apache.solr.bench.DockMakerTest.testRealisticUnicode(DockMakerTest.java:189){quote}
> It seems clear it's because it assumes that the "realistic unicode" chars 
> won't match the regexp: {{\s}}.  A single space char is used to join the 
> words but maybe this or other whitespace chars are in those unicode codepoint 
> blocks.
> Additionally, it's frustrating that this particular benchmark framework 
> doesn't honor tests.seed in its generation of random data and thus it's hard 
> to reproduce the failure.  That ought to be fixed as well.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org
For additional commands, e-mail: issues-h...@solr.apache.org



[jira] [Updated] (SOLR-17189) DockMakerTest.testRealisticUnicode fails from whitespace assumption

2024-02-29 Thread David Smiley (Jira)


 [ 
https://issues.apache.org/jira/browse/SOLR-17189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

David Smiley updated SOLR-17189:

Description: 
DockMakerTest.testRealisticUnicode fails 1-2% of the time -- 
[link|https://ge.apache.org/scans/tests?search.timeZoneId=America%2FNew_York&tests.container=org.apache.solr.bench.DockMakerTest&tests.test=testRealisticUnicode].

{quote}java.lang.AssertionError: expected:<6> but was:<7>   
at __randomizedtesting.SeedInfo.seed([C5136F274AFF3ADD:95FFEBF499446D74]:0) 
•••
at 
org.apache.solr.bench.DockMakerTest.testRealisticUnicode(DockMakerTest.java:189){quote}

It seems clear it's because it assumes that the "realistic unicode" chars won't 
match the regexp: {{\s}}.  A single space char is used to join the words but 
maybe this or other whitespace chars are in those unicode codepoint blocks.

Additionally, it's frustrating that this particular benchmark framework doesn't 
honor tests.seed in its generation of random data and thus it's hard to 
reproduce the failure.  That ought to be fixed as well.

  was:
DockMakerTest.testRealisticUnicode fails 1-2% of the time -- 
[link|https://ge.apache.org/scans/tests?search.timeZoneId=America%2FNew_York&tests.container=org.apache.solr.bench.DockMakerTest&tests.test=testRealisticUnicode].

{quote}java.lang.AssertionError: expected:<6> but was:<7>   
at __randomizedtesting.SeedInfo.seed([C5136F274AFF3ADD:95FFEBF499446D74]:0) 
•••
at 
org.apache.solr.bench.DockMakerTest.testRealisticUnicode(DockMakerTest.java:189){quote}
It seems clear it's because it assumes that the "realistic unicode" chars won't 
match the regexp: {{\s}} (which is the char used to join multiple unicode 
words).

Additionally, it's frustrating that this particular benchmark framework doesn't 
honor tests.seed in its generation of random data and thus it's hard to 
reproduce the failure.  That ought to be fixed as well.


> DockMakerTest.testRealisticUnicode fails from whitespace assumption
> ---
>
> Key: SOLR-17189
> URL: https://issues.apache.org/jira/browse/SOLR-17189
> Project: Solr
>  Issue Type: Test
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: benchmarks
>Reporter: David Smiley
>Priority: Major
>
> DockMakerTest.testRealisticUnicode fails 1-2% of the time -- 
> [link|https://ge.apache.org/scans/tests?search.timeZoneId=America%2FNew_York&tests.container=org.apache.solr.bench.DockMakerTest&tests.test=testRealisticUnicode].
> {quote}java.lang.AssertionError: expected:<6> but was:<7> 
> at __randomizedtesting.SeedInfo.seed([C5136F274AFF3ADD:95FFEBF499446D74]:0)   
> •••
> at 
> org.apache.solr.bench.DockMakerTest.testRealisticUnicode(DockMakerTest.java:189){quote}
> It seems clear it's because it assumes that the "realistic unicode" chars 
> won't match the regexp: {{\s}}.  A single space char is used to join the 
> words but maybe this or other whitespace chars are in those unicode codepoint 
> blocks.
> Additionally, it's frustrating that this particular benchmark framework 
> doesn't honor tests.seed in its generation of random data and thus it's hard 
> to reproduce the failure.  That ought to be fixed as well.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org
For additional commands, e-mail: issues-h...@solr.apache.org