[
https://issues.apache.org/jira/browse/OPENNLP-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Richard Zowalla closed OPENNLP-1702.
------------------------------------
Resolution: Fixed
> BratDocumentStream should process files in bratCorpusDir deterministically
> --------------------------------------------------------------------------
>
> Key: OPENNLP-1702
> URL: https://issues.apache.org/jira/browse/OPENNLP-1702
> Project: OpenNLP
> Issue Type: Bug
> Components: Build, Packaging and Test
> Affects Versions: 2.5.3
> Reporter: Martin Wiesner
> Assignee: Martin Wiesner
> Priority: Minor
> Fix For: 2.5.4
>
> Time Spent: 50m
> Remaining Estimate: 0h
>
> With the recent addition of {{BratNameSampleStreamFactoryTest}} via
> OPENNLP-1695, it became obvious (Eval test run), that the code in
> BratDocumentStream is prone to non-determinism. This stems from the fact that
> {{java.util.File#listFiles(..)}} does not guarantee any order of the returned
> elements.
> A potential fix for achieving determinism again is to sort the result of
> listFiles(..) alphabetically in ASC order.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)