Create reusable sequence detection classes
------------------------------------------

                 Key: OPENNLP-102
                 URL: https://issues.apache.org/jira/browse/OPENNLP-102
             Project: OpenNLP
          Issue Type: Improvement
            Reporter: Jörn Kottmann
            Priority: Minor


There is a common pattern: in many places a piece of code needs to detect BIO 
sequences and create spans out of them. It turned out that writing this code 
is error-prone and that testing it with unit tests is complex.

We should create reusable classes which perform the sequence detection and 
can be used in all the places where it is required.
Having common code also makes possible performance optimizations that would 
otherwise be difficult to do, like mapping via perfect hashing instead of 
string parsing or regular expressions.
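
To illustrate what such a reusable class might look like, here is a minimal 
sketch. It is not existing OpenNLP code: the names BioSequenceDetector and 
TypedSpan are hypothetical, and the tag format ("B-type", "I-type", "O") is 
an assumption, since the issue does not specify one. A stray "I-" tag that 
does not continue an open span of the same type is simply ignored here, which 
is one possible policy among several.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper for illustration; not an existing OpenNLP class. */
final class BioSequenceDetector {

    /** Half-open token span [start, end) with a type; illustrative only. */
    static final class TypedSpan {
        final int start;
        final int end;
        final String type;

        TypedSpan(int start, int end, String type) {
            this.start = start;
            this.end = end;
            this.type = type;
        }

        @Override
        public String toString() {
            return type + "[" + start + ".." + end + ")";
        }
    }

    /**
     * Converts a tag sequence into spans.
     * Assumed tag format: "B-type" begins a span, "I-type" continues an
     * open span of the same type, anything else (e.g. "O") is outside.
     */
    static List<TypedSpan> detect(String[] tags) {
        List<TypedSpan> spans = new ArrayList<>();
        int start = -1;      // index of the open span's first token, or -1
        String type = null;  // type of the open span, or null

        for (int i = 0; i < tags.length; i++) {
            String tag = tags[i];
            boolean continuesOpenSpan = start >= 0
                    && tag.startsWith("I-")
                    && tag.substring(2).equals(type);
            if (continuesOpenSpan) {
                continue;
            }
            // Any other tag ends the span that is currently open.
            if (start >= 0) {
                spans.add(new TypedSpan(start, i, type));
                start = -1;
                type = null;
            }
            if (tag.startsWith("B-")) {
                start = i;
                type = tag.substring(2);
            }
            // Assumption: "O" and stray "I-" tags open nothing.
        }
        // Close a span that runs to the end of the sequence.
        if (start >= 0) {
            spans.add(new TypedSpan(start, tags.length, type));
        }
        return spans;
    }
}
```

With this sketch, the tags O, B-PER, I-PER, O produce a single span covering 
tokens 1 and 2 with type PER. Keeping the detection in one place like this 
means the edge cases (adjacent spans, a span at the end of the input, stray 
continuation tags) only have to be unit tested once.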
