Hi,

Thanks for your comments on the JIRA.
Should I expect exact results when the training data and the sample data are exactly the same, or is there simply too little training data to tell at this stage?

I think a model trained on HTML would be very useful.

Cheers

Paul Cowan
Cutting-Edge Solutions (Scotland)
http://thesoftwaresimpleton.blogspot.com/

On 19 January 2011 20:42, Paul Cowan <[email protected]> wrote:

> I have created a JIRA issue which contains a sample html and a failing
> test.
>
> https://issues.apache.org/jira/browse/OPENNLP-67
>
> Cheers
>
> Paul Cowan
>
> On 13 January 2011 10:21, Paul Cowan <[email protected]> wrote:
>
>> >> Open a new jira issue, either just attach a piece of test data or
>> >> contribute a patch which also contains the additions to the unit tests.
>>
>> I will do that.
>>
>> Cheers
>>
>> Paul Cowan
>>
>> On 13 January 2011 10:15, Jörn Kottmann <[email protected]> wrote:
>>
>>> On 1/13/11 10:55 AM, Paul Cowan wrote:
>>>
>>>>> Maybe you can contribute
>>>>> a small sample of your training data to the project so we can
>>>>> add a junit test.
>>>>
>>>> I will gladly do that. how is the best way to do that? I believe the
>>>> source control is moving.
>>>>
>>>> Is git an option or mercurial? Pull requests are great for this type of
>>>> thing through github or the mercurial equivalent. I will make the model
>>>> available for HTML parsing when it is finished also.
>>>
>>> Even when you do not have issues it would be nice to have a small html
>>> test.
>>>
>>> The code is already moved to the Apache repository, even our website has
>>> a checkout instructions:
>>> http://incubator.apache.org/opennlp/source-code.html
>>>
>>> Open a new jira issue, either just attach a piece of test data or
>>> contribute
>>> a patch which also contains the additions to the unit tests.
>>>
>>> Thanks,
>>> Jörn
