Hi,

Thanks for your comments on the JIRA.

Should I be expecting exact results if the training data and the sample data
are exactly the same or is there just too little training data to tell at
this stage?

I think having a model trained from html would be very useful.

Cheers

Paul Cowan

Cutting-Edge Solutions (Scotland)

http://thesoftwaresimpleton.blogspot.com/



On 19 January 2011 20:42, Paul Cowan <[email protected]> wrote:

> I have created a JIRA issue which contains a sample html and a failing
> test.
>
> https://issues.apache.org/jira/browse/OPENNLP-67
>
> Cheers
>
> Paul Cowan
>
> Cutting-Edge Solutions (Scotland)
>
> http://thesoftwaresimpleton.blogspot.com/
>
>
>
> On 13 January 2011 10:21, Paul Cowan <[email protected]> wrote:
>
>> >> Open a new jira issue, either just attach a piece of test data or
>> contribute a patch which also contains the additions to the unit tests.
>>
>> I will do that.
>>
>>
>> Cheers
>>
>> Paul Cowan
>>
>> Cutting-Edge Solutions (Scotland)
>>
>> http://thesoftwaresimpleton.blogspot.com/
>>
>>
>>
>> On 13 January 2011 10:15, Jörn Kottmann <[email protected]> wrote:
>>
>>> On 1/13/11 10:55 AM, Paul Cowan wrote:
>>>
>>>> Maybe you can contribute
>>>>>>
>>>>>  a small sample of your training data to the project so we can
>>>> add a junit test.
>>>>
>>>> I will gladly do that.  how is the best way to do that?  I believe the
>>>> source control is moving.
>>>>
>>>> Is git an option or mercurial?  Pull requests are great for this type of
>>>> thing through github or the mercurial equivalent.  I will make the model
>>>> available for HTML parsing when it is finished also.
>>>>
>>>
>>> Even when you do not have issues it would be nice to have a small html
>>> test.
>>>
>>> The code is already moved to the Apache repository, even our website  has
>>> a checkout instructions:
>>> http://incubator.apache.org/opennlp/source-code.html
>>>
>>> Open a new jira issue, either just attach a piece of test data or
>>> contribute
>>> a patch which also contains the additions to the unit tests.
>>>
>>> Thanks,
>>> Jörn
>>>
>>>
>>>
>>
>

Reply via email to