I was hoping to find a detailed discussion of the binary model format and the scoring function
On Wed, Nov 13, 2013 at 3:44 PM, Jörn Kottmann <[email protected]> wrote: > The documentation for the document categorizer can be found here: > http://opennlp.apache.org/documentation/1.5.3/manual/ > opennlp.html#tools.doccat > > Jörn > > > On 11/13/2013 11:14 PM, Charles Martin wrote: > >> Jörn >> Whats the link? >> >> >> Charles >> >> >> >> On Wed, Nov 13, 2013 at 2:01 PM, Joern Kottmann <[email protected]> >> wrote: >> >> Have a look at the documentation. The doc finder in the current state >>> uses >>> either a maxent or perception model. And generates bag of word features >>> for >>> the document. >>> >>> There is no sample model for it. You need to train it on your own data. >>> >>> Jörn >>> On Nov 13, 2013 6:18 PM, "Charles Martin" <[email protected]> >>> wrote: >>> >>> Where I can i find a detailed description of the model.bin file and the >>>> scoring functions for the document classifier? >>>> >>>> >>>> >>>> >>>> On Wed, Nov 13, 2013 at 1:32 AM, Thomas Zastrow <[email protected] >>>> >>>>> wrote: >>>>> Am 12.11.2013 21:19, schrieb Jörn Kottmann: >>>>> >>>>> On 11/12/2013 09:11 PM, Thomas Zastrow wrote: >>>>>> >>>>>> Are there any plans to merge the repositories? Any other sources for >>>>>>> models? >>>>>>> >>>>>>> There are a few people here and there publishing models on their >>>>>> >>>>> sites, >>> >>>> we should add a model link page to our wiki, so people can easily find >>>>>> them. >>>>>> >>>>>> Jörn >>>>>> >>>>>> I think this is a good idea. The main point here is in my oppinion, >>>>> >>>> that >>> >>>> most models were created by using copyright protected material, which >>>>> >>>> puts >>>> >>>>> them into a grey zone. Thats the reason why many people are not >>>>> distributing them. >>>>> >>>>> But anyway, having a list of models would be great. >>>>> >>>>> Best, >>>>> >>>>> Tom >>>>> >>>>> -- >>>>> Dr. Thomas Zastrow >>>>> Riemerfeldring 7a >>>>> >>>>> 85748 Garching >>>>> Tel.: 0162 422 8029 >>>>> www.thomas-zastrow.de >>>>> >>>>> >>>>> >>>> -- >>>> This e-mail message, and any attachments, is intended only for the use >>>> of >>>> the individual or entity identified in the alias address of this message >>>> and may contain information that is confidential, privileged and subject >>>> >>> to >>> >>>> legal restrictions and penalties regarding its unauthorized disclosure >>>> >>> and >>> >>>> use. Any unauthorized review, copying, disclosure, use or distribution >>>> is >>>> strictly prohibited. If you have received this e-mail message in error, >>>> please notify the sender immediately by reply e-mail and delete this >>>> message, and any attachments, from your system. Thank you. >>>> >>>> >> >> > -- This e-mail message, and any attachments, is intended only for the use of the individual or entity identified in the alias address of this message and may contain information that is confidential, privileged and subject to legal restrictions and penalties regarding its unauthorized disclosure and use. Any unauthorized review, copying, disclosure, use or distribution is strictly prohibited. If you have received this e-mail message in error, please notify the sender immediately by reply e-mail and delete this message, and any attachments, from your system. Thank you.
