Hi there,
First of all, nice job with OpenNLP. :)
I'm sending this email to ask your opinion on how to implement some
features for my project.
The objective of my project is to extract entities (persons, dates,
companies, etc) and their relations in resumes (CVs), for example, company
name and respective working period (date) of a person.
I have a set of resumes and I'm creating a training set for entity
recognition, however I'm a little worried about relation extraction. Is
there a way to
write nested tags in a training set to improve this relation extraction? If
there isn't, what recommendations can you give me to this problem?
As an example for this, right now i have:
<START:description> Senior Developer <END> <START:date> 2009-current <END>
<START:company> Apple Inc. <END>
What i hoped i could have:
<START:jobs>
<START:description> Senior Developer <END> <START:date> 2009-current
<END> <START:company> Apple Inc. <END>
...
<END>
Is a nested tag approach the correct way of trying to achieve this
correlation between entities or is there better approaches to this problem?
Thank you very much for your help.
Best regards,
--
Filipe Araújo