Hi, In Portuguese we have contractions, that are words composed by, for example, a preposition + article, pronoun or an adverb.
Example: Nós acreditávamos nele. (We believed him.) Where "nele" can be divided into "em" + "ele". (in + him) To properly analyze this, I created two token annotation with the same begin and end, but the first I associated with the POS Tag preposition, and the second pronoun. This is especially important when we are doing chunking, because the first token will be part of a prepositional phrase, while the second of a nominal phrase. How can I guarantee that when I call UIMAFit JCasUtil.select I will get the tokens ordered, first the preposition, second the pronoun? Thank you, William