Features to tokenizer

Manoj B. Narayanan Mon, 25 Sep 2017 21:05:41 -0700

Hi,

I was wondering if there is a way to provide custom features to the
tokenizer. Sometimes the correct tokenization depends on certain factors.


For example, the word 'semi-supervised' should be kept as a single token,
while 'august-september' should be split into separate tokens.
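
To make the example concrete, here is a rough sketch of the behaviour I
have in mind. It is plain Python with a hand-written rule, not any
library's API: the MONTHS set and the split_on_hyphen function are made
up for illustration, and a trained tokenizer would presumably learn this
from an "is a month name" feature rather than from a hard-coded rule.

```python
# Illustration only: a hard-coded stand-in for what a custom feature
# ("both sides of the hyphen are month names") could tell the tokenizer.
MONTHS = {
    "january", "february", "march", "april", "may", "june", "july",
    "august", "september", "october", "november", "december",
}


def split_on_hyphen(token):
    """Split a hyphenated token only when both parts are month names.

    The hyphen is returned as its own token; any other input is
    returned unchanged as a single-element list.
    """
    parts = token.split("-")
    if len(parts) == 2 and all(p.lower() in MONTHS for p in parts):
        return [parts[0], "-", parts[1]]
    return [token]


print(split_on_hyphen("semi-supervised"))   # ['semi-supervised']
print(split_on_hyphen("august-september"))  # ['august', '-', 'september']
```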

Is there any way to add custom features to the Learnable Tokenizer,
similar to what can be done for NER?

Thanks.

Manoj.
