Hi, I have noticed that in the EMS, the factors are generated from the input parallel corpus by POS taggers such as MXPOST.
I have a gold POS-tagged parallel corpus available, in the format *word1/POS1 word2/POS2 word3/POS3*. Is there a way to use the gold corpus directly from the EMS config file (and if so, in what specific format should it be supplied), instead of writing intermediate factor-generation scripts? Also, is it possible to add morphological analysis as additional factors alongside the POS tags, directly in the corpus?

--
- Jayendra Rakesh.
BTech CSD.
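P.S. For reference, this is roughly the kind of intermediate script I would like to avoid. It is only a minimal sketch: it assumes the target is the usual Moses factored format with `|` as the factor separator (word|POS), and the function name `to_moses_factored` is made up for illustration, not part of Moses or the EMS.

```python
def to_moses_factored(line, sep="/", factor_sep="|"):
    """Turn 'word/POS word/POS' into 'word|POS word|POS'.

    Hypothetical helper: assumes '|' is the factor separator expected
    by Moses and that every token carries exactly one gold POS tag.
    """
    out = []
    for token in line.split():
        # Split on the LAST separator so words containing '/' stay intact.
        word, found, tag = token.rpartition(sep)
        if not found or not word or not tag:
            raise ValueError(f"token without a POS tag: {token!r}")
        out.append(word + factor_sep + tag)
    return " ".join(out)


print(to_moses_factored("the/DT cat/NN sleeps/VBZ"))
# prints: the|DT cat|NN sleeps|VBZ
```

A morphological factor could presumably be appended the same way as a third field (word|POS|morph), but whether the EMS can consume such a pre-factored corpus directly is exactly what I am asking.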
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support