Hi,

Please remember that unmunch does not guarantee a one-to-one mapping between words and root forms. For example, the same unmunched word may be generated more than once, from several different combinations of root word and affixes.

That is why the unmunched word list is typically sorted and then stripped of duplicates.

The basic idea is that a raw word list, once compressed by affix compression (munch), will always expand (unmunch) back to exactly the same raw word list, with no additions or deletions, provided the output is sorted and deduplicated.
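
As a rough illustration of that sort-and-deduplicate step, here is a minimal C sketch. It is not part of hunspell; the function name sort_unique and the in-memory array of strings are assumptions made for the example.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* qsort comparator for an array of char* entries. */
static int cmp_words(const void *a, const void *b)
{
    const char *const *pa = a;
    const char *const *pb = b;
    return strcmp(*pa, *pb);
}

/*
 * Hypothetical helper (not from hunspell): sort the word pointers in
 * place, then squeeze out adjacent duplicates. Returns the number of
 * unique words, which end up in words[0..result-1]. Dropped pointers
 * are not freed here.
 */
size_t sort_unique(char **words, size_t n)
{
    size_t kept, i;

    assert(words != NULL || n == 0);
    if (n == 0)
        return 0;

    qsort(words, n, sizeof *words, cmp_words);

    kept = 1;
    for (i = 1; i < n; i++) {
        /* Keep a word only if it differs from the last one kept. */
        if (strcmp(words[i], words[kept - 1]) != 0)
            words[kept++] = words[i];
    }
    return kept;
}
```

In practice, piping the unmunch output through sort -u should have the same effect; the sketch only shows what that step does.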

FWIW,

Kevin


On Apr 10, 2007, at 2:31 PM, Oleg Burlaca wrote:

Jancs wrote:
I suppose you have to edit the unmunch source to get such an option.

Janis

Yes Jancs, you were right. I've modified the /src/tools/unmunch.c file from the hunspell package.
Just added a line:
  fprintf(stdout, "%s\n", "---");
after the block that writes out wordforms:
   for (i=0; i < numwords; i++) {
     fprintf(stdout,"%s\n",wlist[i].word);
     free(wlist[i].word);
     wlist[i].word = NULL;
     wlist[i].pallow = 0;
   }
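
Put together, the change described above might look roughly like the following self-contained sketch. The struct wordform type and the set_word and flush_wordforms helpers are stand-ins invented for this example, so the real declarations in unmunch.c may differ; only the loop body and the added fprintf line come from the message.

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Stand-in for the word-list entry used in unmunch.c (assumed layout). */
struct wordform {
    char *word;
    int pallow;
};

/* Hypothetical helper: store a heap copy of s in entry e.
   Returns 0 on success, -1 if allocation fails. */
int set_word(struct wordform *e, const char *s)
{
    e->word = malloc(strlen(s) + 1);
    if (e->word == NULL)
        return -1;
    strcpy(e->word, s);
    e->pallow = 0;
    return 0;
}

/* Write every expanded form for one root, release it, and then
   emit the "---" separator line after the group. */
void flush_wordforms(FILE *out, struct wordform *wlist, int numwords)
{
    int i;

    assert(out != NULL);
    for (i = 0; i < numwords; i++) {
        fprintf(out, "%s\n", wlist[i].word);
        free(wlist[i].word);
        wlist[i].word = NULL;
        wlist[i].pallow = 0;
    }
    fprintf(out, "%s\n", "---");   /* the added line */
}
```

Passing stdout as the first argument gives the behaviour described in the message: each group of word forms is followed by a line containing only "---".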


It was easier than I thought :))
Thanks.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: dev- [EMAIL PROTECTED]
