On 03.08.2012 at 18:29, Anton Karl Ingason wrote:
We are very pleased to announce that version 0.1 of the Faroese Parsed
Historical Corpus is now available for free download.
The corpus can be downloaded from:
http://linguist.is/farpahc/
In the txt files I see this sort of text:
I see UTF-8 files in the txt folder, and decidedly non-empty files in
the psd folder. Perhaps your download was corrupted. I'd try again.
--
Doug Ewell | Thornton, Colorado, USA
http://www.ewellic.org | @DougEwell
Original Message
Subject: Re: FarPaHC 0.1.
From:
TUS 6.1 says:
P9 [Guideline] When a nonspacing mark is applied to the letters i and j or
any other character with the Soft_Dotted property, the inherent dot on the
base character is suppressed in display.
Well, the term non-spacing mark is too wide here. Non-spacing marks include
marks below
On Fri, Aug 3, 2012 at 4:11 PM, Kent Karlsson kent.karlsso...@telia.com wrote:
UAX 44 has:
Characters with a soft dot, like i or j. An accent placed on these
characters causes the dot to disappear.
That is sort of correct, but apparently open to misinterpretation. This
goes for all cc 230
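
The distinction drawn in this thread, between marks above (combining class 230) and marks below, can be checked with Python's standard unicodedata module. This is only a rough sketch: the module does not expose the Soft_Dotted property, so SOFT_DOTTED_SAMPLE is a hypothetical hand-written subset holding just the two letters named above, and suppresses_dot is an illustrative helper, not anything defined by TUS or UAX 44. It also looks at a single mark only and ignores longer combining sequences.

```python
import unicodedata

# Assumption: hand-picked subset for illustration. The real Soft_Dotted
# property (PropList.txt) covers more characters than i and j.
SOFT_DOTTED_SAMPLE = {"i", "j"}

ABOVE = 230  # canonical combining class for marks placed above the base


def suppresses_dot(base: str, mark: str) -> bool:
    """Return True if `mark` on `base` should hide the inherent dot.

    Sketch of the reading discussed in the thread: only marks with
    combining class 230 affect the dot of a soft-dotted base letter.
    """
    return base in SOFT_DOTTED_SAMPLE and unicodedata.combining(mark) == ABOVE


acute = "\u0301"      # COMBINING ACUTE ACCENT, class 230
dot_below = "\u0323"  # COMBINING DOT BELOW, class 220

print(suppresses_dot("i", acute))      # mark above a soft-dotted letter
print(suppresses_dot("i", dot_below))  # mark below: dot stays
print(suppresses_dot("a", acute))      # base is not soft-dotted
```

Under this reading, a mark below such as U+0323 leaves the dot on i in place, which is why "nonspacing mark" in the P9 wording is broader than intended.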