On Wed, Feb 22, 2017 at 1:58 AM, David Haslam <dfh...@googlemail.com> wrote:

> The UTF8GreekAccents filter also fails to remove one particular Greek
> diacritic.
>
> At least, that's the case for how it works within diatheke 4.7 distributed
> with Xiphos 4.0.4
>
> After the normalization step, it actually leaves in the following:
>
> U+0345  ͅ       COMBINING GREEK YPOGEGRAMMENI
>

Are you sure this is actually a diacritic of the type that we want to be
stripping? I learned that this is actually a letter within Greek.

--Greg


>
> This a new "alpha test" bug!
>
> And there was I thinking I'd only unearthed "beta test" bugs....
>
> FWIW, the combined letters where this accent remains are these three:
>
> 001FB3  ᾳ       GREEK SMALL LETTER ALPHA WITH YPOGEGRAMMENI
> 001FC3  ῃ       GREEK SMALL LETTER ETA WITH YPOGEGRAMMENI
> 001FF3  ῳ       GREEK SMALL LETTER OMEGA WITH YPOGEGRAMMENI
>
> It's possible there might be others outside the corpus I was testing.
>
>
>
> Best regards,
>
> David
>
>
>
>
>
> --
> View this message in context: http://sword-dev.350566.n4.
> nabble.com/GlobalOptionFilter-UTF8GreekAccents-and-non-Greek-modules-
> tp4656719p4656778.html
> Sent from the SWORD Dev mailing list archive at Nabble.com.
>
> _______________________________________________
> sword-devel mailing list: sword-devel@crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
>
_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to