This suggestion is in ignorance of the code and other matters, but I wonder if the filter could be changed to allow single quote in the *absence* of a preceeding Greek letter in the list you gave.

Tom Sullivan
i...@beforgiven.info
FAX: 815-301-2835
---------------------
Great News!
God created you, owns you and gave you commands to obey.
You have disobeyed God - as your conscience very well attests to you.
God's holiness and justice compel Him to punish you in Hell.
Jesus Christ became Man, was crucified, buried and rose from the dead
as a substitute for all who trust in Him, redeeming them from Hell.
If you repent (turn from your sin) and believe (trust) in Jesus Christ,
you will go to Heaven. Otherwise you will go to Hell.
Warning! Good works are a result, not cause, of saving trust.
More info is at www.esig.beforgiven.info
Do you believe this? Copy this signature into your email program
and use the Internet to spread the Great News every time you email.

On 06/11/2018 07:46 AM, David Haslam wrote:
Further clarification and observations about the SWORD filter for UTF8GreekAccents...

My reply of 7th June was sent before I was informed about the source code for UTF8GreekAccents. In fact, this does make use of the mapping table that I provided in March 2017. Thanks, Troy!

You can visit the latest version in SVN trunk here.
https://crosswire.org/svn/sword/trunk/src/modules/filters/utf8greekaccents.cpp

Please note that it was patched during the weekend to add the lines to process GREEK KORONIS & COMBINING GREEK KORONIS. as well as to remove a residual (unused) declaration leftover from the original version. Thanks, Troy.

We may have been wondering why the filter still includes a line to remove the RIGHT SINGLE QUOTATION
       converters[0x2019] = ""; // RIGHT SINGLE QUOTATION MARK

This is because the source text in some older accented Greek modules used this Unicode character. These are usually found at End of Word locations, with typically 1218 occurrences. More recent editions of the Greek NT use the GREEK KORONIS 0x1FBD in all these same locations.

Modules with 0x2019 include MorphGNT, TischMorph and 2TGreek.
Modules with 0x1FBD include SBLG_THE.

FIO. The only Greek letters ever followed by the character are typified by the following analysis (extracted from MorphGNT).
Count Pattern
0034 δ’
0107 θ’
0233 τ’
0292 π’
0213 λ’
0132 φ’
0061 ρ’
0149 ι’
The counts vary slightly for different modules.

We should consider the conjecture that the first ever digitisation of (e.g.) the Tischendorf NT was simply transcribed incorrectly. i.e. 0x2019 was keyed everywhere one would nowadays expect to use a GREEK KORONIS. Maybe the task was performed between Unicode 1.0 (October 1991) and Unicode 1.1 (June 1993) ?

Aside: It's very likely that digitisation took place before Unicode even existed, and that the text was subsequently converted to Unicode. Some of you may remember Claremont-Michigan encodings for Hebrew, Aramaic and Greek.

So, rather than being a bug in SWORD, in retrospect it looks more like an accommodation to a systematic transcription error in some NT Greek text sources.
What we should do about it remains an open question.

One new question arises from the changes to the SWORD filter (2017 & 2018).
Has anything similar been done for the equivalent JSword filter?

Best regards.

David

Sent with ProtonMail <https://protonmail.com> Secure Email.

‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
On 7 June 2018 8:06 PM, David Haslam <dfh...@protonmail.com> wrote:

This ongoing problem affects far too many module releases.
The immediate cause is a wrong assumption implemented in the confmaker script.

The UTF8GreekAccents filter does not restrict its filtering to accents joined or adjacent to letters in the Greek alphabet. And by "accents" please remember that some of these are actually Unicode punctuation marks. It applies the filter "willy-nilly" no matter what the context in terms of language, script or alphabet. It's a one-way valve that should never be used "backwards" to determine whether or not it should be present in the .conf file.

Aside: The other UTF8 filters are not like this, so it's OK for confmaker to use them for testing to see if they are required.

The set of Unicode characters filtered by UTF8GreekAccents are not unique to the Koine Greek language.
Some of them are found in many other languages.

It's theoretically feasible to redesign the filter such that it applies only in the context of Greek letters.
So yes, this is a matter for SWORD developers to consider too.
I documented a suitable mapping table in my GitHub repo in March 2017. See
https://github.com/DavidHaslam/UTF8-Greek-Accents

It was discussed in this mailing list at the time.
Troy was unwilling to replace the existing filter on the grounds that it does what it was designed for on accented Greek modules. The point is this. It was never designed to be used in general to test whether it is needed by a module. When used for this unintended "backwards" purpose, it generally gives the wrong answer.

This concept is not difficult to understand.

Unless and until the filter itself is redesigned, we need a compromise workaround for the confmaker script. My suggestion is to restrict applying this "backwards" test to only the modules in which this line is present.

Lang=grc

This would largely prevent the ongoing spurious addition of this filter due to the automation of module publishing. One can imagine there may be corner cases, such as where (e.g.) a French Bible module had study notes which included some accented Greek words. But the impact would be minimal by not having the filter in the conf file in such rare cases.

Best regards,

David

Sent with ProtonMail <https://protonmail.com> Secure Email.

‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
On 7 June 2018 7:25 PM, DM Smith <dmsm...@crosswire.org> wrote:

I think it is a bug in the SWORD engine if single right quotation mark is seen as a Greek diacritic.

Will look later to verify.

If it is then the module should not have the option.

— DM Smith


On Jun 7, 2018, at 8:54 AM, "ref...@gmx.net <mailto:ref...@gmx.net>" <ref...@gmx.net <mailto:ref...@gmx.net>> wrote:
If a Greek accent is in use, the filter will be there. If this is a bug, I.e. there should not be a Greek accent, please highlight this at source. I guess this is the right approach here too. Then the next iteration will not have a spurious filter

Sent from my mobile. Please forgive shortness, typos and weird autocorrects.


-------- Original Message --------
Subject: Re: [sword-devel] Module upload: FreLXX
From: David Haslam
To: SWORD Developers' Collaboration Forum
CC:


    This line in frelxx.conf is superfluous:

    GlobalOptionFilter=UTF8GreekAccents

    I think it's triggered in confmaker script by the presence of
    these characters.
    U+2019 ’ 656 RIGHT SINGLE QUOTATION MARK

    NB. The source text is inconsistent in which character is used
    for the typographical apostrophe. cf.
    U+0027 ' 39,200 APOSTROPHE

    Example:
    Exodus 3:13 contains "les fils d'Israël" (character U+0027 used)
    Exodus 3:15 contains "aux fils d’Israël" (character U+2019 used)

    When the Greek Accents filter is disabled (in Xiphos) the latter
    becomes "aux fils dIsraël" (without the apostrophe).

    There are no Greek letters in the module, so the GreekAccents
    filter should not be included.

    Best regards,

    David

    Sent with ProtonMail Secure Email.

    ‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐

    On 4 June 2018 7:38 AM, wrote:

    > Dear All,
    >
    > This is to announce that we have just now uploaded FreLXX.
    >
    > This is is an updated version of FreLXX.
    >
    > Many thanks to update for the hard work.
    >
    > yours
    >
    > The Module Team
    >
    > P.S.: This email is sent automatically on upload of a
    new/updated module
    >
    > sword-devel mailing list: sword-devel@crosswire.org
    <mailto:sword-devel@crosswire.org>
    >
    > http://www.crosswire.org/mailman/listinfo/sword-devel
    >
    > Instructions to unsubscribe/change your settings at above page



    _______________________________________________
    sword-devel mailing list: sword-devel@crosswire.org
    <mailto:sword-devel@crosswire.org>
    http://www.crosswire.org/mailman/listinfo/sword-devel
    Instructions to unsubscribe/change your settings at above page
_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org <mailto:sword-devel@crosswire.org>
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page



______________________________________________________________________
This email has been scanned by the Symantec Email Security.cloud service.
For more information please visit http://www.symanteccloud.com
______________________________________________________________________


_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page


_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to