Re: [sword-devel] TEI formatting, duplicated key (BDB Glosses)

DM Smith Mon, 30 Apr 2012 07:00:40 -0700


On 04/30/2012 09:37 AM, Daniel Owens wrote:

On 04/30/2012 06:54 AM, Chris Little wrote:
On 4/30/2012 4:39 AM, David Troidl wrote:
Hi Chris,

I'm certainly no expert on your TEI dictionaries, but wouldn't it make
sense to have the first key be one that would sort properly, andpresent
the dictionary in true alphabetical order? I'm thinking of Middle
Liddell, as well as the Hebrew. This key wouldn't even necessarily have
to be shown to the user. The second key, the title, could then maintain
the proper accents for display, without hindering sorting, searching or
navigation.
I confess, I don't understand what you're proposing this as analternative to.
In the example Karl cites, there's just one actual key per entry. Itis an uppercased version of the entryFree's n attribute. This is thekey that is sorted.
The un-uppercased version from the n attribute is being rendered aspart of the entry text via the TEI filters. This is the part I'mproposing we retain, but render somewhere else, e.g. right-justifiedat the bottom of the entry.
We also render all the text of the entry, which in these casesincludes the text from a title element.
I don't know what 'true alphabetical order' means, but if you meanlocalized sort order, it's not possible with the currentimplementation of this module type.
--Chris
I think David's concern is something that needs to be dealt with. Anumber of possibilities could be pursued, some of them together:
1. The current implementation is to sort by unicode code points.This works particularly well with numeric keys. A quick solution forlanguages for which such sorting is not alphabetical would be tofollow David's suggestion of using keys that the user does not evensee. This has the advantage of providing a workable solution rightaway, but there are some problems with this. First, we could create anew "strongs" standard because the current implementation does notactually hide keys. That could be solved by making the keys so obscurethat no one would remember them. Second, any future, more robustsolution would require reworking all modules keyed to it. I have toyedwith this solution, and it might be the pragmatic way forward, but itis not ideal.
2. A localized sort order, which I think this is what David meansby true alphabetical order, would be a better long-term solution.
3. In addition, using genbooks for lexica would work for lexicathat are sorted by root, with subentries nested in a hierarchy, justlike in the Hesychius module and BDB. I have been working with Troy onthis. Unfortunately, front-ends do not recognize the Feature=HebrewDefoption in the conf file and allow genbooks as lexica. I can sendanyone an example lexicon if you are interested in working on this. Inthat case, instead of @n as the key, */x-entry/@osisID would be the key.
Any thoughts?

I think there is a problem with the sorting of entries in dictionarieswhere the keys are not ascii. I don't remember the details, but I seemto remember it having been discussed here.

For JSword, we'll be building a Lucene search index for the key, theterm and the whole entry. A user lookup will be normalized and thesearch will return the key with which lookup will proceed internally asit does today. ICU provides the ability to create a localized sort key(not at all suitable for display) that can be used to sort dictionaryentries for the end-users locale. I'm thinking that for TEI dictionariesthe representation of the key should not be shown at all.


From what I can remember, this will solve all the issues.

In Him,
    DM





_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Re: [sword-devel] TEI formatting, duplicated key (BDB Glosses)

Reply via email to