On 7/7/25 18:05, Hans Hagen via ntg-context wrote:
> [...]
> Indeed. When there are useful and reliable mappings we can of course add
> more. In that case there also should be intended structure mentioned
> because we can have a richer structure in context than possible with
> those mappings to pdf which then can result in an invalidated mapping.
>
> It's mostly about playing safe.
Of course, I totally agree with that. I only want to play to be able to
investigate which options are the right ones.
This is my way of looking into this, just by testing (or breaking) it first.
> [...]
> Did you test it with ua 1 replaced by ua 2 ?
No, it works fine. Sorry, but with lines such as:
description = { pua = "ua1", pdf = "Note" },
`pua = "ua1"` made me think that it was UA-1 only.
> If needed we can pick up on it next month or so unless there are clean
> and easy additions to make (we're currently not in that mode and it's
> not the most interesting stuff to play with so we like to concentrate
> it, and the more it rains the better).
Tagging is boring and despairing for me (but subcolumnsets for parallel
texts are in pause mode now).
As far as I can recall, there are three things pending regarding spans:
languages, alternative text and actual text (/StructElem is missing there).
Language tagging should be able for some block elements (at least,
delimitedtext and probably description).
> Thanks for looking into it (I have no use cases and no possibilities to
> test and Mikael depends on what means and demands the university puts on
> it.)
Many thanks for mentioning it.
The latest tagging sample mentioned in the list (shorter version,
https://perspectivia.net/servlets/MCRFileNodeServlet/pnet_derivate_00007290/nolte_spaces.pdf)
contains also some tagging that is really hard to believe it might be
right, such as:
/P <</Lang (de-DE)/MCID 312 >>BDC
0.145 Tw 9 0 0 9 66.4399 67.6929 Tm
[(e)13.6 (.)27.4 (g)16.2 (. Peter Frankopan)-7 (, )]TJ
EMC
/P <</Lang (de-DE)/MCID 313 >>BDC
/TT1 1 Tf
10.022 0 Td
[(The Earth Transformed: An )-0.6 (Untold History)]TJ
EMC
At least, there are the following issues here:
1. Too many /P elements for what should be /Span elements.
2. “The Earth Transformed: An Untold Story” is English, not German.
3. /MCID are orphan (they lack their corresponding /StructElem).
#2 may be a human honest mistake, but #1 and #3?
My impression is that many (when not most) people may be struggling with
proper tagging.
Many thanks for your reply and your help,
Pablo
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the
Wiki!
maillist : [email protected] /
https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive : https://github.com/contextgarden/context
wiki : https://wiki.contextgarden.net
___________________________________________________________________________________