Re: [XeTeX] XeTeX maintenance

2015-04-28 Thread Johann Spies
On Sat, Apr 25, 2015 at 02:13:08PM -0700, Gildas Hamel wrote:

 I thank you for all the work, dedication, and grace you brought to
 XeTeX. I'm an end user who barely understands how it all works. I keep
 marveling at the wonders Xe(La)TeX and Lua(La)TeX typesetting allows
 me to do daily.

I fully agree with this.  Thank you very much.

Regards
Johann

-- 
J.H. Spies - Tel. 021-982 2694 / 082 782 0336 / 021-808 4599(w)  
 Posbus 4668, Tygervallei 7536

 Thy word have I hid in mine heart, that I might not 
  sin against thee.  Psalm 119:11 


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-28 Thread Joseph Wright
On 28/04/2015 00:48, Douglas McKenna wrote:
 That isn't at all clear, I don't see any evidence that the equivalent 
 #; notation in XML is getting less used. For runs of natural language 
 text then clearly using character data directly makes more sense
  but to get specific symbols accessing by code point often makes sense. 

 To get a math bold A, It is much easier to tell someone to enter ^1d400 
 than to tell them how
 to enter 퐀 in whatever system they are using.
 
 Of course, every Unicode reference on the web would be referring to it as 
 U+1D4000.  Sigh.
 
 Anyway, duly noted.  Except in the future, whatever system they are using 
 will likely be able to handle the UTF-8 character as direct input, just like 
 it appears (in my email reader) above as a math bold A.

Well yes and no. Whilst editors, viewers, etc. can be expanded to cover
the entire Unicode range, no one font will cover the entire spectrum. At
the same time, most keyboards are only ever going to have ~100 keys. So
whilst for the main language of a document Unicode makes sense, saying
that you have to find the correct Unicode code point for everything is
not so convenient. (One can arrange different key binds to flip between
which could be used to do for example the math-mode bold A business, but
that may or may not be easier than just typing \mathbf{A} or whatever,
depending on the use case.)
--
Joseph Wright




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Zdenek Wagner
Hi all,

if you use para indentation=none looseness=1Something.../para you
are not using XML properly. The purpose of XML is to describe the structure
of a document, not its appearance. Generally you will print the information
from the XML file in a different order or you will print just a part of it.
As the second step you will run XSLT in order to extract the elements that
have to be printed and finally XSL-FO in order to format the output. Of
course you can combine these two steps into a single one and you can use
other formatting engines instead of XSL-FO. On the contrary, the TeX
primitives do not describe the document structure but ist appearance. LaTeX
offers a kind of structural markup, eg \chapter, \section etc. Plain TeX
users are supposed to invent such a markup themselves fr each docunebt or
they can use OPmac developed by Petr Olšák.

XML may, of course, be used almost for everything. It helped me much when
typesetting a book with pictures that have fixed places and the text is
floating. It is described here:
http://icebearsoft.euweb.cz/bharat.php

I gave a lecture on this topic but only in Czech so the slides might be
useless for you but you can look at the sources and XSLT stylesheets. I use
a commercial font that cannot be distributed so uless you have it, you will
not be able to compile the source, at least you will have to redefine the
font.

As I wrote, XML defines the document structure. When creating my web pages,
I have all language variants in one XML file and XSLT builds 1 HTML file
per language. For instance, this page is avaiable in three languages:
http://icebearsoft.euweb.cz/ArunakashKaRasta/

The tools for creating the language versions are described here:
http://icebearsoft.euweb.cz/webmake.php

I have other applications of XML + TeX but if they are published,
informatin is (unfortunatelly) only in Czech.

Zdeněk Wagner
http://hroch486.icpf.cas.cz/wagner/
http://icebearsoft.euweb.cz

2015-04-27 12:39 GMT+02:00 Philip Taylor p.tay...@rhul.ac.uk:



 Joseph Wright wrote:
  On 27/04/2015 07:35, Philip Taylor wrote:
  Going even further off-topic, but pursuing this one aspect of the
  thread, is there not only real one problem :  the need to educate users
  to cease marking up their documents in raw (La)TeX syntax, and instead
  to express them in well-formed XML ?  I have just finished typesetting
  (using [plain] XeTeX) a 544pp book marked up entirely in XML, and whilst
  I have made no efforts to generate PDF/UA, I am convinced that the task
  of so doing (assuming that the necessary primitives are or were
  available in XeTeX) would have been 1/1000 of the effort needed to do so
  had the book been marked up in traditional (La)TeX syntax with its usual
  accompanying conflation of form and content.
 
  As Ross says in a parallel message, XML raises different issues and is
  not a panacea. For a start, we can ask if XML is a particularly good
  format not only here or for anything (there's a blog post by Linus
  Torvalds suggesting the answer is 'no'!). Assuming XML is at some level
  a good plan, that still doesn't make it a good plan for the end user nor
  ensure that the end sure will stick to logical structures. There's also
  the business that TeX is useful because sometimes we do need some visual
  adjustment or programming element.

 Let me address the last point first, because it is by far the easier to
 rebut.  In the 544pp book to which I referred earlier, there are
 occasional places where TeX's typesetting system, in the absence of
 explicit guidance, produces sub-optimal results.  This is overcome at
 the XML level by the simple expedient of attributes (where such can be
 restricted to a single element):

 Para indentation=none vadjust=0,75 hbadness=4000image
 status=active vadjust=-1,8 source=FO+78-81-57-1813 repository=NA
 callout=Document_2/imageforeign language=GreekΔιὰ τῆς παρούσης
 μου ἀναφορᾶς ἀναφέρω τῇ ἐξοχότητί της, ὅτι κατὰ τὸ ͵αωαʹ ἔτος

 or of pragmats (where they may be required in a more general situation):

 Parapragmat code=\looseness = 1 \emergencystretch = 0,1 em
 \tolerance =  \hbadness = \tolerance \parfillskip = 0 pt plus
 0,3\hsize \relax/pragmatIn April 1813 the then Patriarch of
 placeJerusalem/place owner-individual

 indexterm=Polykarpos,_Patriarch_of_JerusalemPolykarpos/owner-individual
 (1808–27) wrote to other-person indexterm=Liston,~RobertRobert
 Liston/other-person, the British Ambassador in

 The former are used fairly frequently to optimise appearance; the latter
 are used in only a very few places.

 As to whether XML is a particularly good format not only here or for
 anything, all I can say is that in my experience we (humanity, that is)
 have not yet come up with anything better; LaTeX 2e, by explicitly
 permitting the conflation of form and content, fails abysmally in this
 respect (IMHO, of course).

 ** Phil.


 --
 Subscriptions, Archive, and List information, etc.:
   

Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Philip Taylor


Joseph Wright wrote:
 On 27/04/2015 07:35, Philip Taylor wrote:
 Going even further off-topic, but pursuing this one aspect of the
 thread, is there not only real one problem :  the need to educate users
 to cease marking up their documents in raw (La)TeX syntax, and instead
 to express them in well-formed XML ?  I have just finished typesetting
 (using [plain] XeTeX) a 544pp book marked up entirely in XML, and whilst
 I have made no efforts to generate PDF/UA, I am convinced that the task
 of so doing (assuming that the necessary primitives are or were
 available in XeTeX) would have been 1/1000 of the effort needed to do so
 had the book been marked up in traditional (La)TeX syntax with its usual
 accompanying conflation of form and content.
 
 As Ross says in a parallel message, XML raises different issues and is
 not a panacea. For a start, we can ask if XML is a particularly good
 format not only here or for anything (there's a blog post by Linus
 Torvalds suggesting the answer is 'no'!). Assuming XML is at some level
 a good plan, that still doesn't make it a good plan for the end user nor
 ensure that the end sure will stick to logical structures. There's also
 the business that TeX is useful because sometimes we do need some visual
 adjustment or programming element.

Let me address the last point first, because it is by far the easier to
rebut.  In the 544pp book to which I referred earlier, there are
occasional places where TeX's typesetting system, in the absence of
explicit guidance, produces sub-optimal results.  This is overcome at
the XML level by the simple expedient of attributes (where such can be
restricted to a single element):

Para indentation=none vadjust=0,75 hbadness=4000image
status=active vadjust=-1,8 source=FO+78-81-57-1813 repository=NA
callout=Document_2/imageforeign language=GreekΔιὰ τῆς παρούσης
μου ἀναφορᾶς ἀναφέρω τῇ ἐξοχότητί της, ὅτι κατὰ τὸ ͵αωαʹ ἔτος

or of pragmats (where they may be required in a more general situation):

Parapragmat code=\looseness = 1 \emergencystretch = 0,1 em
\tolerance =  \hbadness = \tolerance \parfillskip = 0 pt plus
0,3\hsize \relax/pragmatIn April 1813 the then Patriarch of
placeJerusalem/place owner-individual
indexterm=Polykarpos,_Patriarch_of_JerusalemPolykarpos/owner-individual
(1808–27) wrote to other-person indexterm=Liston,~RobertRobert
Liston/other-person, the British Ambassador in

The former are used fairly frequently to optimise appearance; the latter
are used in only a very few places.

As to whether XML is a particularly good format not only here or for
anything, all I can say is that in my experience we (humanity, that is)
have not yet come up with anything better; LaTeX 2e, by explicitly
permitting the conflation of form and content, fails abysmally in this
respect (IMHO, of course).

** Phil.


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Martin Schröder
2015-04-27 16:40 GMT+02:00 Apostolos Syropoulos asyropou...@yahoo.com:
 Many consider that JSON will eventually replace XML.

Only if JSON gets schemes.
http://json-schema.org/ exists, but is not widely used.

Best
   Martin


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Philip Taylor


Joseph Wright wrote:

 Somewhat away from the original topic, but it strikes me that building a
 tagged PDF is going to be much more problematic at the macro layer than
 at the engine level: is that fair? Deciding what elements of a document
 are 'structure' is hard, and in 'real' documents it's not unusual to see
 a lot of input that's more about appearance than structure. That of
 course isn't limited to TeX: I suspect anyone trying to generate tagged
 output has the same concern (users do odd things).

Going even further off-topic, but pursuing this one aspect of the
thread, is there not only real one problem :  the need to educate users
to cease marking up their documents in raw (La)TeX syntax, and instead
to express them in well-formed XML ?  I have just finished typesetting
(using [plain] XeTeX) a 544pp book marked up entirely in XML, and whilst
I have made no efforts to generate PDF/UA, I am convinced that the task
of so doing (assuming that the necessary primitives are or were
available in XeTeX) would have been 1/1000 of the effort needed to do so
had the book been marked up in traditional (La)TeX syntax with its usual
accompanying conflation of form and content.

** Phil.


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Joseph Wright
On 27/04/2015 00:22, Ross Moore wrote:
 But of course that doesn't address the problem for LaTeXt users until
 someone writes a suitable/comparable package (maybe someone did
 already, I didn't try to follow).
 
 I have coding for much of what is needed, using the modified pdfTeX.
 But there is a lot that still needs to be added; e.g. PDF’s table model,
 References, footnotes, etc. 

Somewhat away from the original topic, but it strikes me that building a
tagged PDF is going to be much more problematic at the macro layer than
at the engine level: is that fair? Deciding what elements of a document
are 'structure' is hard, and in 'real' documents it's not unusual to see
a lot of input that's more about appearance than structure. That of
course isn't limited to TeX: I suspect anyone trying to generate tagged
output has the same concern (users do odd things).
--
Joseph Wright




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Joseph Wright
On 27/04/2015 08:43, Ross Moore wrote:
 Hi Joseph,
 
 On 27/04/2015, at 4:19 PM, Joseph Wright wrote:
 
 On 27/04/2015 00:22, Ross Moore wrote:
 But of course that doesn't address the problem for LaTeXt users until
 someone writes a suitable/comparable package (maybe someone did
 already, I didn't try to follow).

 I have coding for much of what is needed, using the modified pdfTeX.
 But there is a lot that still needs to be added; e.g. PDF’s table model,
 References, footnotes, etc.

 Somewhat away from the original topic, but it strikes me that building a
 tagged PDF is going to be much more problematic at the macro layer than
 at the engine level: is that fair? 
 
 Certainly one needs help at the engine level, to build the tree
 structures: what is a parent/child of what else.

Yes, I didn't mean that engine support isn't required, but that some of
the more complex concepts are probably at the macro layer. You know a
lot more about this than I do, but I assume that there is more to tagged
PDFs than sectioning (which is relatively easy to define). For example,
as a chemist I'd guess one has to worry about chemical formulae and
about reference numbers to compounds. (We tend to give the latter in
bold and they commonly refer to graphics representing the structures.
That looks very tricky to me to express in a tagged form!)

 But macros are needed to determine where new structure starts
 and finishes.
 Think  \section  and friends, list environments, \item  etc.

Yes, those elements seem relatively clear. As I've noted in another
reply, ConTeXt MkIV has moved to a more XML-like \startitem ...
\stopitem construct as the preferred way to deal with (here) items, I
guess in part as that makes such things easier. As a user that's
slightly more tricky: I'd say that the ideal that one item is ended by
the start of the next is pretty clear :-)

 Indicators must go in at a high level, before these are decomposed
 into the content:  letters, font-switches, etc.

Again, understood and I think reasonably clear for the macro level.

 In short, determining where structure is to be found is *much* harder
 at the engine level; but doing the book-keeping to preserve that
 structure, once known, is definitely easier when done at that level.

Makes sense. One can imagine constructing a tree at the macro level but
as there still needs to be some tagging I guess it doesn't help. (Can
the latter be done using \specials?)

 Philip Taylor is correct in thinking that such things can be
 better controlled in XML. But there the author has to put in
 the extra verbose markup for themselves --- hopefully with help
 from some kind of interface.
 However, that can involve a pretty steep learning curve anyway.

 Word has had styles for decades, but how many authors actually
 make proper use of them?  e.g. linking one style to another,
 setting space before  after, rather than just using newlines,
 and inserting space runs instead of setting tabs.
 How many even know of the difference between return  and
 Shift-return  (or is it Option-return ) ?

:-)

 The point of (La)TeX is surely to allow the human author
 to not worry too much about detailed structure, but still allow
 sufficient hints (via the choice of environments and macros used)
 that most things should be able to be worked out.

That's the plan, I guess.
--
Joseph Wright




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Joseph Wright
On 27/04/2015 07:35, Philip Taylor wrote:
 Going even further off-topic, but pursuing this one aspect of the
 thread, is there not only real one problem :  the need to educate users
 to cease marking up their documents in raw (La)TeX syntax, and instead
 to express them in well-formed XML ?  I have just finished typesetting
 (using [plain] XeTeX) a 544pp book marked up entirely in XML, and whilst
 I have made no efforts to generate PDF/UA, I am convinced that the task
 of so doing (assuming that the necessary primitives are or were
 available in XeTeX) would have been 1/1000 of the effort needed to do so
 had the book been marked up in traditional (La)TeX syntax with its usual
 accompanying conflation of form and content.

As Ross says in a parallel message, XML raises different issues and is
not a panacea. For a start, we can ask if XML is a particularly good
format not only here or for anything (there's a blog post by Linus
Torvalds suggesting the answer is 'no'!). Assuming XML is at some level
a good plan, that still doesn't make it a good plan for the end user nor
ensure that the end sure will stick to logical structures. There's also
the business that TeX is useful because sometimes we do need some visual
adjustment or programming element.

LaTeX2e is already not bad for structure if used in the right way, and
ConTeXt MkIV has gone further along an XML-like road without using this
as the native syntax (\startsection/\stopsection for example), and of
course plain users can define similar structures (indeed without the
constraints that LaTeX has of needing not to break things).
--
Joseph Wright



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Ross Moore
Hi Joseph,

On 27/04/2015, at 4:19 PM, Joseph Wright wrote:

 On 27/04/2015 00:22, Ross Moore wrote:
 But of course that doesn't address the problem for LaTeXt users until
 someone writes a suitable/comparable package (maybe someone did
 already, I didn't try to follow).
 
 I have coding for much of what is needed, using the modified pdfTeX.
 But there is a lot that still needs to be added; e.g. PDF’s table model,
 References, footnotes, etc. 
 
 Somewhat away from the original topic, but it strikes me that building a
 tagged PDF is going to be much more problematic at the macro layer than
 at the engine level: is that fair?

Certainly one needs help at the engine level, to build the tree
structures: what is a parent/child of what else.

But macros are needed to determine where new structure starts
and finishes.
Think  \section  and friends, list environments, \item  etc.

Indicators must go in at a high level, before these are decomposed
into the content:  letters, font-switches, etc.


In short, determining where structure is to be found is *much* harder 
at the engine level; but doing the book-keeping to preserve that 
structure, once known, is definitely easier when done at that level.



Philip Taylor is correct in thinking that such things can be 
better controlled in XML. But there the author has to put in 
the extra verbose markup for themselves --- hopefully with help
from some kind of interface.
However, that can involve a pretty steep learning curve anyway.

Word has had styles for decades, but how many authors actually
make proper use of them?  e.g. linking one style to another,
setting space before  after, rather than just using newlines,
and inserting space runs instead of setting tabs.
How many even know of the difference between return  and 
Shift-return  (or is it Option-return ) ?


The point of (La)TeX is surely to allow the human author
to not worry too much about detailed structure, but still allow
sufficient hints (via the choice of environments and macros used) 
that most things should be able to be worked out.


In particular, you need to hack into  \everypar  to determine
where the TeX mode switches from vertical to horizontal.
(LaTeX already does this, so it is delicate programming to mix
in what (La)TeX wants with what is needed for tagging.)

Doing it this way keeps things well hidden from the author,
who most likely just doesn't want to know anyway.


 Deciding what elements of a document
 are 'structure' is hard, and in 'real' documents it's not unusual to see
 a lot of input that's more about appearance than structure. That of
 course isn't limited to TeX: I suspect anyone trying to generate tagged
 output has the same concern (users do odd things).

Absolutely, as in my Word examples above.

LaTeX wants you to use a \section-like command, rather than
switching to bold-face, perhaps after inserting vertical space.
But if a human can recognise this, it should also be possible
to program TeX to recognise it. A really friendly system would
pause and question the author, perhaps with several options
available on how to proceed --- TeX can do this.
And TeX has a  \nonstopmode  to override such stoppages.


 --
 Joseph Wright

Enough on this for now.  This is surely a topic for TUG-2015.
By then we should know when the revised ADA Section 508 
will come into effect
--- or if it has been delayed or watered down. :-)


Cheers,

Ross


Ross Moore

Senior Lecturer
Mathematics Department  |   Level 2, E7A 
Macquarie University, NSW 2109, Australia
T: +61 2 9850 8955   |  F: +61 2 9850 8114
M: +61 407 288 255  |  http://www.maths.mq.edu.au/



CRICOS Provider Number 2J. Think before you print. Please consider the 
environment before printing this email.

This message is intended for the addressee named and may contain confidential 
information. If you are not the intended recipient, please delete it and notify 
the sender. Views expressed in this message are those of the individual sender, 
and are not necessarily the views of Macquarie University.



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Joseph Wright
On 27/04/2015 01:05, Douglas McKenna wrote:
 Joseph Wright wrote:
 
 \def\{0}\expandafter\def\csname^00022\endcsname{1}
 \ifnum\=0 \message{tex82}\else\message{newstuff}\fi
 
 When I implemented a Unicode escape sequence extension using double-caret 
 notation in the JSBox TeX-language interpreter I've been working on (which is 
 all 21-bit Unicode internally, all the time, but can be configured at 
 run-time to be 8-bit input only), I was unaware of what XeTeX had 
 implemented, so I just used
 
 ^^u (for 16-bit, BMP codes)
 ^^Uxx (for all 21-bit Unicode code points)
 
 Seemed straightforward enough.

XeTeX conventions have been picked up by LuaTeX on this, and there's
been some 'feedback' from LuaTeX to XeTeX to give us some
standardisation for Unicode primitives/syntax (admittedly with bugs, but
that's a different point). I'd hope that any future Unicode TeX-like
systems would also pick up on the model used by XeTeX/LuaTeX.

 Given that the number of TeX input files using ^^u is likely miniscule, and 
 the number of those that follow the ^^u or ^^U with four or six hex digits is 
 even smaller, it seemed like a worthwhile benefit vs. cost, 
 compatibility-wise.  Maybe there's something I've not thought out well.

I didn't mean that there would be many real-world docs with this issue.
I was trying to point out that it's almost impossible to imagine that a
Unicode TeX-like engine could be used as a drop-in replacement for the
current 8-bit ones (pdfTeX most obviously), so when we talk about 'the
future' we have to mean 'for documents written assuming Unicode' rather
than 'for all existing TeX documents'. (For mathematicians the latter
point is very important.)

 This discussion I just found is both pertinent and frightening, I suppose:
 
 http://stackroulette.com/tex/62725/the-notation-in-various-engines

That's a (questionable) reuse of the info from

http://tex.stackexchange.com/questions/62725/the-notation-in-various-engines

Note that the discussion is editable (wiki-like) and to my knowledge is
still correct as-is. There are some tricky issues in XeTeX, particularly
related to non-BMP chars, partly because working out what should happen
here has been a work-in-progress.
--
Joseph Wright


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Simon Cozens
On 27/04/2015 19:39, Philip Taylor wrote:
 As to whether XML is a particularly good format not only here or for
 anything, all I can say is that in my experience we (humanity, that is)
 have not yet come up with anything better; LaTeX 2e, by explicitly
 permitting the conflation of form and content, fails abysmally in this
 respect (IMHO, of course).

For what it's worth, SILE's approach to this is to have pluggable input
parsers, shipping with an XML and a TeX-like parser by default.

People who use software tools to author their documents, or convert them
in from other sources, can use the XML syntax; people authoring by hand
can use the TeX-like syntax. The two syntaces are isomorphic:

 foo thing=wibblebar/foo
is equivalent to
 \foo[thing=wibble]{bar}
and is also equivalent to
 \begin[thing=wibble]{foo}bar\end{foo}

This means that if you have an XML document you want to typeset, you can
define processing expectations for its tags in an auxiliary class:

 SILE.registerCommand(foo, function(options,content)
SILE.process(content)
if options.thing == wibble then
SILE.typeset( (wobble))
end
 end)

 % Or even \define[command=foo]{\dowhatever{\process}}

and then load in the class on the command line; the upshot being you can
then feed the XML file directly to SILE without having to mess about
with XSLT or whatever.

I haven't tried creating tagged PDFs with SILE yet - there isn't support
for this in the libtexpdf library so it would mean messing about with
raw PDF specials (essentially what luatex was doing). I don't need the
functionality myself right now, so it's not a priority.

But if this is going to be a big deal, and it sounds like it might be,
then it could be worth adding specials for PDF tagging into libtexpdf
and dvipdfmx.

S


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Philip Taylor


Zdenek Wagner wrote:

 If you want to describe the page layout in a structured way, the page
 ... /page markup might cause other mark up not well formed. Of course,
 if the XML is for your own purpose, you can do anything. Even if it is
 not suggested by XML experts, it may be helpful. And if you like to use
 XML regularly, I would strongly suggest to learn one of the schema
 languages (Relax NG or W3C Schema) and write the document using a
 validating XML editor. You have to invest some time but then it saves
 you a lot of work.

The data are entered as Excel, imported from Excel into oXygen,
validated against a custom schema auto-derived from the master sources
and then exported from oXygen as well-formed XML.

 Yes, I agree, I do the same. For instance, I do not use the geometry
 package but I have my own zwpagelayout. And if you develop something,
 you certainly learn a lot. It might be useful for you to see how SAX
 processors work. It is not necessary to study a code of any of them,
 just to see a brief description which SAX events are generated. It can
 be a good inspiration for your work.

Thank you, Zdeněk, I shall investigate as you recommend.
** Phil.


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Zdenek Wagner
2015-04-27 15:00 GMT+02:00 Philip Taylor p.tay...@rhul.ac.uk:



 Zdenek Wagner wrote:

  If you want to describe the page layout in a structured way, the page
  ... /page markup might cause other mark up not well formed. Of course,
  if the XML is for your own purpose, you can do anything. Even if it is
  not suggested by XML experts, it may be helpful. And if you like to use
  XML regularly, I would strongly suggest to learn one of the schema
  languages (Relax NG or W3C Schema) and write the document using a
  validating XML editor. You have to invest some time but then it saves
  you a lot of work.

 The data are entered as Excel, imported from Excel into oXygen,
 validated against a custom schema auto-derived from the master sources
 and then exported from oXygen as well-formed XML.


OK, oXygen is a good editor, I use it too. It supports also NVDL, you can
plug in any schema. It can even switch the spellcheck according to lang and
xml:lang attributes. It was my feature request several years ago and they
implemented it.


  Yes, I agree, I do the same. For instance, I do not use the geometry
  package but I have my own zwpagelayout. And if you develop something,
  you certainly learn a lot. It might be useful for you to see how SAX
  processors work. It is not necessary to study a code of any of them,
  just to see a brief description which SAX events are generated. It can
  be a good inspiration for your work.

 Thank you, Zdeněk, I shall investigate as you recommend.
 ** Phil.




Zdeněk Wagner
http://hroch486.icpf.cas.cz/wagner/
http://icebearsoft.euweb.cz


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Philip Taylor


Zdenek Wagner wrote:

 Formatting advices should better be present in processing instructions.
 You can type ?TeX something? ?HTML something else ? etc.

Why do you write better, Zdeněk ?  They /can/ be expressed as
processing instructions, but that would require an augmented parser;
they can equally well be expressed as attributes or as a pragmat -- I
do not think that either is better, they are simply different.  XML is
an /eXtensible/ Markup Language -- one should never hesitate to extend it.

 Such parsers already exist in luatex and ConTeXt. I only know about
 them, I have never used them. And FO may be processed by passiveTeX. And
 there are, of course,  numerous other tools.

All of which will typically have a steep learning curve.  In my
experience, it is usually better to write a tool for oneself; in that
way, not only does one have an innate understanding of how it works, one
can easily amend or augment it if it fails to deliver.  Other people's
tools, on the other hand, rarely deliver /exactly/ what is required, and
it is only too easy to blame the tool when one's own package fails to
deliver in turn, rather than attempt to fix a tool which is very easy to
exploit but very hard to modify.  Far too many example of this abound in
the real world for me to single out any one as an example, but suffice
to say that one does not need to look outside the TeX world in order to
find examples ...

** Phil.



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Zdenek Wagner
2015-04-27 13:57 GMT+02:00 Philip Taylor p.tay...@rhul.ac.uk:



 Zdenek Wagner wrote:

  Formatting advices should better be present in processing instructions.
  You can type ?TeX something? ?HTML something else ? etc.

 Why do you write better, Zdeněk ?  They /can/ be expressed as
 processing instructions, but that would require an augmented parser;
 they can equally well be expressed as attributes or as a pragmat -- I
 do not think that either is better, they are simply different.  XML is
 an /eXtensible/ Markup Language -- one should never hesitate to extend it.


If you want to describe the page layout in a structured way, the page ...
/page markup might cause other mark up not well formed. Of course, if the
XML is for your own purpose, you can do anything. Even if it is not
suggested by XML experts, it may be helpful. And if you like to use XML
regularly, I would strongly suggest to learn one of the schema languages
(Relax NG or W3C Schema) and write the document using a validating XML
editor. You have to invest some time but then it saves you a lot of work.


  Such parsers already exist in luatex and ConTeXt. I only know about
  them, I have never used them. And FO may be processed by passiveTeX. And
  there are, of course,  numerous other tools.

 All of which will typically have a steep learning curve.  In my
 experience, it is usually better to write a tool for oneself; in that
 way, not only does one have an innate understanding of how it works, one
 can easily amend or augment it if it fails to deliver.  Other people's
 tools, on the other hand, rarely deliver /exactly/ what is required, and
 it is only too easy to blame the tool when one's own package fails to
 deliver in turn, rather than attempt to fix a tool which is very easy to
 exploit but very hard to modify.  Far too many example of this abound in
 the real world for me to single out any one as an example, but suffice
 to say that one does not need to look outside the TeX world in order to
 find examples ...


Yes, I agree, I do the same. For instance, I do not use the geometry
package but I have my own zwpagelayout. And if you develop something, you
certainly learn a lot. It might be useful for you to see how SAX processors
work. It is not necessary to study a code of any of them, just to see a
brief description which SAX events are generated. It can be a good
inspiration for your work.


 ** Phil.


 Zdeněk Wagner
 http://hroch486.icpf.cas.cz/wagner/
 http://icebearsoft.euweb.cz




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Philip Taylor


Zdenek Wagner wrote:

 if you use para indentation=none looseness=1Something.../para
 you are not using XML properly. The purpose of XML is to describe the
 structure of a document, not its appearance.

Yes, the element describes the structure; attributes such as those
you quote in the example above convey hints about its intended
appearance. There is no conflict -- software wishing to ascertain the
structure interrogates the elements; software wishing to depict the
structure visually can make use of the formatting attributes if it so
chooses.

 Generally you will print the information from the XML file in a 
 different order ...

Indeed we do; notes are taken out of the flow and re-set as end-notes.
This behaviour is unaffected by the use of formatting attributes.

 As the second step you will run XSLT in order to extract the elements
 that have to be printed and finally XSL-FO in order to format the
 output. 

That is /a/ methodology; there is nothing written in tablets of stone
that says that one is required to follow it.  The XML sources to which I
refer /could/ be processed using such a methodology, because the XML is
well-formed; in practice, it is far far simpler to implement an XML
parser and formatter using XeTeX, which is what I have elected to do.

** Phil.


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Zdenek Wagner
2015-04-27 13:27 GMT+02:00 Philip Taylor p.tay...@rhul.ac.uk:



 Zdenek Wagner wrote:

  if you use para indentation=none looseness=1Something.../para
  you are not using XML properly. The purpose of XML is to describe the
  structure of a document, not its appearance.

 Yes, the element describes the structure; attributes such as those
 you quote in the example above convey hints about its intended
 appearance. There is no conflict -- software wishing to ascertain the
 structure interrogates the elements; software wishing to depict the
 structure visually can make use of the formatting attributes if it so
 chooses.


Formatting advices should better be present in processing instructions. You
can type ?TeX something? ?HTML something else ? etc.


  Generally you will print the information from the XML file in a
  different order ...

 Indeed we do; notes are taken out of the flow and re-set as end-notes.
 This behaviour is unaffected by the use of formatting attributes.

  As the second step you will run XSLT in order to extract the elements
  that have to be printed and finally XSL-FO in order to format the
  output.

 That is /a/ methodology; there is nothing written in tablets of stone
 that says that one is required to follow it.  The XML sources to which I
 refer /could/ be processed using such a methodology, because the XML is
 well-formed; in practice, it is far far simpler to implement an XML
 parser and formatter using XeTeX, which is what I have elected to do.


Such parsers already exist in luatex and ConTeXt. I only know about them, I
have never used them. And FO may be processed by passiveTeX. And there are,
of course,  numerous other tools.


 ** Phil.



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Apostolos Syropoulos

 As to whether XML is a particularly good format not only here or for
 anything, all I can say is that in my experience we (humanity, that is)
 have not yet come up with anything better; LaTeX 2e, by explicitly
 permitting the conflation of form and content, fails abysmally in this
 respect (IMHO, of course).
 
 

Well I think that JSON is currently the next hot thing in computing together 
with big data.
Many consider that JSON will eventually replace XML. Also, about the PDF format 
I think that
archivable PDF


http://www.digitalpreservation.gov/formats/fdd/fdd000252.shtml

is very important.

A.S.
--
Apostolos Syropoulos
Xanthi, Greece


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Douglas McKenna
Ross Moore wrote -

 On 27/04/2015, at 10:05 AM, Douglas McKenna wrote:
 
 Given that the number of TeX input files using ^^u is likely miniscule, and 
 the number of those that follow the ^^u or ^^U with four or six hex digits 
 is even smaller, it seemed like a worthwhile benefit vs. cost, 
 compatibility-wise.  Maybe there's something I've not thought out well.
 
 For user-input files, then yes it is probably very small.
 But such constructions figure to be used a lot within package sources
 --- precisely to create macros that shield users from the syntax.
 
 For example, try in a terminal:
 
   grep \^\^\^\^ `kpsewhich unicode-math.dtx` | wc -l
  
 There are 160 lines of input and/or macro definitions.
 (four of these use ^ )
 Doubtless packages supporting other languages are similar.

Well, yes, but that's a separate issue.  Nothing says that both syntaxes can't 
be supported in an engine to maximize compatibility with both older packages 
and newer user input.

Regardless, as the future unfolds, more and more input files are going to be 
UTF-8 text files, and the need to escape 4-digit or 6-digit Unicode code points 
in a pure ASCII environment is going to gradually disappear.

Doug McKenna




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread David Carlisle
On 27 April 2015 at 01:05, Douglas McKenna d...@mathemaesthetics.com
wrote:

 Joseph Wright wrote:

  \def\{0}\expandafter\def\csname^00022\endcsname{1}
  \ifnum\=0 \message{tex82}\else\message{newstuff}\fi

 When I implemented a Unicode escape sequence extension using double-caret
 notation in the JSBox TeX-language interpreter I've been working on (which
 is all 21-bit Unicode internally, all the time, but can be configured at
 run-time to be 8-bit input only), I was unaware of what XeTeX had
 implemented, so I just used

 ^^u (for 16-bit, BMP codes)
 ^^Uxx (for all 21-bit Unicode code points)

 Seemed straightforward enough.


Introducing incompatible syntax for functionality shared across Unicode TeX
extensions
is a major headache for package authors. Please don't use this syntax!


 In the first case, if any one of the four 'x's is not a lowercase hex
 digit, interpretation reverts to the standard TeX escape sequence ^^u
 (ASCII '5'), followed by four input characters, at least one of which is
 not a hex digit.  Similarly for the six hex digit case, for whatever
 character ^^U converts to, if at least one of the six characters following
 is not a hex digit.


In the abstract this isn't an unreasonable syntax but ,^,^^ has
been in use for years
in xetex and luatex.


 Given that the number of TeX input files using ^^u is likely miniscule,
 and the number of those that follow the ^^u or ^^U with four or six hex
 digits is even smaller, it seemed like a worthwhile benefit vs. cost,
 compatibility-wise.  Maybe there's something I've not thought out well.


Incompatible syntax makes supporting cross platform formats like latex much
more difficult than it would otherwise be,



 This discussion I just found is both pertinent and frightening, I suppose:

 http://stackroulette.com/tex/62725/the-notation-in-various-engines


that's old and relates to old bugs isn't it?




 Doug McKenna



David


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Douglas McKenna
David Carlisle wrote -

 Introducing incompatible syntax for functionality shared across Unicode TeX 
 extensions
 is a major headache for package authors. Please don't use this syntax! 

Understood.

 That isn't at all clear, I don't see any evidence that the equivalent #; 
 notation in XML is getting less used. For runs of natural language text then 
 clearly using character data directly makes more sense
  but to get specific symbols accessing by code point often makes sense. 
 
 To get a math bold A, It is much easier to tell someone to enter ^1d400 
 than to tell them how
 to enter 퐀 in whatever system they are using.

Of course, every Unicode reference on the web would be referring to it as 
U+1D4000.  Sigh.

Anyway, duly noted.  Except in the future, whatever system they are using 
will likely be able to handle the UTF-8 character as direct input, just like it 
appears (in my email reader) above as a math bold A.

Thanks.

Doug McKenna




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread David Carlisle
On 27 April 2015 at 23:41, Douglas McKenna d...@mathemaesthetics.com
wrote:

 Ross Moore wrote -

  On 27/04/2015, at 10:05 AM, Douglas McKenna wrote:
 
  Given that the number of TeX input files using ^^u is likely miniscule,
 and the number of those that follow the ^^u or ^^U with four or six hex
 digits is even smaller, it seemed like a worthwhile benefit vs.
 Regardless, as the future unfolds, more and more input files are going to
 be UTF-8 text files, and the need to escape 4-digit or 6-digit Unicode code
 points in a pure ASCII environment is going to gradually disappear.


That isn't at all clear, I don't see any evidence that the equivalent
#; notation in XML is getting less used. For runs of natural language
text then clearly using character data directly makes more sense
 but to get specific symbols accessing by code point often makes sense.

To get a math bold A, It is much easier to tell someone to enter ^1d400
than to tell them how
to enter 퐀 in whatever system they are using.

David


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-27 Thread Ross Moore
Hi all,

On 28/04/2015, at 0:40, Apostolos Syropoulos asyropou...@yahoo.com wrote:

 
 As to whether XML is a particularly good format not only here or for
 anything, all I can say is that in my experience we (humanity, that is)
 have not yet come up with anything better; LaTeX 2e, by explicitly
 permitting the conflation of form and content, fails abysmally in this
 respect (IMHO, of course).
 
 
 
 Well I think that JSON is currently the next hot thing in computing together 
 with big data.
 Many consider that JSON will eventually replace XML. Also, about the PDF 
 format I think that
 archivable PDF
 
 
 http://www.digitalpreservation.gov/formats/fdd/fdd000252.shtml
 
 is very important.

Agreed.
Now PDF/A-1a, PDF/A-2a, PDF/A-3a are all accessible tagged PDF
(whereas the 'b' and 'u' sub levels need not be tagged).
It isn't much more to get PDF/UA from PDF/A-1a, etc, and so have validation for 
both.
This should be a major aim of our community.

 
 A.S.
 --
 Apostolos Syropoulos
 Xanthi, Greece


Ross


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Joseph Wright
On 26/04/2015 11:47, Philip Taylor wrote:
 To my mind, XeTeX /is/ the future of TeX.  The days of entering
 français as fran\c cais are surely numbered, and it has never been
 possible to enter العربية, ελληνικά or עברית (etc) in an analogous
 way.  Therefore, is it not time to petition the TUG Board to adopt XeTeX
 as a formal TUG project, and to allocate adequate funding to ensure not
 only its continued existence but its continued development, at least
 until such time as a clearly superior alternative not only emerges but
 becomes adopted as the /de facto/ replacement for TeX ?
 
 Philip Taylor

The problem as always is not so much money as people. [Also, you do know
about LuaTeX, yes? ;-) More seriously, XeTeX isn't a drop-in replacement
for TeX90/pdfTeX.]
--
Joseph Wright




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Philip Taylor
To my mind, XeTeX /is/ the future of TeX.  The days of entering
français as fran\c cais are surely numbered, and it has never been
possible to enter العربية, ελληνικά or עברית (etc) in an analogous
way.  Therefore, is it not time to petition the TUG Board to adopt XeTeX
as a formal TUG project, and to allocate adequate funding to ensure not
only its continued existence but its continued development, at least
until such time as a clearly superior alternative not only emerges but
becomes adopted as the /de facto/ replacement for TeX ?

Philip Taylor


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Philip Taylor


Joseph Wright wrote:

 See for example details in
 http://tex.stackexchange.com/questions/86/what-are-the-incompatibilities-of-pdftex-xetex-and-luatex
 for places where there are edge cases. The most obvious would be that
 XeTeX requires the xdvipdfmx back-end (so differences at the \special
 level), 

Yes, I accept that, but to the user (as I have argued elsewhere), XeTeX
subsumes 'xdvipdfmx' -- the fact that they are, historically, two
separate pieces of software and are separately maintained is a sad fact
of life but not one that the user of XeTeX should be required to consider.

but a simple piece of code
 
 \def\{0}\expandafter\def\csname^00022\endcsname{1}
 \ifnum\=0 \message{tex82}\else\message{newstuff}\fi
 
 (ConTeXt wiki) gives different results with TeX90 and XeTeX due to
 different treatment of more than two ^^ (catcode 7) in a row.

OK, agreed: by adding support for wider characters, some breakages will,
almost of necessity occur, but I would respectfully argue that these are
pathological cases that will not impact real-world documents.

** Phil.


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Youcef Mohammed
بارك الله فيك أخي خالد...

2015-04-26 13:08 UTC+01:00, Jonathan Kew jfkth...@gmail.com:
 On 25/4/15 18:31, Khaled Hosny wrote:
 Due to lack of time, skills and motivation, I’ll be no longer able to
 work on XeTeX, so I’m stepping down as a maintainer.

 Regards,
 Khaled


 Many thanks, Khaled, for all you've done over the past few years to
 maintain and improve XeTeX. Your contribution to the project has been
 invaluable, and is greatly appreciated.

 Jonathan



 --
 Subscriptions, Archive, and List information, etc.:
   http://tug.org/mailman/listinfo/xetex




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Philip Taylor


Joseph Wright wrote:

 The problem as always is not so much money as people.

Yes, I do appreciate that, but sometimes money is also an obstacle (do
I work on X, which will help keep a roof over my head, or on XeTeX,
which may bring me fame but which may also result in my eviction ?).

 [Also, you do know about LuaTeX, yes? ;-)

Yes, of course, but I see it as an evolutionary dead-end, much as I
would wish to see it otherwise.

 More seriously, XeTeX isn't a drop-in replacement for TeX90/pdfTeX.]

I have yet to find a legacy document which behaves differently (legacy
Plain TeX, that is, not legacy LaTeX); if you can point me at one, I
should be interested to experience the differences for myself.

** Phil.


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Philip Taylor


Joseph Wright wrote:

 The problem is that WEB code is *hard* to work with!

Some would argue (and I am one of them) that it was far easier before
Web2C became involved ...  But now we digress, so please do not feel
obliged to respond.

** Phil.


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Jonathan Kew

On 25/4/15 18:31, Khaled Hosny wrote:

Due to lack of time, skills and motivation, I’ll be no longer able to
work on XeTeX, so I’m stepping down as a maintainer.

Regards,
Khaled



Many thanks, Khaled, for all you've done over the past few years to 
maintain and improve XeTeX. Your contribution to the project has been 
invaluable, and is greatly appreciated.


Jonathan



--
Subscriptions, Archive, and List information, etc.:
 http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Dominik Wujastyk
Thank you, Khaled, for all you've done.

Dominik Wujastyk

--
Dr Dominik Wujastyk
Department of South Asia, Tibetan and Buddhist Studies
http://stb.univie.ac.at,
University of Vienna,
Spitalgasse 2-4, Courtyard 2, Entrance 2.1
1090 Vienna, Austria
and
Adjunct Professor,
Division of Health and Humanities,
St. John's Research Institute, http://www.sjri.res.in/ Bangalore, India.
Project http://www.istb.univie.ac.at/caraka/ | home page
http://univie.academia.edu/DominikWujastyk | HSSA
http://hssa.sayahna.org | PGP http://wujastyk.net/pgp.html




On 25 April 2015 at 11:31, Khaled Hosny khaledho...@eglug.org wrote:

 Due to lack of time, skills and motivation, I’ll be no longer able to
 work on XeTeX, so I’m stepping down as a maintainer.

 Regards,
 Khaled


 --
 Subscriptions, Archive, and List information, etc.:
   http://tug.org/mailman/listinfo/xetex



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Joseph Wright
On 26/04/2015 12:16, Philip Taylor wrote:
 
 
 Joseph Wright wrote:
 
 See for example details in
 http://tex.stackexchange.com/questions/86/what-are-the-incompatibilities-of-pdftex-xetex-and-luatex
 for places where there are edge cases. The most obvious would be that
 XeTeX requires the xdvipdfmx back-end (so differences at the \special
 level), 
 
 Yes, I accept that, but to the user (as I have argued elsewhere), XeTeX
 subsumes 'xdvipdfmx' -- the fact that they are, historically, two
 separate pieces of software and are separately maintained is a sad fact
 of life but not one that the user of XeTeX should be required to consider.

Still requires changes in a document, particularly one written for
pdfTeX in PDF mode (certainly for plain: for LaTeX of course this is
more transparent).

 but a simple piece of code

 \def\{0}\expandafter\def\csname^00022\endcsname{1}
 \ifnum\=0 \message{tex82}\else\message{newstuff}\fi

 (ConTeXt wiki) gives different results with TeX90 and XeTeX due to
 different treatment of more than two ^^ (catcode 7) in a row.
 
 OK, agreed: by adding support for wider characters, some breakages will,
 almost of necessity occur, but I would respectfully argue that these are
 pathological cases that will not impact real-world documents.

My point though is that neither XeTeX nor indeed any other Unicode
TeX-like engine can be used as a direct replacement for an 8-bit engine:
contrast the fact that the standard engine for TeX Live is nowadays
pdfTeX used as a direct drop-in replacement for TeX90 (with the
exception of using tex, which is Kunth's TeX unaltered). As such,
whilst new documents may be written using a Unicode engine, pdfTeX will
remain vital.

All that said, I am keen that some way is found to continue to work on
XeTeX. The problem is that WEB code is *hard* to work with!
--
Joseph Wright



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Joseph Wright
On 26/04/2015 12:00, Philip Taylor wrote:
 
 
 Joseph Wright wrote:
 
 The problem as always is not so much money as people.
 
 Yes, I do appreciate that, but sometimes money is also an obstacle (do
 I work on X, which will help keep a roof over my head, or on XeTeX,
 which may bring me fame but which may also result in my eviction ?).
 
 [Also, you do know about LuaTeX, yes? ;-)
 
 Yes, of course, but I see it as an evolutionary dead-end, much as I
 would wish to see it otherwise.
 
 More seriously, XeTeX isn't a drop-in replacement for TeX90/pdfTeX.]
 
 I have yet to find a legacy document which behaves differently (legacy
 Plain TeX, that is, not legacy LaTeX); if you can point me at one, I
 should be interested to experience the differences for myself.
 
 ** Phil.

See for example details in
http://tex.stackexchange.com/questions/86/what-are-the-incompatibilities-of-pdftex-xetex-and-luatex
for places where there are edge cases. The most obvious would be that
XeTeX requires the xdvipdfmx back-end (so differences at the \special
level), but a simple piece of code

\def\{0}\expandafter\def\csname^00022\endcsname{1}
\ifnum\=0 \message{tex82}\else\message{newstuff}\fi

(ConTeXt wiki) gives different results with TeX90 and XeTeX due to
different treatment of more than two ^^ (catcode 7) in a row.
--
Joseph Wright



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Ross Moore
Hi Mojca,

 On 27 Apr 2015, at 6:53 am, Mojca Miklavec mojca.miklavec.li...@gmail.com 
 mailto:mojca.miklavec.li...@gmail.com wrote:
 
 On Sun, Apr 26, 2015 at 10:26 PM, Ross Moore wrote:
 
 No standard TeX implementation currently comes close to producing Tagged PDF.
 
 ConTeXt MkIV does:
https://www.tug.org/TUGboat/tb31-3/tb99hagen.pdf 
 https://www.tug.org/TUGboat/tb31-3/tb99hagen.pdf

Yes; I’m aware of what Hans can achieve, and hold him in awe. :-)
Besides, this uses LuaTeX.  viz. this quote from the end of Hans’ article.
“Also, it is yet another nice test case and torture test for LuaTEX and it 
helps us to find buglets and oversights.” 

That is precisely why I used the word “standard” qualifying “TeX installation” 
in my statement above.

 
 But of course that doesn't address the problem for LaTeXt users until
 someone writes a suitable/comparable package (maybe someone did
 already, I didn't try to follow).

I have coding for much of what is needed, using the modified pdfTeX.
But there is a lot that still needs to be added; e.g. PDF’s table model,
References, footnotes, etc. 

 
 Mojca
 
 PS: Our government is still mainly depending on documents with a doc
 extension.

Right. Conversion to PDF requires Adobe’s converters.
There are known bugs — but this is doubtless being worked on.

The point is that, for people wishing to use TeX-based software to
produce PDFs, then extra converters or manual conversion techniques
(e.g., using Acrobat Pro) will be required to produce a valid PDF/UA document.
Unless, that is, our community takes this seriously and creates a major project.

Another quote from Han’s article:
 “This is a typical case where more energy has to be spent on driving the voice 
of Acrobat but I will do that when we find a good reason.” 

That reason is getting much, much closer.


All the best,

Ross




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Dominik Wujastyk
I'd like to add a voice of support to Philip's suggestion about petitioning
the TUG Board to adopt XeTeX as a formal TUG project.  To be honest, I was
startled that it doesn't already have such recognition.  There's a big
community of us using XeTeX and it really should be close to the heart of
TUG, and to have access to whatever support, recognition and responsibility
as TUG can give.

Dominik


--
​Dominik Wujastyk
University of Alberta​



On 26 April 2015 at 04:47, Philip Taylor p.tay...@rhul.ac.uk wrote:

 To my mind, XeTeX /is/ the future of TeX.  The days of entering
 français as fran\c cais are surely numbered, and it has never been
 possible to enter العربية, ελληνικά or עברית (etc) in an analogous
 way.  Therefore, is it not time to petition the TUG Board to adopt XeTeX
 as a formal TUG project, and to allocate adequate funding to ensure not
 only its continued existence but its continued development, at least
 until such time as a clearly superior alternative not only emerges but
 becomes adopted as the /de facto/ replacement for TeX ?

 Philip Taylor


 --
 Subscriptions, Archive, and List information, etc.:
   http://tug.org/mailman/listinfo/xetex



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Douglas McKenna
Joseph Wright wrote:

 \def\{0}\expandafter\def\csname^00022\endcsname{1}
 \ifnum\=0 \message{tex82}\else\message{newstuff}\fi

When I implemented a Unicode escape sequence extension using double-caret 
notation in the JSBox TeX-language interpreter I've been working on (which is 
all 21-bit Unicode internally, all the time, but can be configured at run-time 
to be 8-bit input only), I was unaware of what XeTeX had implemented, so I just 
used

^^u (for 16-bit, BMP codes)
^^Uxx (for all 21-bit Unicode code points)

Seemed straightforward enough.

In the first case, if any one of the four 'x's is not a lowercase hex digit, 
interpretation reverts to the standard TeX escape sequence ^^u (ASCII '5'), 
followed by four input characters, at least one of which is not a hex digit.  
Similarly for the six hex digit case, for whatever character ^^U converts to, 
if at least one of the six characters following is not a hex digit.

Given that the number of TeX input files using ^^u is likely miniscule, and the 
number of those that follow the ^^u or ^^U with four or six hex digits is even 
smaller, it seemed like a worthwhile benefit vs. cost, compatibility-wise.  
Maybe there's something I've not thought out well.

This discussion I just found is both pertinent and frightening, I suppose:

http://stackroulette.com/tex/62725/the-notation-in-various-engines


Doug McKenna




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Mojca Miklavec
On Sun, Apr 26, 2015 at 10:26 PM, Ross Moore wrote:

 No standard TeX implementation currently comes close to producing Tagged PDF.

ConTeXt MkIV does:
https://www.tug.org/TUGboat/tb31-3/tb99hagen.pdf

But of course that doesn't address the problem for LaTeXt users until
someone writes a suitable/comparable package (maybe someone did
already, I didn't try to follow).

Mojca

PS: Our government is still mainly depending on documents with a doc
extension.


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Ross Moore
Hi all,

On 26/04/2015, at 20:51, Joseph Wright joseph.wri...@morningstar2.co.uk wrote:

 On 26/04/2015 11:47, Philip Taylor wrote:
 To my mind, XeTeX /is/ the future of TeX.  The days of entering
 français as fran\c cais are surely numbered, and it has never been
 possible to enter العربية, ελληνικά or עברית (etc) in an analogous
 way.  Therefore, is it not time to petition the TUG Board to adopt XeTeX
 as a formal TUG project, and to allocate adequate funding to ensure not
 only its continued existence but its continued development, at least
 until such time as a clearly superior alternative not only emerges but
 becomes adopted as the /de facto/ replacement for TeX ?
 
 Philip Taylor
 
 The problem as always is not so much money as people. [Also, you do know
 about LuaTeX, yes? ;-) More seriously, XeTeX isn't a drop-in replacement
 for TeX90/pdfTeX.]

There is an even bigger issue which is going to affect the future of TeX.

http://www.access-board.gov/guidelines-and-standards/communications-and-it/about-the-ict-refresh/proposed-rule

The laws (in the US, but this will propagate) are going to becoming much 
tougher about requiring Accessibility of electronic documents, both for 
websites and PDFs.
Basically all PDFs (produced by government agencies for public consumption) 
*must* satisfy the PDF/UA published standard. That is, they must be Tagged PDF, 
and satisfy all the extra recommendations for enhancing Accessibility of the 
document's content and structure.
Being a legal requirement for Government Agencies has a knock-on effect for 
everyone, so TeX software will need to be enhanced to meet such requirements, 
else extinction becomes a real possibility.

No standard TeX implementation currently comes close to producing Tagged PDF.
LuaTeX, with it's extra scripting, has the potential to do so.
Extra primitives for pdfTeX go a long way, but require 1000s of extra lines of 
TeX/LaTeX coding to implement proper structure tagging without placing a burden 
on authors.
(Those primitives are not yet standard in pdfTeX, but are in a separate 
development branch.)

It may be possible to continue with a  .tex  — .dvi — .pdf  workflow, but I 
doubt it very much.
Structure tagging requires a completely separate tree-like view of a document’s 
structure and which must be interleaved with the content within the page-tree 
structure. Storing everything that will be required into the .dvi  file, on a 
page-page basis for later processing by a separate program, is unlikely to give 
a viable solution; at least not without substantial extension of  dvips , 
dvipdfmx, etc. and Ghostscript itself perhaps.

Direct production of the PDF by a single engine is surely the best approach.


 --
 Joseph Wright


Hope this helps,

Ross


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-26 Thread Ross Moore
Hi Doug,

On 27/04/2015, at 10:05 AM, Douglas McKenna wrote:

 Given that the number of TeX input files using ^^u is likely miniscule, and 
 the number of those that follow the ^^u or ^^U with four or six hex digits is 
 even smaller, it seemed like a worthwhile benefit vs. cost, 
 compatibility-wise.  Maybe there's something I've not thought out well.

For user-input files, then yes it is probably very small.
But such constructions figure to be used a lot within package sources
--- precisely to create macros that shield users from the syntax.

For example, try in a terminal:

  grep \^\^\^\^ `kpsewhich unicode-math.dtx` | wc -l
 
There are 160 lines of input and/or macro definitions.
(four of these use ^ )
Doubtless packages supporting other languages are similar.

Of course, since these are in packages the coding can be changed,
if engines need to be changed.
(Except that old versions will still have to be retained for those 
people who do not update to newer versions of the engine.)

 
 This discussion I just found is both pertinent and frightening, I suppose:
 
 http://stackroulette.com/tex/62725/the-notation-in-various-engines

Yeah. Thanks for this link. It is from July 2012
--- so maybe some of that incompatibility is fixed now?

If not, then TUG-2015 in Germany this July may be a good 
place to discuss the status of all this?


 
 
 Doug McKenna
 

Cheers,

Ross


Ross Moore

Senior Lecturer
Mathematics Department  |   Level 2, E7A 
Macquarie University, NSW 2109, Australia
T: +61 2 9850 8955   |  F: +61 2 9850 8114
M: +61 407 288 255  |  http://www.maths.mq.edu.au/



CRICOS Provider Number 2J. Think before you print. Please consider the 
environment before printing this email.

This message is intended for the addressee named and may contain confidential 
information. If you are not the intended recipient, please delete it and notify 
the sender. Views expressed in this message are those of the individual sender, 
and are not necessarily the views of Macquarie University.



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-25 Thread Gildas Hamel
* Skriv a reas Khaled Hosny (khaledho...@eglug.org):
  |  Due to lack of time, skills and motivation, I’ll be no longer able to
  |  work on XeTeX, so I’m stepping down as a maintainer.
  |  
  |  Regards,
  |  Khaled

I thank you for all the work, dedication, and grace you brought to XeTeX. I'm 
an end user who barely understands how it all works. I keep marveling at the 
wonders Xe(La)TeX and Lua(La)TeX typesetting allows me to do daily. Names like 
Khaled Hosny are part of my secret pantheon of creators.
--Gildas Hamel



--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-25 Thread Karljurgen Feuerherm
Dear Khaled,

Your efforts have been deeply appreciated. I can only imagine the time
this must have cost.

Best wishes in your other endeavours!

K

On 2015-04-25, 13:31, Khaled Hosny khaledho...@eglug.org wrote:

Due to lack of time, skills and motivation, I¹ll be no longer able to
work on XeTeX, so I¹m stepping down as a maintainer.

Regards,
Khaled


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex




--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-25 Thread David Carlisle
On 25 April 2015 at 18:31, Khaled Hosny khaledho...@eglug.org wrote:

 Due to lack of time, skills and motivation, I’ll be no longer able to
 work on XeTeX, so I’m stepping down as a maintainer.

 Regards,
 Khaled



Khaled,

thanks for all your work on this.

I have a horrible feeling that my tex--xet bug report the other day was
the final straw
But in any case I wish you well in whatever new projects you take up.

David


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex


Re: [XeTeX] XeTeX maintenance

2015-04-25 Thread Vafa Khalighi
Thanks Khaled for all your hard work. I am really grateful to you for the
effort you put in.

I am also interested in working on xetex as a maintainer.


--
Subscriptions, Archive, and List information, etc.:
  http://tug.org/mailman/listinfo/xetex