Re: Why not drop DVI ?

Michel Lavaud Wed, 17 Sep 2008 07:02:30 -0700

Jean-Marc Lasgouttes wrote:

On 16 Sept. 08 at 13:12, Michel Lavaud wrote:
On the other hand, I never noticed any problem of incorrect display with any dvi file I viewed.
While I have already witnessed problems like the ones you describe with PDF, I'll add to be fair that I would
not send a dvi to somebody because:
1/ the fonts are not included (so if the receiving machine has a different setup, you may have problems)
2/ the images are not included.

Thanks for raising the point. I agree it is the only one that has to be pointed out to beginners when using dvi. I also raised it in 2003 at the RMLL conference on Free Software in Metz. Well, the solution is in the question: send the required images and fonts together with the dvi in a zip archive, in the same way as the LyX executable is sent with its plugins and utility files in a zip archive (or would you not send LyX to somebody because the plugins and the utilities are not included in the exe? ;-). On Windows platforms, from Windows XP on, there is a built-in unzip utility, and I think one is also present on most Linux distributions. To take care of beginners on older Windows platforms, caring authors posting dvi on the net could add a pointer to an unzip.exe file with the mention "if you do not have the unzip utility, you can download it freely from this site", in the same way as caring authors who post pdf files add the mention "If you do not have Acrobat Reader installed, you can download it freely from the Adobe site". This could be solved even more simply at the level of the dvi viewers, if they were able to open zipped archives directly, but this would require asking developers to add this functionality to their viewers.

BTW, this obvious solution of sending the dvi file together with all its required files in an archive was used decades ago by Tomas Rokicki in his dvips software: he includes the images and the fonts necessary to display the document in the PS output. The only difference between dvi and ps is that, with ps, no codec is needed, because PostScript is a programming language, so it is sufficient to concatenate the additional files as subprograms to the main file.

To compare dvi and ps further: originally, the fonts were not included in ps files either, as they were available only in the printers (together with a PS interpreter, in the antique so-called "PostScript printers"). More generally, I think that criticizing (or praising) the dvi format, or the ps (or any other) format, for not including fonts and/or images is a moot point: there are pros and there are cons, depending on the case one has to deal with.
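
The zip-archive suggestion above could be sketched as follows. This is only a minimal illustration: the function name bundle_dvi and its parameters are hypothetical, and it assumes the author already knows which image and font files the document depends on (nothing here inspects the dvi file to discover them).

```python
import zipfile
from pathlib import Path


def bundle_dvi(dvi_path, extra_files, archive_path):
    """Pack a dvi file and the files it needs into one zip archive.

    Hypothetical helper: extra_files is whatever the author knows the
    document depends on (images, fonts); it is not detected automatically.
    """
    members = [Path(dvi_path)] + [Path(p) for p in extra_files]
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for member in members:
            # Store each file flat, under its base name only.
            archive.write(member, arcname=member.name)
    return [member.name for member in members]
```

Storing the files flat is a deliberate simplification: the recipient unzips everything into one directory, so references by bare file name can be found next to the dvi file. References that use subdirectories would need their relative paths preserved instead.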

Actually, I do not think that dvi has ever been advocated as an interchange document. But the simplicity of the format indeed means that very few bugs are seen when changing viewers.

Did you ever see a bug arising from dvi itself? I mean, not from beginners who use unofficial fonts in draft versions, so that the complete document is unreadable (I've seen that, actually). I've also seen a few bugs in an old version of a dvi viewer (Dview by B. Malyshev, otherwise an excellent viewer), but they have been corrected since.

Scientific articles are available in dvi format in the arXiv archive (and also in PS, and more recently pdf) and in many other public archives of scientific articles. It also seems to me that dvi files are usually viewable just by clicking on them in most Linux distributions (although I don't know all Linux distributions).

To elaborate further on what I said in my preceding post (using pdf for scientific articles instead of dvi is like going back from a written civilization to an oral one): a big conceptual problem with pdf (i.e. apart from bugs in Acrobat Reader or other viewers and changes in the pdf specifications) is that it does not go far enough towards mimicking what I would call "the ultimate format for scientific articles", i.e. a set of sheets of printed paper published in scientific journals and archived in University libraries. The main property of this "ultimate format", as far as science is concerned, is that if I go to a University library in Paris to read a given article, I am totally confident that a colleague or I will find exactly the same article in a Vladivostok or Peking library.

The physical characterization of a printed sheet of paper is that it is a two-dimensional white surface with tiny black spots of ink scattered on it. So an electronic equivalent of a printed article is a set of bitmap images of sufficiently high resolution, each defined as a sequence of rows of black and white points. Another, more elaborate electronic equivalent is a dvi file. It is more elaborate (and also more complete than the sheets of paper) because it further indicates that this group of black points is in fact the letter a, this other group is the letter \omega, etc. These indications are not included in the printed paper; only the mind of the reader can rebuild this information.

So, the dvi file has the same property as the printed article in the scientific journal (unfalsifiability), and it also adds further information, so that it is even superior to printed articles in scientific journals. Some printed articles with particular fonts may make it difficult or even impossible to distinguish between 1 (number one) and l (letter ell), x and \kappa, etc.; only the context can provide the information to the reader. So even if perfect OCR software for math articles existed (which it does not), it would be unable to interpret the printed article correctly in some places.
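
The point about 1 versus l can be illustrated with a toy model. It is only a sketch under stated assumptions: PlacedGlyph, to_bitmap_key and SHAPES are hypothetical names, and this is not the actual dvi byte encoding. It merely shows that a record of "which character, at which position" keeps information that the ink on the page does not.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlacedGlyph:
    name: str   # which character this is, e.g. "one" or "ell"
    x: int      # horizontal position on the page, arbitrary units
    y: int      # vertical position on the page, arbitrary units


def to_bitmap_key(glyph, shapes):
    """What a scan keeps: the ink shape and where it sits, not the name."""
    return (shapes[glyph.name], glyph.x, glyph.y)


# Hypothetical font in which digit one and letter ell are drawn identically.
SHAPES = {"one": "|", "ell": "|"}

a = PlacedGlyph("one", 100, 200)
b = PlacedGlyph("ell", 100, 200)
```

Here the two glyph records differ (a != b), while to_bitmap_key(a, SHAPES) and to_bitmap_key(b, SHAPES) are equal: once only the ink is kept, no OCR program, however good, can tell which character was meant without looking at the context.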

On the contrary, as I said in my preceding message, a pdf file is basically a PostScript file with additional constraints to make it quick to read. So, it is still a computer program, and in particular it may recompute, each time one reads the pdf file, the positions of the characters on the sheet of paper. Therefore, errors can occur in the positioning of some characters (with respect to the positioning in the original article as reviewed by the author in his final proofreading before publication), depending on the viewer used, the version of the viewer, or the version of the pdf specification used for the document. So, a pdf file is more or less like a sheet of music and a pdf viewer like a performer: a performer can play wrong notes, may not keep exactly the tempo intended by the composer, etc. For music, these differences between various interpretations are generally not a problem (except maybe for purists, or if the performer is really too bad!). But for science, it is (or ought to be, in my opinion) a disqualifying flaw. To summarize: pdf files are too far from printed articles because the positions of characters are not necessarily hard-wired in the file. This is basically why I advocate keeping the dvi format in LyX as the reference output for articles (say, as the "ultimate electronic format"). But of course, I see no problem in using derived files in pdf, ps or html formats, provided the reference dvi format is kept.

A final remark: the files produced by dvips, although PS files (and thus computer programs), most probably do not have the general problem of pdf files, because they are produced from dvi output, and thus I suppose (although I did not check) they use only the PS instruction "print character Ci at position (xi,yi) on page P(i)" with the positions already computed. I would suspect that the pdf output of dvipdfm would be correct too, for the same reason, but this does not make us safe from a change in the pdf specifications by Adobe. I remember some big fuss in the TeX community some years ago because of that. The dvi specifications are frozen, thanks to Knuth, so that using the dvi format is free from this kind of problem.

To work properly, mathematicians need that, if they use theorem X that means something today, it will mean exactly the same thing in two years or two hundred years. If they need a theorem with different hypotheses, a theorem with a new name is created. In computer science, version numbers are theoretically a solution if implemented rigorously. But in practice, for pdf files, as they are supposed to be usable by anybody, it seems (from the examples I gave in my preceding post) that the tendency cannot be anything else but to skip any check and display something by all means, even if it is incorrect. The other tendency, of Adobe extending the pdf specifications to provide more possibilities, is a good point for ordinary work but a bad one for scientific work that requires absolute rigour.
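
One practical consequence of a frozen format is that a reader for it can stay very small. The sketch below decodes the opening bytes (preamble) of a dvi file, assuming the standard layout documented with TeX's dvitype: opcode 247, a format id byte equal to 2, three 4-byte big-endian values (num, den, mag), and a length-prefixed comment. The function name read_dvi_preamble and the returned dictionary are hypothetical choices made for this illustration.

```python
import struct

DVI_PRE = 247   # opcode that opens a dvi file
DVI_ID = 2      # format identification byte of standard dvi


def read_dvi_preamble(data):
    """Decode the preamble from the first bytes of a dvi file."""
    if len(data) < 15 or data[0] != DVI_PRE:
        raise ValueError("not a dvi file: missing preamble opcode")
    # One id byte, then three big-endian 4-byte values and a length byte.
    dvi_id, num, den, mag, k = struct.unpack(">BIIIB", data[1:15])
    if dvi_id != DVI_ID:
        raise ValueError("unexpected dvi format id: %d" % dvi_id)
    # The comment is k bytes long and follows the fixed-size fields.
    comment = data[15:15 + k].decode("ascii", errors="replace")
    return {"id": dvi_id, "num": num, "den": den, "mag": mag,
            "comment": comment}
```

A typical use would be read_dvi_preamble(open("paper.dvi", "rb").read(64)), where paper.dvi is a placeholder file name. The sketch rejects any id byte other than 2, so variant formats that reuse the dvi container are refused instead of being displayed "by all means".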

Dvi allows interactivity and almost everything else (even making a cup of coffee, with suitable hardware installed :-) through the \special command; see the remarkable Advi software by Pierre Weiss and others from Inria (unfortunately not ported to Windows). As for the difficulty of navigating in dvi files, which you mentioned in another post, I wrote a program (AsTeX navigator) about ten years ago that makes it possible to navigate in a dvi file, as one can navigate in a LyX document from its detached table of contents. I presented it at the TUG 2000 conference in Cambridge. Unfortunately, it works only on Windows. I promised in Cambridge to port it to Linux, but I never found the time.


Best wishes,

JMarc


Michel
