Re: [Oorexx-devel] New Process for Building the ooRexx Documentation

Gil Barmwater Thu, 20 Feb 2020 11:11:24 -0800

Thanks P.O. for doing the compare! I'm glad to hear that you found nodifferences in this very large document :-).

There is still the issue of the Table Of Contents differences but I havedetermined that they are actually generated and added by FOP and NOT byxsltproc (surprise!). I think I may have found a parameter to controlthe depth so a bit more testing before I recreate the package. Hopefullythat will happen soon.


Gil

On 2/20/2020 8:24 AM, P.O. Jonsson wrote:

Dear Gil,
The Rexxref you sent was the latest version, 11980, so I could compareit also to the version we have on Sourceforge:
1: Your rexxref is BINARY IDENTICAL to the one on sourceforge!
2: I could compare your Rexxref to my own build and see that thetextual content is identical to my builds BUT there is an important difference, namely where I have additional spaces (I did notimplement Erichs filter) and you don’t, which goes to proof that yourstrategy was successful. The additional spaces in my build docs leadto shift in content eventually leading to renumbering of pages.
I might have a go at injecting Erichs filter process in my build chainbut only as an academic exercise, I think it is safe to say that yourbuild change can replace Publican in the near future. Hats off!
I could not include screenshots of the comparison so I included themin a document that I have attached.
Hälsningar/Regards/Grüsse,
P.O. Jonsson
oor...@jonases.se <mailto:oor...@jonases.se>
Am 20.02.2020 um 02:02 schrieb Gil Barmwater <gbarmwa...@alum.rpi.edu<mailto:gbarmwa...@alum.rpi.edu>>:
It's at 11974, the same one that is packaged with the r11978 build of2 Feb. I will zip it up and send it to you directly.
On 2/19/2020 4:52 PM, P.O. Jonsson wrote:
Dear Gil,
This is great news! I will gladly go over your version ofrexxref.pdf with a magnifying glass (Beyond Compare) and compare itwith the version I have made using Publican.
If you want to do some comparisons as well I have all the Docs in myDropbox here:
https://www.dropbox.com/sh/p66c7g01h4jz5ss/AAAZd_Q2yQddrTHagxPo_UiTa?dl=0

In a folder ooRexxDocs
In order to make the comparison meaningful we should be using thesame build, please let me know which one you have done and I willrebuild mine using the same.
Hälsningar/Regards/Grüsse,
P.O. Jonsson
oor...@jonases.se <mailto:oor...@jonases.se>
Am 19.02.2020 um 22:29 schrieb Gil Barmwater<gbarmwa...@alum.rpi.edu <mailto:gbarmwa...@alum.rpi.edu>>:
And we have success! I found the reason for my previous failure anda way around it that only involved one change to a parameter in thestylesheet. At P.O.'s suggestion, I also bumped up the heap spacefor FOP to 1536Mb (my laptop couldn't support 2Gb). The resultingPDF appears almost identical to the most recent version on SourceForge - same number of pages, no extra lines in the examples,railroad diagrams are good - but I get more entries in the Table OfContents. This has been the case with all the documents I've builtso I suspect something has changed in the DocBook stylesheets; thePublican process uses an old version I believe as I see one in thewindows-build-tools directory on SVN while my process retrieves itfrom the web. I suspect that most folks would prefer the "current"style of TOC so I will continue to investigate this issue. Ifanyone is interested in going over the rexxref PDF I built with afine tooth comb to see if there are other issues, I will zip it upand put it in my Dropbox. In the meantime, I will update thepackage files that I've modified in order to make this work and zipthem up into a package that folks can download and try. Stay tuned...
Gil B.

On 2/17/2020 11:59 AM, P.O. Jonsson wrote:
Dear Gil,
Have you tried an even higher value? When I built using Publicanit balked at 950 kb (value set be Erich I think) for rexxref so Iraised it to 2 GB and then it passed. It is worth a try, memory isnot a bottleneck nowadays :-)
Hälsningar/Regards/Grüsse,
P.O. Jonsson
oor...@jonases.se <mailto:oor...@jonases.se>
Am 17.02.2020 um 15:12 schrieb Gil Barmwater<gbarmwa...@alum.rpi.edu <mailto:gbarmwa...@alum.rpi.edu>>:
An update on my progress is long overdue but Real Life sometimesgets in the way!
I have "put the pieces together" and zipped them up along withtwo files of documentation and have been able to take thatpackage to another computer, install it and successfully buildthe rxmath book. I also researched the article on Java heap spaceand found a way to specify a larger value - currently using 1GB -without having to change the FOP package. Then, because I knowthat folks will want to build the rexxref book right away, Idecided to try it, mainly to see if 1GB would be large enough.And, of course, it failed! But the problem was not with FOP butrather with the xsltproc step. It seems that the Publicanstylesheet is looking for a piece of Perl code which is obviouslynot present. So I'm back in debug mode, trying to determine whattag rexxref is using that wasn't used by rxmath and then what Ican do about it. If I can get the rexxref book to build, I willmake the tool package available so we can find any other problemsthat may be lurking.
Gil B.

On 1/30/2020 10:26 AM, Rony G. Flatscher wrote:
Dear Gil:
thank you *very* much for this interesting and informativeupdate! Looking forward to your tooling! :-)
---

Ad "Java heap space": just skim over
<https://alvinalexander.com/blog/post/java/java-xmx-xms-memory-heap-size-control>.
Maybe helpful: there are two command line help information givenby Java, one ("java --help") thedefault help, and another giving extended help ("java -X") whichdocuments the switches for
controlling the heap size Java should reserve.

Best regards

---rony


On 29.01.2020 21:38, Gil Barmwater wrote:
Previously I wrote: One other bit of good news is that thecombination of these patches and theCommon_Content sub-folder work-around are the only requiredchanges in order to use the XSLTPROCand FOP tools to successfully build our documents. I willdescribe that process in my next post.
...
So this is that next post but I am replying to Rony's post as Iwanted to also address thequestions that he raised. The process I came up with is verysimilar to that used with thePublican tools - run a transform tool, either Publican orXSLTPROC, to create an XSL-FO file fromour Docbook/XML files and a (modified) Docbook stylesheet. Runan ooRexx program written by Erichto remove extra blank lines in the .fo file. Run FOP to createa PDF from the (modified) .fo file.
But as always, the devil is in the details.
I chose XSLTPROC as several web sites suggested it althoughother tools like Xalan were mentionedas well. I was attempting to follow some step by stepdirections for building a PDF from Docbooksource but, of course, those web sites are never up to date andI had to adapt the directions as Iencountered problems. I also wanted to minimize the number ofchanges to our Publican process aswe are generally happy with the results it produces. Sosubstituting XSLTPROC for Publican as theXSL transform tool seemed a good starting point. Likewise, Ikept the Publican stylesheet - anoverride to the standard Docbook stylesheet - that we hadfurther modified but I was able toeliminate a part of it as Docbook had corrected a problem thatit was fixing, something to do withfootnote spacing. And, of course, I used the most currentversions of the tools that were
available, both for XSLTPROC and FOP (ver. 2.4).
Now I know that some folks are "chomping at the bit" toreplicate what I have done but before yourun off and start searching for the tools to download, let megive you a list of the "pieces" thatare needed. First there is the XSLTPROC transform tool: this isactually 4 packages(!) which needto be downloaded, unzipped, and the executable folders (bin)added to the path. Then of coursethere is the FOP package which needs to be downloaded, unzippedand the appropriate sub-folderadded to the path. In order to get the same "look" to thedocuments as produced by Publican, youneed to add some special fonts - 2 packages - to your system.And then there are the two Publicanstylesheets, one of which has been modified, and aconfiguration file for FOP so that it can findthe graphic files to be included and use the special fonts thatwere installed. Finally, you needto retrieve the blank-stripping program by Erich from the SVNrepository. And once you have allthe "pieces" in place, you need to checkout the latest versionof the documents from SVN, copy the"common" folder to the working copy for the book you will bebuilding and add the fopconfiguration file to it. Then you can run xsltproc, theblank-line stripping program and then
FOP. Piece of cake!
Because the above might seem overwhelming(!), I have beendeveloping a "package" that simplifiesit to a large degree. If you were to use this package, itcontains all the "pieces" and a set ofCMD files to execute the process steps. It is designed to beunzipped into a folder that willbecome the working location for building one or more?documents. After installing it, you wouldneed to install the fonts (included) and then you could build adocument. The first cmd file to berun is DOCPATH which takes one argument - the path to the SVNworking copy of the documents. Thatpath is saved in an environment variable for use by theremaining steps. Then you run DOCPREPwhich also takes one argument - the name of the "book" you wantto build, e.g. rxmath. It takescare of creating the "Common_Content" sub-folder and adding theFOP configuration file to it aswell as saving the document name in another environmentvariable. Next you run DOC2FO which runsthe transform step. And finally, FO2PDF which runs FOP. The .fofile, the .pdf file and a .logfile containing all the (many) messages from FOP are placed ina sub-directory named e.g. out-rxmath.
The cmd files are written and have been tested on the rxmath"book". I need to put the piecestogether and zip them up which is my next step. Then I willprovide a link so anyone interestedcan download it and give it a try. Note that I have NOT triedthis on any other "books" so Iexpect there will be issues with some of them. E.g. as P.O.noted in a different thread andmentioned by Erich as well, the Java heap space needs to beincreased for some of our documents. Ido not know how to do that <blush> but it was not necessary forthe rxmath book. Any other issuesshould be "book-related", not process-related and can be fixedas they are uncovered. And any
process issues or enhancements I am willing to investigate.
If it is the consensus that I should run this process on "all"the documents before I release it,
i.e. actually do a full test(!), I would be willing to do so.

Your thoughts and comments are welcome.

Gil B.

On 1/7/2020 9:28 AM, Rony G. Flatscher wrote:
Hi Gil,
any chance for your next posting to get an idea of what youhave done and come up to? Maybe with abird eyes's view how you now would suggest to create thedocumentation according to your analysis,
tests?
Also, would you have already suggestions for the software touse, e.g. xsltproc (how about using
Apache Xalan [1] for this), the FOP is probably Apache FOP [2].
Guessing that everyone has been waiting eagerly for your nextinsights and directions of how toduplicate your efforts to successfully create thedocumentation! :)
---rony

[1] Apache Xalan Project:<https://xalan.apache.org/>
[2] Apache FOP:<https://xmlgraphics.apache.org/fop/>u


On 06.01.2020 20:07, Gil Barmwater wrote:
This thread is a continuation of the thread titled "Questionsad generating the documentation(publican, pandoc)" with a different Subject since Pandoc isno longer being considered as an
alternative.
To review, the ooRexx documentation is written in DocBook andhas been turned into PDFs and HTMLfiles using a system called Publican, originally developed byRedHat. Publican is no longersupported and works only occasionally under Windows 10. Underthe covers, Publican transforms theDocBook XML into XSL-FO using xsltproc, probably the Perlbindings based on comments by Erich, andmodified DocBook stylesheets. It then runs the FOP program toconvert the xsl-fo output into a PDFfile. In between those two steps, we run a Rexx programwritten by Erich to remove extra blank
lines from the examples.
The new process uses the latest XSLTPROC programs directlyalong with the latest version of FOP.However, Publican imposes some unique structure to theDocBook XML which must be accounted for.Publican has the concept of a "brand" which lets one definecommon text and graphics that shouldappear the same in all of a project's documentation. Onedenotes those common text/graphic filesin the XML by preceding their names with "Common_Content/".As Publican merges the various partsof the document together so that it can be transformed by thestylesheets, it resolves anyreferences to Common_Content so that the correct file ismerged into the complete source. As thisprocess is unique to Publican, we must account for it inorder to use XSLTPROC instead.
One approach we could take would be to replaceCommon_Content/ with either a relative or absolutepath to the location in our source tree where the filesactually are located. For the sake of thisdiscussion, I will assume the working copy of thedocumentation has been checked out to adirectory named docs. Then the main xml file for the rxmathbook would be located atdocs\rxmath\en-US\rxmath.xml. And the files referenced byCommon_Content would be indocs\oorexx\en-US\. The relative path would then be..\..\oorexx\en-US\. The only problem withthis approach is the number of places this would need to bechanged. My analysis shows over 140
locations in over 50 files.
A more expedient approach, and the one I would advocate, isto create a "temporary" sub-directoryfor the purpose of building the documentation and then tocopy everything from docs\oorexx\en-US\into it. So if one were going to build the rxmath book, onewould createdocs\rxmath\en-US\Common_Content\ and copy into it. Thisallows XSLTPROC to locate the files thatneed to be merged without having to make any changes to oursource. The disadvantage is that oneneeds to do this for each book being built. It is however asimple step that can be done eitherwith File Explorer or automated using the xcopy or robocopycommands.
Having gotten by the Common_Content issue, running XSLTPROCreveals another problem caused by theway Publican does the merge of the Common_Content files whichI will describe in the next posting.
_______________________________________________
Oorexx-devel mailing list
Oorexx-devel@lists.sourceforge.net<mailto:Oorexx-devel@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/oorexx-devel
--
Gil Barmwater



_______________________________________________
Oorexx-devel mailing list
Oorexx-devel@lists.sourceforge.net<mailto:Oorexx-devel@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/oorexx-devel
_______________________________________________
Oorexx-devel mailing list
Oorexx-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oorexx-devel
--
Gil Barmwater
_______________________________________________
Oorexx-devel mailing list
Oorexx-devel@lists.sourceforge.net<mailto:Oorexx-devel@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/oorexx-devel
_______________________________________________
Oorexx-devel mailing list
Oorexx-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oorexx-devel
--
Gil Barmwater
_______________________________________________
Oorexx-devel mailing list
Oorexx-devel@lists.sourceforge.net<mailto:Oorexx-devel@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/oorexx-devel
_______________________________________________
Oorexx-devel mailing list
Oorexx-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oorexx-devel


--
Gil Barmwater

_______________________________________________
Oorexx-devel mailing list
Oorexx-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oorexx-devel

Re: [Oorexx-devel] New Process for Building the ooRexx Documentation

Reply via email to