Gilberto C. Andrade wrote:
After some work, I didn't resist and use the herold utility:
herold -i controls.html -o chapter-controls.xml -s UTF-8 -r book

What did I find?
-the root tag is book: this isn't an problem
-we are using old doctype specification, this one is the actual:
[code]
<?xml version='1.0' encoding='UTF-8'?>
<!DOCTYPE book PUBLIC '-//OASIS//DTD DocBook XML V4.5//EN'
'http://www.oasis-open.org/docbook/xml/4.5/docbookx.dtd'>
[/code]


As long as we can still generate the output thats great.


-I need revise chapter, section and link, just to conform with
introductions.xml ids pattern (you defined)


We can change the id pattern, as long as its unique and easy to bookmark. Not sure what ids herold generates.


-the images link references images/image01.png while introductions.xml
uses images/introduction/image01.png: I don't know which to follow!
And these images aren't images folder.


I used the convention that each chapter will have its own images folder. That way its easier to find images when we have to update screenshots.


So, this tool very good and I will use it to help us on this task (
html tags to docbook xml tags) initially.


Sounds good. Don't worry if the conversion isn't 100%. This is a one-time task and I'll double check that the output is generated correctly.

kind regards

bob

Reply via email to