Re: Migrating documentation from HTML files

2008-03-04 Thread Vincent Siveton
2008/3/4, Lukas Theussl <[EMAIL PROTECTED]>: > Ehm, yes, sorry, I talked quicker than I thought. Of course, the parser > is an xml parser so it will cough up any tags that are not properly > closed. So it has to be xhtml. You can use tools like htmltidy [1] to > convert html to xhtml. > > Btw,

Re: Migrating documentation from HTML files

2008-03-04 Thread Lukas Theussl
Ehm, yes, sorry, I talked quicker than I thought. Of course, the parser is an xml parser so it will cough up any tags that are not properly closed. So it has to be xhtml. You can use tools like htmltidy [1] to convert html to xhtml. Btw, Vincent just added a simple tool to do document translat

Re: Migrating documentation from HTML files

2008-03-04 Thread Cristóbal Fandiño
Output latex2html produces no XHTML code. For example: HTML == XhtmlParser == org.apache.maven.doxia.parser.ParseException: Error parsing the model: end tag name must be the same as start tag from line 19 (position: TEXT seen ...\n\n... @21:8) at org.apache.maven.doxia.pars

Re: Migrating documentation from HTML files

2008-03-03 Thread Lukas Theussl
doxia doesn't have a latex parser (I'd like to have one too!), latex2html is the only solution I can think of (there exist other latex translators though but that's the only one I know). I am not sure what kind of output latex2html produces, however, the difference HTML - xhtml shouldn't matter

Re: Migrating documentation from HTML files

2008-03-02 Thread krycho fandino
Thanks for your help, however my HTML files isn't XHTML and XhtmlParser throws a lot of exceptions. Perhaps, I should convert these HTML files to XHTML format, but I've a lot of pages and should be a hard task. Really, I has generated these HTML files using latex2html conversion tool. I don't know

Re: Migrating documentation from HTML files

2008-03-01 Thread Lukas Theussl
If you use the current development branch of doxia (beta-1-SNAPSHOT) then this should work rather well for simple html files. However, you will probably loose a lot of information if you have anything fancy (eg special layout, tables, figures are not well supported), don't expect it to be perfe

Re: Migrating documentation from HTML files

2008-03-01 Thread Vincent Siveton
Hi, Frankly, I never test your use case. But I guess that you need to have an XHTML file in input with no header, footer or navbar something to the div bodyColumn in [1]. The snippet should be something like the following: File f = new File( "blabla.html" ); XhtmlParser parser = new XhtmlParser

Migrating documentation from HTML files

2008-03-01 Thread krycho fandino
I'm a newbie using doxia. I've a lot of documentation in HTML format an I'd like convert these files to apt format. Is there some way to transform easily? I want to create a maven site for my project and, right now, I only have this documentation in HTML format without css styles nor menu. Could y