Tom Poe wrote:

> Hi:  I managed to lose a hard drive, and not have backup system in place
> for data.  . . . . don't ask!
> 
> I'd like to take a book that was originally in .lyx format, and is now
> on the web site as a html book, and prepare it to be opened in Lyx.  I
> saw some stuff on Google about html2tex, and html to text, but wonder if
> anyone has something that they used to get a .html file opened in Lyx?
> I don't have scanner/OCR software.  I'm thinking I could copy/paste the
> view source pages for text, and do it that way.
> Any help appreciated.

If the html was generated from a .lyx file then it's likely to retain lots
of logical markup. Regenerating a .tex file from it using your chosen html
to latex converter should retain this markup. Thereafter, LyX's .tex
to .lyx converters should be able to recreate the .lyx file. (See
http://wiki.lyx.org/LaTeX/LatexToLyx.) In the first instance I'd try reLyX
from the command line. Only if that fails to do the job would I try out
tex2lyx.

This TUG page
   http://www.tug.org/utilities/texconv/pctotex.html
has a section "HTML to LaTeX" that mentions three html to latex converters:

* Frans Faase's html2tex (NL site) (C source)
  http://www.iwriteiam.nl/html2tex.html
* Peter Thatcher's html2latex at sourceforge.net (Perl script)
  http://html2latex.sourceforge.net/
* Jeffrey Schaefer's html2latex at www.geom.umn.edu (Perl script)
  http://www.geom.uiuc.edu/software/html2latex/welcome.html

Note, however, that if your .html page contains bitmap images of math
equations, then you're going to have to re-type them in your .lyx
document.

-- 
Angus

Reply via email to