Klaus,
The most important part is that htdig is set up with the correct mime-type 
mappings. 
I would suggest that you check the headers your server actually returns: 
options include the 'Live HTTP Headers' extension in Firefox, a direct telnet 
to port 80, or the verbose options of something like wget. 
That would eliminate any remaining confusion over which mime-type is being used. 
As you can see, a degree of faith is required that the web server picks 
the 'correct' mime-type in each case!
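
If a script is handier than a browser extension, the check could look roughly like the Python sketch below. It sends a HEAD request and prints the Content-Type header the server reports. The function names and the URL in the usage comment are placeholders of mine, not part of htdig or doc2html.

```python
import sys
import urllib.request


def bare_mime_type(header_value: str) -> str:
    """Strip parameters such as '; charset=...' and lowercase the type."""
    return header_value.split(";", 1)[0].strip().lower()


def fetch_content_type(url: str, timeout: float = 10.0) -> str:
    """Send a HEAD request and return the raw Content-Type header value."""
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        # Read the header as sent; an empty string means it was missing.
        return response.headers.get("Content-Type", "")


if __name__ == "__main__":
    # Usage (hypothetical URL):
    #   python check_type.py http://your.server/docs/wordperfectfile.wpd
    for url in sys.argv[1:]:
        raw = fetch_content_type(url)
        print(url, "->", bare_mime_type(raw) or "(no Content-Type header)")
```

Whatever this prints for a .wpd URL is the string the htdig configuration has to match.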
 
If it does turn out to be returning application/octet-stream, that needs to be 
fixed. Otherwise I would just put in a line for every possible variation of 
the WordPerfect mime-type. As you spotted, doc2html makes its own attempt to 
identify which kind of document it is given, independently of the mime-type.
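
As a rough sketch of "a line for every possible variation", the Python snippet below prints an external_parsers setting with one entry per candidate type. The first two types are the ones from the /etc/mime.types quoted below. The other two types and the path to doc2html.pl are assumptions, so adjust them to whatever your server really sends and wherever the script is installed.

```python
# Candidate WordPerfect mime-types to map to doc2html.
WORDPERFECT_TYPES = [
    "application/wordperfect",      # from the quoted /etc/mime.types
    "application/wordperfect5.1",   # from the quoted /etc/mime.types
    "application/vnd.wordperfect",  # assumption: another variant a server may send
    "application/x-wordperfect",    # assumption: another variant a server may send
]

# Hypothetical install location of the converter script.
DOC2HTML = "/usr/local/bin/doc2html.pl"


def external_parsers_lines(mime_types, converter):
    """Build one external_parsers attribute with an entry per mime-type."""
    entries = [f"{mime}->text/html {converter}" for mime in mime_types]
    # A trailing backslash continues the attribute onto the next line.
    return "external_parsers: " + " \\\n\t".join(entries)


print(external_parsers_lines(WORDPERFECT_TYPES, DOC2HTML))
```

The output is meant to be pasted into the htdig config file and merged with any external_parsers entries already there.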
 
Hope that helps,
Mike 
 
 


________________________________

        From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of Klaus Bömken
        Sent: Sunday, May 06, 2007 7:14 PM
        To: [email protected]
        Subject: Re: [htdig] doc2html unable to convert wordperfect
        
        
        That is a point that I am very uncertain about. Here is what I can 
        tell after reading more of the documentation.
        
        This is part of my /usr/share/file/magic:
        
        >#WordPerfect type files Version 1.6 - PLEASE DO NOT REMOVE THIS LINE
        >0       string  \377WPC\020\000\000\000\022\012\001\001\000\000\000\000 (WP) loadable text
        >>15     byte    0       Optimized for Intel
        >>15     byte    1       Optimized for Non-Intel
        >1       string  WPC     (Corel/WP)
        >>8      short   257     WordPerfect macro
        (and so on)
        
        This is part of my /etc/mime.types:
        
        >application/wordperfect                         wpd
        >application/wordperfect5.1                      wp5
        
        Now if I use the file-command:
        
        >file mswordfile.doc
        >mswordfile.doc: Microsoft Office Document
        
        >file -i mswordfile.doc
        >mswordfile.doc: application/msword
        
        That's what I would expect. But:
        
        >file wordperfectfile.wpd
        >wordperfectfile.wpd: (Corel/WP)
        
        >file -i wordperfectfile.wpd
        >wordperfectfile.wpd: application/octet-stream
        
        ... seems wrong to me.
        
        On the other hand: how does doc2html identify file-types? 
        
        I'm not familiar with Perl, but it seems to me that there is some sort 
        of comparison with "\377" within the Perl script.
        
        In this respect:
        
        >od -b wordperfectfile.wpd | head
        >0000000 377 127 120 103 365 007 000 000 001 012 002 001 000 000 000 002
        
        ... seems correct to me.
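
For illustration only, the same comparison can be sketched in Python. The octal bytes 377 127 120 103 in the od output are \377 followed by the ASCII letters W, P and C, which matches the \377WPC string in the magic file. This is not doc2html's actual code, and the function names are made up.

```python
# Leading bytes of a WordPerfect file: octal 377 127 120 103.
WPC_SIGNATURE = b"\xffWPC"


def looks_like_wordperfect(data: bytes) -> bool:
    """Return True if the data starts with the \\377WPC signature."""
    return data.startswith(WPC_SIGNATURE)


def file_looks_like_wordperfect(path: str) -> bool:
    """Read only as many bytes as the signature needs and compare."""
    with open(path, "rb") as handle:
        return looks_like_wordperfect(handle.read(len(WPC_SIGNATURE)))


# First bytes of wordperfectfile.wpd, copied from the od -b output.
od_bytes = bytes(int(tok, 8) for tok in "377 127 120 103 365 007 000 000".split())
print(looks_like_wordperfect(od_bytes))  # prints True
```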
        
        Would this be the information you asked about? What can I do next?
        
        Thanks for your help,
        Klaus
        
        
        >-------in reply to
        Re: [htdig] doc2html unable to convert wordperfect 

        michael.brockington
        Fri, 04 May 2007 01:58:35 -0700

        Have you got any idea what mime-type your server is putting out for 
        these documents?
        The key is that what you have in your config file must match what your 
        server is saying, regardless of whether either of them is technically 
        correct.
        
        Regards,
        Mike
         
        
        > -----Original Message-----
        > From: [EMAIL PROTECTED] 
        > [mailto:[EMAIL PROTECTED]] On 
        > Behalf Of Klaus Bömken
        > Sent: Wednesday, May 02, 2007 7:56 PM
        > To: [email protected]
        > Subject: [htdig] doc2html unable to convert wordperfect
        > 
        
        <snip>
        
        > 
        > I read some stuff about mime-types that I don't understand. 
        > Is it a matter of WordPerfect versions?
        > 
        > Any hints?
        > 
        > Klaus
        > 
        -- 
        *************************
        Klaus Bömken
        Herzogstraße 89b
        46145 Oberhausen
        
        Fon:   0208-612018
        Mobil: 0163-8898 436
        Mail: [EMAIL PROTECTED] 

_______________________________________________
ht://Dig general mailing list: <[email protected]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general
