> Hi guys!
> 
> I have installed htdig and I have a little problem with German Umlaut.  I
> can search for words with Umlaut without any problem.  When I search for
> say 'C34644' (a file containing Umlaut), the results from htdig comes back
> with strange characters instead of Umlaut (for example, I get a circle (�)
> instead of �, or I get a bit � instead of a small �), and it seems to
> return that kind of results only for word documents.  What could be the
> problem?
> 
> In order to use german dictionary, in my htdig.conf I have the following:
> 
> ispell_dir=/usr/lib/ispell
> 
> # to use german dictionary
> locale:                 de_DE
> lang_dir:               ${ispell_dir}
> endings_affix_file:     ${lang_dir}/german.aff
> endings_dictionary:     ${lang_dir}/german.0
> 
In order to parse word documents, in my htdig.conf I have the following:

# to convert different binary documents (.doc, .ps, .pdf, .xls, .ppt)
#doc2html=/usr/local/bin/doc2html.pl
external_parsers:       application/pdf->text/html
/usr/local/bin/doc2html.pl \
                        application/postscript->text/html
/usr/local/bin/doc2htm
                        application/msword->text/html
/usr/local/bin/doc2html.pl
                        application/msexcel->text/html
/usr/local/bin/doc2html.p
                        application/vnd.ms-excel->text/html
/usr/local/bin/doc2h
                        application/vnd.ms-powerpoint->text/html
/usr/local/bin/

> By the way, I am using htdig on Suse 7.2 Linux distribution.
> 
> Could you please reply by email since I didn't subscribe to the mailing
> list?
> 
> Thanks in advance for any help.
> 
> Melanie.

_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to