Please keep such questions on the mailing list.  Thanks.

According to F3N1X - INDAYA TEAM:
> I'm designing an Intranet and I must put a "index server" in linux
> and I decided to use HTDIG,  the problem is, I don't know how to index 
> .pdf.xls and .docs files. so, I'd like u can help me if u r so glad.
> 
> Thanks for all
> 
> note: I read the FAQ's yet.-
> 
> but... it doesn't index pdf. doc or xls

Pay especially close attention to http://www.htdig.org/FAQ.html#q4.8,
http://www.htdig.org/FAQ.html#q5.25 and
http://www.htdig.org/FAQ.html#q5.27

There are three main issues to resolve here:

1) is htdig actually finding links to the PDF, Word and Excel documents
you want to index?

2) if it is, is it correctly fetching them and passing them on to the
appropriate external converter to be able to index them?

3) if it is attempting to convert them, is the external converter doing
what it should, to feed some indexable text back into htdig's parser?

You need to figure out at which of these three stages the process is
failing, and focus on that stage to find out why.  Run htdig with
anywhere from 1 to 4 -v options to get the debugging output that shows
where it's failing and why.  This may be an iterative process if htdig is
failing at more than one stage: you might fix one problem only to run
into another.
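
As a rough illustration of the -v runs described above, here is a small
Python sketch that builds an htdig command line with 1 to 4 -v options
and saves the output to a file for reading.  The helper names, the log
file and the config path are only assumptions for the example; -v and
-c are the usual htdig options, but check them against your own
installation.

```python
import subprocess


def build_htdig_command(verbosity, config_path=None, htdig="htdig"):
    """Return the argument list for an htdig run with 1 to 4 -v options."""
    if not 1 <= verbosity <= 4:
        raise ValueError("verbosity must be between 1 and 4")
    # Three levels of verbosity become a single "-vvv" argument.
    cmd = [htdig, "-" + "v" * verbosity]
    if config_path is not None:
        # Assumption: the config file is passed with -c, as in a typical setup.
        cmd += ["-c", config_path]
    return cmd


def run_htdig(verbosity, log_path, config_path=None):
    """Run htdig and write its stdout and stderr to log_path.

    Returns htdig's exit status. The log is meant to be read by hand
    to see which documents were found, fetched and converted.
    """
    with open(log_path, "w") as log:
        result = subprocess.run(
            build_htdig_command(verbosity, config_path),
            stdout=log,
            stderr=subprocess.STDOUT,
        )
    return result.returncode
```

For example, run_htdig(3, "htdig-debug.log", "/etc/htdig/htdig.conf")
would run "htdig -vvv -c /etc/htdig/htdig.conf" (a hypothetical config
path) and leave the debugging output in htdig-debug.log.  Start low and
raise the number of -v options only if the output doesn't yet show
where things go wrong.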

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/
Dept. Physiology, U. of Manitoba  Winnipeg, MB  R3E 3J7  (Canada)


_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general
