- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Deepu
Subject: indexer is not fetching the title of doc/pdf fiiles

hello,
I am using DataparkSearch Engine 4.29 and Successfully installed on Redhat 
Linux 9. I am able to run the ./indexer command successfully , but the problem 
is that am not getting any results if  the file names of pdf/doc files are 
given to search . I think the problem is present because, in the urlinfo table 
the title row is absent except for html/plain text files.

Here is an example of my search urlinfo table 
     
url_id |      sname                          |           sval

     2 | body                           | Tsearch2 - Introduction [Online 
version] of this document is available. The tsearch2 module is available to add 
as an extension to
the PostgreSQL database to allow for Full Text Indexing. This document is an int
roduction to installing, configuring, using a
      2 | Charset                       | ISO-8859-1
      2 | Content-Language                   | en
      2 | Content-Type                       | text/html
      2 | title                         | tsearch-v2-intro
     10 | body                          | Global Line 1400 TP Power Packed 
Small Server Produ
ct Highlights u SupportsSingleIntelPentium4 processor 2.4 GHz and above with 512
 KB/1 MB Cache u UltraFastthroughputwith800MHz front-side bus u 
Upto4GBofDDRRAMu FourTotalPCIslotswith3PCI-X u OnboardDualpo
     10 | Charset                       | ISO-8859-1
     10 | Content-Language                   | en
     10 | Content-Type                       | application/pdf

I want the indexer to parse the title for both pdf and doc files and put it in 
urlinfo table  as it does for the html/plain text files.

Please help me��
Thanking you in advance.....

deepu

- - - - - - - - - - - - - - - - - - - - - - - - - - - -

Read the full topic here:
http://www.dataparksearch.org/cgi-bin/simpleforum.cgi?fid=02;post=

Reply via email to