- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Deepu
Subject: indexer is not fetching the title of doc/pdf fiiles
hello,
I am using DataparkSearch Engine 4.29 and Successfully installed on Redhat
Linux 9. I am able to run the ./indexer command successfully , but the problem
is that am not getting any results if the file names of pdf/doc files are
given to search . I think the problem is present because, in the urlinfo table
the title row is absent except for html/plain text files.
Here is an example of my search urlinfo table
url_id | sname | sval
2 | body | Tsearch2 - Introduction [Online
version] of this document is available. The tsearch2 module is available to add
as an extension to
the PostgreSQL database to allow for Full Text Indexing. This document is an int
roduction to installing, configuring, using a
2 | Charset | ISO-8859-1
2 | Content-Language | en
2 | Content-Type | text/html
2 | title | tsearch-v2-intro
10 | body | Global Line 1400 TP Power Packed
Small Server Produ
ct Highlights u SupportsSingleIntelPentium4 processor 2.4 GHz and above with 512
KB/1 MB Cache u UltraFastthroughputwith800MHz front-side bus u
Upto4GBofDDRRAMu FourTotalPCIslotswith3PCI-X u OnboardDualpo
10 | Charset | ISO-8859-1
10 | Content-Language | en
10 | Content-Type | application/pdf
I want the indexer to parse the title for both pdf and doc files and put it in
urlinfo table as it does for the html/plain text files.
Please help me��
Thanking you in advance.....
deepu
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksearch.org/cgi-bin/simpleforum.cgi?fid=02;post=