Kir or Markus, What are the proper steps to having this type of functionality? I must first download the converters then , how do I configure ASPSeek for the external converters?
Diego --- Kir Kolyshkin <[EMAIL PROTECTED]> wrote: > ASPSeek will also present "text version" of beer.pdf > to be viewed > (in the place where "cached" link usually is), much > like as Google does, > so you can see the result of conversion. Excerpts > are also supported. > > > [EMAIL PROTECTED] wrote: > > > > no no, > > > > the external converter is started from aspseek > during the index process when aspseek finds a pdf > file. > > so in your case: > > > > when aspseek indexes www.crazy.com and finds > beer.pdf it starts the converter. the converter > reads the pdf-document convert it to txt/html. now > aspseek indexes this export. > > > > no your users can search also in pdf documents. so > when "beer" is in beer.bdf, aspseek will list the > link to beer.pdf as a result and even displays the > short extract. your users now can click on the link > and acrobat reader opens to display the pdf-file. > > > > so external converter means a helper programme for > apseek to index pdf-documents. > > > > Markus Rietzler > > * kommunikation & online service > > * RZF NRW > > * Tel: 0211.4572-130 > > > > -----Urspr�ngliche Nachricht----- > > Von: Diego Montalvo [mailto:[EMAIL PROTECTED]] > > Gesendet am: Donnerstag, 21. Februar 2002 16:55 > > An: [EMAIL PROTECTED] > > Betreff: Re: [aseek-users] ASPSeek - PDF / RTF > > > > Kir, > > > > I am somewhat confused, so ASPSeek will crawl and > > index .PDF and such files, but will not present > them > > as .html? Therefore I need a external converter? > > > > Or does an external converter first convert, then > I > > run ASPSeek? > > > > example: I want to index "www.crazy.com/beer.pdf" > i > > simply use ASPSeek, to retreive words from > "beer.pdf" > > but then I mst use an external program to view in > > html? > > > > do you have a link to such a search engine using > > ASPSeek with external converters? > > > > Diego > > > > --- Kir Kolyshkin <[EMAIL PROTECTED]> wrote: > > > Diego Montalvo wrote: > > > > > > > > Hello, > > > > > > > > In the ASPSeek Manual pages there is a mention > > > that > > > > ASPSeek understands PDF, RTF formats with help > of > > > an > > > > external program, what program is that? I > would > > > like > > > > to embed it into ASPSeek. > > > > > > There's no need to embed. Manual talks about > > > External Converters, > > > described in > > > > http://www.aspseek.org/man/aspseek.conf.5.html#lbAM > > > So as long as you have program that can convert, > > > say, pdf to html, > > > you can index pdf documents with aspseek. > > > > > > Good ps to text (or html) converter is here: > > > http://www.nzdl.org/html/prescript.html > > > There are also links to other such tools. > > > > > > As for converter from rtf or doc format, I know > of > > > word2x: http://word2x.alcom.co.uk/ > > > antiword: > http://www.winfield.demon.nl/index.html > > > unrtf: > http://www.geocities.com/tuorfa/unrtf.html > > > -- > > > [EMAIL PROTECTED] http://kir.vtx.ru/ ICQ > 7551596 > > > Phone +7 903 6722750 > > > Hi, I'm a signature virus: copy me to your > > > .signature to help me spread! > > > -- > > > > __________________________________________________ > > Do You Yahoo!? > > Yahoo! Sports - Coverage of the 2002 Olympic Games > > http://sports.yahoo.com > > -- > [EMAIL PROTECTED] http://kir.vtx.ru/ ICQ 7551596 > Phone +7 903 6722750 > Hi, I'm a signature virus: copy me to your > .signature to help me spread! > -- __________________________________________________ Do You Yahoo!? Yahoo! Sports - Coverage of the 2002 Olympic Games http://sports.yahoo.com
