> >
> > 1)    Create an .html file for each file you need to index, each file to
> > contain metadata for indexing and a link to the file it refers to.  Then
> > index these files.  Not a very good solution as far as end-users are
> > concerned, but simple to do.
>
> Ok. Thats fairly similar to my first idea. I will use this solution
> only if there is really no alternative.
>
> > 2) Write a converter script which will add the appropriate metadata
> > to the output from doc2html.pl or whatever conversion script you are
> > using, during the indexing process. A bit of challenge to write, but
> > it should give you exactly what you want and will be transparent to
> > end users.
>
> Yes. But the problem is: How does the script get the metadata? I
> would write a wrapper for doc2html.pl, that means that i have no
> chance to detect which file htdig requested from the http server,
> isn't it?
>
>
>    --Nikolaus

Personally I would modify doc2html.pl rather than use a wrapper script, but
the method would be similar.

You need to create some sort of database (anything from flatfile to Oracle
table - whatever suits you) which contains the metadata for each document,
with the document URL as the key field.  Htdig passes the URL as the second
argument to doc2html.pl, so it can look up each document in your database as
it is presented for conversion and can insert your metadata in the
appropriate place in its output.  Bingo!

--
David Adams
Information Systems Services
Southampton University




-------------------------------------------------------
This sf.net email is sponsored by:
With Great Power, Comes Great Responsibility 
Learn to use your power at OSDN's High Performance Computing Channel
http://hpc.devchannel.org/
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to