Some tima ago (a year, probably) I posted a message about modifying
ht://Dig to do a geographic search. At the time there was some interest
expressed.

The original modifications were against an earlier tarball - 3.1.5 I
think. I intended to rework the modifications against the CVS version, and
it has taken rather a long time. I finally did it in October/November last
year. I updated the patch last month and everything still compiles and
runs.

Essentially there are 3 bits of code that were added:
- code in htsearch to parse geotags from HTML - META statements with
position information. This could be extended to parse similar metadata in HTML or
XML
- code in htdig to store position data  for documents, to compute a
"search distance", search ordered by distance, graphical input and output of
search location
- code in htsearch to restrict searches to certain geographic regions, or
pages with geotags, in order to search for geographically-enabled pages in
a much larger body of regular pages

The modifications are done with configuration elements such that the
normal text-based search of ht://Dig is unaffected

The modifications are described and patch file available from
http://andrew.triumf.ca/htdig/mods/
A demonstration set of data (pages for international airports with
latitude/longitude information) is included

There are also a couple of modifications associated with trying to index
non-text media (splitting max_doc_size between text and e.g. movies), and
a couple of other minor changes (adding an HTTP/1.1 range
request to retrieve partial documents, instead of asking for the whole
thing then dropping the connection)


The site geotags.com is still using the old  version. There is some
problem with indexing a larger set of pages that seems to be fixed in the
3.2 release. An upgrade is planned ...

Andrew Daviel
TRIUMF


-------------------------------------------------------
The SF.Net email is sponsored by EclipseCon 2004
Premiere Conference on Open Tools Development and Integration
See the breadth of Eclipse activity. February 3-5 in Anaheim, CA.
http://www.eclipsecon.org/osdn
_______________________________________________
ht://Dig Developer mailing list:
[EMAIL PROTECTED]
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to