Hello,


I started indexer from UdmSearch 3.0.2[01] few weeks ago to index few
domains and now I give (with PHP front-end) some queries. And I found some
oddities:

When I give keyword say "hany" I receive results like this:

1. Mega ?Loman: intranet: M1 [1]
        Mega ?Loman - intranet: M1 M1.megaloman.sk je Hanyho pracovna 
        stanica. [ Powered by Hany ž š TM ] sluzby na tomto pocitaci nie 
        su garantovane, kedze tento pocitac je urceny na vyvoj a
        testovanie hany Dokumentácia The Apache documentatio...

                http://www.X1.megaloman.sk/ (text/html) Thu, 27 Jul 2000 13:38:01 GMT, 
2268 bytes 
                http://www.X2.megaloman.sk/ (text/html) Thu, 27 Jul 2000 13:38:01 GMT 
                http://www.X3.megaloman.sk/ (text/html) Thu, 27 Jul 2000 13:38:01 GMT 
                ... (about 10-20 other URLs)


I get few (2-3) such sets:
- each has same numer of URLs
- each is sorted differently
- each contains just ONE different URL than other sets

It also found some valid pages (existing pages, found in DB and containing
keyword).

Situation is this:

1) Page with title " Mega ?Loman: intranet: M1" and page body excerpt is
from index.html from machine on my local network (my workstation) which
should not be and is not indexed (though network connection IS possible).

URL of my workstation is not in 'url' table.

2) domains listed (say http://www.X1.megaloman.sk/) have following data in
database:

- those "different" URLs:

status: 504 and 304
title, keywords, docsize, tag, hops, crc, ...: from my local page (which
        is incorrect)
last_index_time values are different
same referrer

referrer: record with such rec_id is not in 'url' table.

- those "other i.e. same" URLs:

status: 200
othere values looks OK (title, keywords, ...)

(I checked just 2 URLs of both groups)

Is that some indexer error?


Sincerely

Peter Hanecak

-- 
===================================================================
  Peter Hanecak <[EMAIL PROTECTED]> - technology manager
  GPG pub.key: http://www.megaloman.com/gpg/hanecak-megaloman.txt
===================================================================

______________
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]

Reply via email to