NOOOooooo!!! Just kidding! :-) So maybe you can clear something up for me. In the future, when building a new crawldb, suppose I only want to accept URLs like the following:
http://myhost:81/site1/test.php?id=1234
http://myhost:81/site1/list.php?page=1234&count=21
http://myhost:81/site1/view.php?id=1234
http://myhost:81/site2/test2.php?id=12233
http://myhost:81/site2/list.php?page=25&count=12344
file:////sharedrive1/share1/

How would the regex-urlfilter look for the php pages?

+^http://myhost:81/site1/test.php\?.* ???

--
View this message in context: http://lucene.472066.n3.nabble.com/Relative-urls-outlinks-tp4008601p4008603.html
Sent from the Nutch - User mailing list archive at Nabble.com.
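
One way to sanity-check candidate patterns before running a crawl is a small script that mimics the filter. The following is only a rough sketch in Python, not Nutch itself. It assumes the rules are tried top to bottom, that the first rule whose regex is found anywhere in the URL decides, and that a URL matching no rule is dropped. The grouped patterns and the `accepts` helper are made up for illustration, and Python's `re` module stands in for Java's regex engine, which should behave the same for patterns this simple.

```python
import re

# Candidate rules in regex-urlfilter.txt style: '+' accepts, '-' rejects.
# These patterns are illustrative guesses, not verified against a real Nutch run.
RULES = [
    r"+^http://myhost:81/site1/(test|list|view)\.php\?",
    r"+^http://myhost:81/site2/(test2|list)\.php\?",
    r"+^file:////sharedrive1/share1/",
    r"-.",  # reject everything else
]


def accepts(url, rules=RULES):
    """Return True if the first rule whose regex is found in url is a '+' rule."""
    for rule in rules:
        sign, pattern = rule[0], rule[1:]
        if re.search(pattern, url):
            return sign == "+"
    return False  # assumption: a URL that matches no rule is dropped


if __name__ == "__main__":
    samples = [
        "http://myhost:81/site1/test.php?id=1234",
        "http://myhost:81/site2/list.php?page=25&count=12344",
        "file:////sharedrive1/share1/",
        "http://myhost:81/site3/other.php?id=1",
    ]
    for url in samples:
        print("accept" if accepts(url) else "reject", url)
```

Note that the dots are escaped (`\.php`) so they only match a literal dot, and the trailing `.*` is not needed if the pattern only has to be found rather than match the whole URL. If the stock regex-urlfilter.txt is the starting point, its default lines that skip `file:` URLs and URLs containing `?` or `=` would probably have to be removed or moved below the accept rules, since an earlier reject would win.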