NOOOooooo!!!  Just kidding! :-)  

So maybe you can clear something up for me.  In the future while building a
new crawldb, if I only wanted to accept urls from the following:

http://myhost:81/site1/test.php?id=1234
http://myhost:81/site1/list.php?page=1234&count=21
http://myhost:81/site1/view.php?id=1234
http://myhost:81/site2/test2.php?id=12233
http://myhost:81/site2/list.php?page=25&count=12344

file:////sharedrive1/share1/

How would the regex-urlfilter look for the php pages?

+^http://myhost:81/site1/test.php\?.*    ??? 





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Relative-urls-outlinks-tp4008601p4008603.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to