Hi, I am writing code to make Nutch to fetch files in relative path in a html page.
The format of url of the webpage can be http://www.mysite.com/folder1/page.html or http://www.mysite.com/folder1 The format of path of the file can be "../../image.jpg", or "http://www.example.com/image.jpg", or "folder2/image.jpg", just a few examples. I am pretty sure that Nutch already has code for this, I just don't want to reinvent the wheel. Does anyone know the location of the code? If not, do you know of any library that does this? Thanks. -- View this message in context: http://www.nabble.com/How-does-Nutch-Fetch-Files-in-Relative-Path--tp23047386p23047386.html Sent from the Nutch - User mailing list archive at Nabble.com.
