Re: [Bug-wget] How to ignore link like index.html?lang=ja?
On Sat, May 29, 2010 at 12:11 PM, Micah Cowan mi...@cowan.name wrote: Unfortunately, wget doesn't currently let you match query strings. Yes, this is a major shortcoming. Peng Yu pengyu...@gmail.com wrote: There is the link index.html?lang=ja in index.html. I want to ignore such links. I use the following command. Would you please let me know how to ignore index.html?lang=ja? Is there a walkaround for this shortcoming? -- Regards, Peng
[Bug-wget] How to set -l to be zero? (Or how to download a single webpage and convert the absolute links to relative links with wget?)
I need to use the option --convert-links to download only one webpage, because I want to convert absolute links to relative links if the links are under the host directory where the webpage is in. Since I only interest in one page, I'd like to set -l be zero. But it seems that if I set it to zero, wget will download everything linked from this webpage. Is there a way to download a single webpage and convert the absolute links to relative links with wget? -- Regards, Peng
Re: [Bug-wget] How to set -l to be zero? (Or how to download a single webpage and convert the absolute links to relative links with wget?)
On 06/01/2010 04:36 PM, Peng Yu wrote: I need to use the option --convert-links to download only one webpage, because I want to convert absolute links to relative links if the links are under the host directory where the webpage is in. Since I only interest in one page, I'd like to set -l be zero. But it seems that if I set it to zero, wget will download everything linked from this webpage. The way to say I don't want to download recursively isn't -r -l 0, it's to leave out -r altogether. -- Micah J. Cowan http://micah.cowan.name/
Re: [Bug-wget] How to ignore link like index.html?lang=ja?
On 06/01/2010 03:51 PM, Peng Yu wrote: On Sat, May 29, 2010 at 12:11 PM, Micah Cowan mi...@cowan.name wrote: Unfortunately, wget doesn't currently let you match query strings. Yes, this is a major shortcoming. Peng Yu pengyu...@gmail.com wrote: There is the link index.html?lang=ja in index.html. I want to ignore such links. I use the following command. Would you please let me know how to ignore index.html?lang=ja? Is there a walkaround for this shortcoming? Not that I'm aware of. -- Micah J. Cowan http://micah.cowan.name/
Re: [Bug-wget] How to set -l to be zero? (Or how to download a single webpage and convert the absolute links to relative links with wget?)
On 06/01/2010 05:53 PM, Peng Yu wrote: On Tue, Jun 1, 2010 at 6:48 PM, Micah Cowan mi...@cowan.name wrote: On 06/01/2010 04:36 PM, Peng Yu wrote: I need to use the option --convert-links to download only one webpage, because I want to convert absolute links to relative links if the links are under the host directory where the webpage is in. Since I only interest in one page, I'd like to set -l be zero. But it seems that if I set it to zero, wget will download everything linked from this webpage. The way to say I don't want to download recursively isn't -r -l 0, it's to leave out -r altogether. It seems that --convert-links only works if -r is specified, right? If I want to use --convert-links, then I have to specify -r? Works for me without -r, at least in version 1.12. -- Micah J. Cowan http://micah.cowan.name/