Re: [Bug-wget] How to ignore link like index.html?lang=ja?

2010-06-01 Thread Peng Yu
On Sat, May 29, 2010 at 12:11 PM, Micah Cowan mi...@cowan.name wrote:
 Unfortunately, wget doesn't currently let you match query strings. Yes, this 
 is a major shortcoming.

 Peng Yu pengyu...@gmail.com wrote:

There is the link index.html?lang=ja in index.html. I want to ignore
such links. I use the following command. Would you please let me know
how to ignore index.html?lang=ja?

Is there a walkaround for this shortcoming?

-- 
Regards,
Peng



[Bug-wget] How to set -l to be zero? (Or how to download a single webpage and convert the absolute links to relative links with wget?)

2010-06-01 Thread Peng Yu
I need to use the option --convert-links to download only one webpage,
because I want to convert absolute links to relative links if the
links are under the host directory where the webpage is in. Since I
only interest in one page, I'd like to set -l be zero. But it seems
that if I set it to zero, wget will download everything linked from
this webpage.

Is there a way to download a single webpage and convert the absolute
links to relative links with wget?

-- 
Regards,
Peng



Re: [Bug-wget] How to set -l to be zero? (Or how to download a single webpage and convert the absolute links to relative links with wget?)

2010-06-01 Thread Micah Cowan
On 06/01/2010 04:36 PM, Peng Yu wrote:
 I need to use the option --convert-links to download only one webpage,
 because I want to convert absolute links to relative links if the
 links are under the host directory where the webpage is in. Since I
 only interest in one page, I'd like to set -l be zero. But it seems
 that if I set it to zero, wget will download everything linked from
 this webpage.

The way to say I don't want to download recursively isn't -r -l 0,
it's to leave out -r altogether.

-- 
Micah J. Cowan
http://micah.cowan.name/



Re: [Bug-wget] How to ignore link like index.html?lang=ja?

2010-06-01 Thread Micah Cowan
On 06/01/2010 03:51 PM, Peng Yu wrote:
 On Sat, May 29, 2010 at 12:11 PM, Micah Cowan mi...@cowan.name wrote:
 Unfortunately, wget doesn't currently let you match query strings. Yes, this 
 is a major shortcoming.

 Peng Yu pengyu...@gmail.com wrote:

 There is the link index.html?lang=ja in index.html. I want to ignore
 such links. I use the following command. Would you please let me know
 how to ignore index.html?lang=ja?
 
 Is there a walkaround for this shortcoming?

Not that I'm aware of.

-- 
Micah J. Cowan
http://micah.cowan.name/



Re: [Bug-wget] How to set -l to be zero? (Or how to download a single webpage and convert the absolute links to relative links with wget?)

2010-06-01 Thread Micah Cowan
On 06/01/2010 05:53 PM, Peng Yu wrote:
 On Tue, Jun 1, 2010 at 6:48 PM, Micah Cowan mi...@cowan.name wrote:
 On 06/01/2010 04:36 PM, Peng Yu wrote:
 I need to use the option --convert-links to download only one webpage,
 because I want to convert absolute links to relative links if the
 links are under the host directory where the webpage is in. Since I
 only interest in one page, I'd like to set -l be zero. But it seems
 that if I set it to zero, wget will download everything linked from
 this webpage.

 The way to say I don't want to download recursively isn't -r -l 0,
 it's to leave out -r altogether.
 
 It seems that --convert-links only works if -r is specified, right? If
 I want to use --convert-links, then I have to specify -r?

Works for me without -r, at least in version 1.12.

-- 
Micah J. Cowan
http://micah.cowan.name/