Re: Bug: wget cannot handle quote

2005-06-21 Thread Hrvoje Niksic
Will Kuhn <[EMAIL PROTECTED]> writes:

> Apparentl wget does not handle single quote or double quote very well.
> wget with the following arguments give error.
>
>  wget
>  --user-agent='Mozilla/5.0' --cookies=off --header
>  'Cookie: testbounce="testing";
>  ih="b'!!!0T#8G(5A!!#c`#8HWsH!!#wt#8I0HY!!#yf#8I0G3";
>  cf="b$y~!!!D)#"; hi="b#!!!D)8I=C]"'
>  'ad.yieldmanager.com/imp?z=12&n=2&E=01-329&I=508&S=508-1'
>  -O /home/admin/http/wwwscanfile.YYO3Cy

You haven't stated which error you get, but on my system the error
comes from the shell and not from Wget.  The problem is that you used
single quotes to quote a string that contains, among other things,
single quotes.  This effectively turned off the quoting for some
portions of the text, causing the shell to interpret the bangs ("!") 
as (invalid) history events.

To correct the problem, replace ' within single quotes with something
like '\'':

wget --user-agent='Mozilla/5.0' --cookies=off --header 'Cookie: 
testbounce="testing"; 
ih="b'\''!!!0T#8G(5A!!#c`#8HWsH!!#wt#8I0HY!!#yf#8I0G3"; 
cf="b$y~!!!D)#"; hi="b#!!!D)8I=C]"' 
'ad.yieldmanager.com/imp?z=12&n=2&E=01-329&I=508&S=508-1' -O 
/home/admin/http/wwwscanfile.YYO3Cy


Re: new subversion source code repository for Wget

2005-06-21 Thread Hrvoje Niksic
Hrvoje Niksic <[EMAIL PROTECTED]> writes:

> If you want to check out the 1.10 branch (recommended for
> distributions because it only contains bug fixes), you can use:
>
> svn checkout http://svn.dotsrc.org/repo/wget/trunk/ wget

Oops!  The above should read something like:

svn checkout http://svn.dotsrc.org/repo/wget/branches/1.10/ wget-stable


Re: new subversion source code repository for Wget

2005-06-21 Thread Hrvoje Niksic
Mauro Tortonesi <[EMAIL PROTECTED]> writes:

> The new repository is accessible at:
>
> http://svn.dotsrc.org/repo/wget/

For the uninitiated, to checkout the repository, you need a reasonably
recent version of the subversion client and issue something like:

svn checkout http://svn.dotsrc.org/repo/wget/trunk/ wget

If you try to checkout http://svn.dotsrc.org/repo/wget/trunk/, you
will get a checkout of *all* branches and tags, which is probably not
what you had in mind.

If you want to check out the 1.10 branch (recommended for
distributions because it only contains bug fixes), you can use:

svn checkout http://svn.dotsrc.org/repo/wget/trunk/ wget

If you want to get a diff between the 1.10 release and the tip of the
1.10 branch, use:

svn diff http://svn.dotsrc.org/repo/wget/tags/WGET_1_10 \
 http://svn.dotsrc.org/repo/wget/branches/1.10

...which you can do without having pulled the source or in any way
initiated an svn session.  Magic.


new subversion source code repository for Wget

2005-06-21 Thread Mauro Tortonesi

It's with great pleasure that I announce the migration of the Wget source code 
repository from CVS to subversion.

Hrvoje and I have been talking a lot about the switch to subversion in the 
last few months. Subversion is a very powerful development tool and we expect 
the new source code repository not only to have a much better access 
performance than the previous one, but also to provide us with a better 
support for both branching and commit notifications via email, and eventually 
to speed up the development of Wget.

The new repository is accessible at:

http://svn.dotsrc.org/repo/wget/

Please, don't use the old repository anymore. It is not up-to-date and it's 
going to be dismissed soon. The developers documentation on the official Wget 
website will be updated in a couple of hours.

Notice that for the moment the commit notification via email mechanism is not 
working. Please, understand that this situation is only temporary and that a 
better mechanism will be set up very soon.

Last but not least, I would like to thank the dotsrc.org (former sunsite.dk) 
staff, and especially Christian Reiniger, for their kind help in setting up a 
subversion server for us and performing the migration of the repository.

-- 
Aequam memento rebus in arduis servare mentem...

Mauro Tortonesi  http://www.tortonesi.com

University of Ferrara - Dept. of Eng.http://www.ing.unife.it
Institute for Human & Machine Cognition  http://www.ihmc.us
GNU Wget - HTTP/FTP file retrieval tool  http://www.gnu.org/software/wget
Deep Space 6 - IPv6 for Linuxhttp://www.deepspace6.net
Ferrara Linux User Group http://www.ferrara.linux.it


Re: Clobber??

2005-06-21 Thread Hrvoje Niksic
Ariel <[EMAIL PROTECTED]> writes:

> Was looking for an option to skip existing files, and after some time
> (minutes? hours?) of no luck, i looked at that option -nc "Dont
> clobber existing files".

clobber == overwrite

http://www.science.uva.nl/~mes/jargon/c/clobber.html

That term has been used so much in computing that I stopped thinking
of it as jargon, sorry.  I'll add an explanation in the docs.


Clobber??

2005-06-21 Thread Ariel
Was looking for an option to skip existing files, and after some time 
(minutes? hours?) of no luck, i looked at that option -nc "Dont clobber 
existing files". Ive checked the dictionary for what "clobber" meant, 
didnt help either. Clobber = hitting hard. What possibily hitting hard a 
file should mean? Anyway, I gave it a try, and it worked just as i wished.


So, cmon, why dont make our life easier, and change "clobber" to 
"overwrite", or "download"?


ps: I see now on a "computer dictionary" that clobber is a jargon for 
overwriting. Ok. One more useless thing to remember.




BUG? using -O effectively disables -N

2005-06-21 Thread Dennis Kaarsemaker
Hello,

When I try to combine the options -O and -N, timestamping is not used,
but the file is downloaded all the time. Timestamping does however check
the file that would be used without the -O option. Consider the
following scenario:

$ wget http://example.com/index.html

wget retrieves the file. Assume that after this, index.html changes.

$ wget -N http://example.com/index.html -O example_index.html

wget retrieves it correctly again, but does so after inspecting
index.html
Assume that after this, the index.html does not change.

$wget -N http://example.com/index.html -O example_index.html

wget will now again download index.html because it inspects the local
index.html and not example_index.html!

$wget -N http://example.com/index.html

wget will now again correctly download index.html

$wget -N http://example.com/index.html -O example_index.html

And because index.html is now up-to-date wget will not re-download the
page.

Because this behaviour is unexpected and undocumented, I consider it a
bug.
-- 
Sincerely,
Dennis Kaarsemaker


signature.asc
Description: This is a digitally signed message part