Re: [Bug-wget] Favicon is not downloaded (Suggestion for improvement)

2017-01-05 Thread Tim Rühsen
On Thursday, 5 January 2017 10:40:48 CET Dale R. Worley wrote:
> Павел Серегов  writes:
> > Often there is no favicon link in index.html, even though the site has one.
> > 
> > My suggestion:
> > When wget -m is used, it should also download http://example.com/favicon.ico
> > 
> > How do you like the idea?
> 
> The documentation for -m is:
> 
>    -m
>    --mirror
>    Turn on options suitable for mirroring.  This option turns on
>    recursion and time-stamping, sets infinite recursion depth and
>    keeps FTP directory listings.  It is currently equivalent to -r
>    -N -l inf --no-remove-listing.
> 
> I suggest defining "--favicon" specifically to download
> http(s)://<host>/favicon.ico, and then add --favicon to the specification
> of --mirror.

With default 'on'. Dale, would you mind opening an issue for that at
https://github.com/rockdaboot/wget2 ?
IMO, it should go there first.

Tim




Re: [Bug-wget] ot: clicking email links advice

2017-01-05 Thread Tim Rühsen
On Thursday, 5 January 2017 09:34:39 CET Ander Juaristi wrote:
> Hi,
> 
> On 28/12/16 05:57, voy...@sbt.net.au wrote:
> > is there a way to run wget with that url and, tell it to 'press' one of
> > the buttons?
> 
> Not directly as you describe. Wget does not submit web forms.
> 
> You would need to write an external application to parse the HTML,
> generate the target link and then feed that link to wget.
> 
> Or,
> 
> You could use wget2, which we're designing as a library, although it's
> still in pre-alpha. It has functions to extract links from an HTML
> document (see the example in [1]), although I don't know if it can
> extract URLs from <form action="..."> fields as well, which is what I
> guess you need. Maybe @Tim can give more details on this.

The 'action' URL will not be followed automatically by wget; it is filtered out
on purpose. Libwget should parse it. wget2 may not filter it out right now, but
that should be fixed.

Basically, you need to parse the HTML to get more information. The library
contains a scanner and example code that uses it. If you want to go that way,
you need some background knowledge of the HTML form mechanism (as Dale pointed
out).

If you have just one use case, open a browser's developer tools (Firefox
and Chromium have those), fill in your form as usual and see what is
transferred to the server. You can then set the header lines and content via wget.
Cookies normally play a role when it comes to login/authentication.
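
As a rough sketch of that last step: once the browser's tools show which
URL, form fields and cookies are involved, the request can be replayed with
wget's --post-data, --header and cookie options. The URLs, field names and
the Referer header below are placeholders, and submit_form and
fetch_with_cookies are hypothetical wrapper names, not part of wget.
Setting WGET=echo prints the arguments instead of sending anything.

```shell
#!/bin/sh
# WGET can be overridden (e.g. WGET=echo) to print the arguments instead of
# contacting a server.
WGET=${WGET:-wget}

# Hypothetical helper: POST url-encoded form data and save the cookies the
# server sets, including session cookies.
# $1 = form action URL, $2 = url-encoded body, $3 = cookie jar file
submit_form() {
    "$WGET" --save-cookies="$3" --keep-session-cookies \
            --header="Referer: $1" \
            --post-data="$2" -O - "$1"
}

# Hypothetical helper: fetch a page, sending the cookies saved above.
# $1 = URL, $2 = cookie jar file
fetch_with_cookies() {
    "$WGET" --load-cookies="$2" -O - "$1"
}

# Usage (placeholder values; performs network requests):
#   submit_form 'http://example.com/confirm' 'choice=yes&id=123' cookies.txt
#   fetch_with_cookies 'http://example.com/result' cookies.txt
```

The body passed to --post-data has to be url-encoded already; wget sends it
as given.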

Tim




Re: [Bug-wget] Favicon is not downloaded (Suggestion for improvement)

2017-01-05 Thread Dale R. Worley
Павел Серегов  writes:
> Often there is no favicon link in index.html, even though the site has one.
>
> My suggestion:
> When wget -m is used, it should also download http://example.com/favicon.ico
>
> How do you like the idea?

The documentation for -m is:

   -m
   --mirror
   Turn on options suitable for mirroring.  This option turns on
   recursion and time-stamping, sets infinite recursion depth and
   keeps FTP directory listings.  It is currently equivalent to -r -N
   -l inf --no-remove-listing.

I suggest defining "--favicon" specifically to download
http(s)://<host>/favicon.ico, and then add --favicon to the specification
of --mirror.
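
Until such an option exists, a possible workaround is to name the favicon
URL explicitly next to the site URL, since wget accepts several URLs on one
command line. A minimal sketch, assuming the site keeps its icon at the
conventional /favicon.ico path; favicon_url and mirror_with_favicon are
hypothetical helper names, not wget features:

```shell
#!/bin/sh
# Hypothetical helper (not part of wget): derive scheme://host/favicon.ico
# from any http or https URL on the site.
favicon_url() {
    printf '%s\n' "$1" | sed -E 's#^(https?://[^/]+).*$#\1/favicon.ico#'
}

# Hypothetical wrapper: mirror the site and request the favicon explicitly
# as a second URL on the same command line.
mirror_with_favicon() {
    wget -m "$1" "$(favicon_url "$1")"
}

# Usage (performs network requests):
#   mirror_with_favicon http://example.com/
```

If the site has no /favicon.ico, wget should just report an error for that
one URL.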

Dale



[Bug-wget] Unable to establish SSL connection error

2017-01-05 Thread Raitis Misa

Hi!

I'm trying to download APOD with this command line:

wget.exe -x -r -k -E -nc -e robots=off --page-requisites --tries=2 --level=2 --timeout=20 --user-agent="Mozilla 1.5" --secure-protocol=TLSv1 --no-check-certificate http://apod.nasa.gov/apod/archivepix.html

Other --secure-protocol= options give the same result, as does omitting
--no-check-certificate.


GNU Wget 1.11.4
Microsoft Windows [Version 10.0.10586]

Thank you!

Best regards,
Raitis Misa



Re: [Bug-wget] ot: clicking email links advice

2017-01-05 Thread Ander Juaristi
Hi,

On 28/12/16 05:57, voy...@sbt.net.au wrote:
> 
> is there a way to run wget with that url and, tell it to 'press' one of
> the buttons?

Not directly as you describe. Wget does not submit web forms.

You would need to write an external application to parse the HTML,
generate the target link and then feed that link to wget.

Or,

You could use wget2, which we're designing as a library, although it's
still in pre-alpha. It has functions to extract links from an HTML
document (see the example in [1]), although I don't know if it can
extract URLs from <form action="..."> fields as well, which is what I
guess you need. Maybe @Tim can give more details on this.

[1]
https://github.com/rockdaboot/wget2/blob/master/examples/print_html_urls.c
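
For the first option, the "parse the HTML and generate the target link"
step could look roughly like the sketch below. It is a regex-level
approximation, not a real HTML parser: it only handles double-quoted,
lowercase action="..." attributes inside <form> tags and leaves relative
URLs unresolved. form_actions is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: read HTML on stdin and print the value of each
# double-quoted action attribute found in a <form ...> tag, one per line.
form_actions() {
    # Join lines so a tag that spans several lines is matched as a whole.
    tr '\n' ' ' |
        # Keep only the opening <form ...> tags.
        grep -oiE '<form[^>]*>' |
        # Print the action value; tags without one produce no output.
        sed -nE 's/.*[[:space:]]action="([^"]*)".*/\1/p'
}

# Usage (placeholder URL; performs a network request):
#   wget -q -O - 'http://example.com/page.html' | form_actions
```

The printed value is only the target of the form. The button that was
"pressed" and any other fields still have to be sent along with the request.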

> 
> thanks for any pointer or advice
> 
> V
> 
> 


