specifying
"https" by name anywhere in the script?
--
Matthew.van.Eerde (at) hbinc.com 805.964.4554 x902
Hispanic Business Inc./HireDiversity.com Software Engineer
Jacinta Richardson wrote:
> Please note that such a change is likely to break a lot of existing
> programs which read in a filename from somewhere and then don't chomp
> it.
This breakage would arguably be a feature.
--
Matthew.van.Eerde (at) hbinc.com 805.964.4554
kie that is for
> all subdomains, i.e. in Netscape cookie manager, site name is listed
> as "intervalworld.com", and the cookies domain is listed as
> ".intervalworld.com".
Hmmm... (tests in Firefox...)
I'm wrong, browsers do allow cross-doma
rent sites" because
the server name is different, and won't pass cookies received on one to the
other.
Maybe your inputs are incorrect? The form that I see shows three inputs named
"a", "loginID", and "rememberMe".
--
Matthew.van.Eerde (at) hbinc.com 805.964.4554 x902
Hispanic Business Inc./HireDiversity.com Software Engineer
,
);
# use cookies
$self->cookie_jar(HTTP::Cookies->new());
I don't need to save cookies from session to session... but if you do, there's
a way to load a cookie jar from a file.
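
For the file-backed case mentioned above, here is a minimal sketch. It assumes a plain LWP::UserAgent object rather than the subclass used in the snippet, and the file name `cookies.txt` is made up for illustration.

```perl
use strict;
use warnings;
use LWP::UserAgent;
use HTTP::Cookies;

# 'cookies.txt' is a hypothetical path; any writable location will do.
my $jar = HTTP::Cookies->new(
    file     => 'cookies.txt',  # loaded at construction time if the file exists
    autosave => 1,              # written back when the jar object is destroyed
);

my $ua = LWP::UserAgent->new;
$ua->cookie_jar($jar);          # requests made through $ua now use this jar
```

Cookies that the server marks as session-only are not written to the file unless the jar is also created with `ignore_discard => 1`.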
--
Matthew.van.Eerde (at) hbinc.com 805.964.4554 x902
Hispanic Business Inc./HireDiversity.com Software Engineer
Prateek wrote:
> my $ua = LWP::UserAgent->new;
> $ua->proxy(['http', 'ftp', 'https'],
> 'http://:@< MyProxy>:1080');
>
> output is :
> Failed: 400 Bad Request
>
> please guide,
Are you %dd-encoding URI-unsafe chara
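
To illustrate what %dd-encoding of credentials in a proxy URL could look like, here is a rough sketch. The user name, password, and proxy host are hypothetical, and the helper `pct_encode` is not part of LWP; it assumes ASCII input and escapes everything outside the RFC 3986 unreserved set.

```perl
use strict;
use warnings;

# Hypothetical helper: percent-encode every character that is not an
# RFC 3986 "unreserved" character. Assumes the input is plain ASCII.
sub pct_encode {
    my ($s) = @_;
    $s =~ s/([^A-Za-z0-9\-._~])/sprintf('%%%02X', ord($1))/ge;
    return $s;
}

# Made-up credentials and proxy host, for illustration only.
my ($user, $pass) = ('me@example.com', 'p@ss:word');
my $proxy = sprintf 'http://%s:%s@proxy.example.com:1080',
    pct_encode($user), pct_encode($pass);

print "$proxy\n";
# prints "http://me%40example.com:p%40ss%3Aword@proxy.example.com:1080"
```

The resulting string is what would be passed as the second argument to `$ua->proxy(...)`.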
rprise Crawler/6.4 (helpdesk at fast.no)
Jakarta HTTP Client/1.0
UPG1 UP/4.0 (compatible; Blazer 1.0)
On the other hand it's doubtful that any of these use RobotRules.pm, so these
don't imply that a patch is called for.
--
Matthew.van.Eerde (at) hbinc.com 805.964.4554 x902
Hispanic Business Inc./HireDiversity.com Software Engineer
My robot will incorrectly refuse to spider anything, because
WWW::RobotRules::agent shortens $self->{'ua'} to "Hispanic".
I propose the attached patch to the RobotRules.pm included in libwww-perl 5.803.
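
The shortening described above can be reproduced with a small stand-alone sketch. This is not copied from RobotRules.pm: the function `short_agent_name` and the agent string are hypothetical, and the sketch assumes the shortening keeps only the first whitespace-delimited word and drops any "/version" suffix.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the shortening step; not the module's own code.
sub short_agent_name {
    my ($name) = @_;
    $name = $1 if $name =~ m/(\S+)/;   # keep only the first whitespace-delimited word
    $name =~ s{/.*}{};                 # drop a "/version" suffix, if any
    return $name;
}

# Made-up multi-word agent name, for illustration only.
my $short = short_agent_name('Hispanic Business Spider/1.0');
print "$short\n";   # prints "Hispanic"
```

Under that assumption, any agent name containing a space is reduced to its first word.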
--
Matthew.van.Eerde (at) hbinc.com 805.96
well?
I've added a "warn" line in the case where a record separation is assumed... see
http://www.geocities.com/mvaneerde/RobotRules.patch-4.txt
rules.t patch:
http://www.geocities.com/mvaneerde/rules-patch.txt
make test runs fine
--
Matthew.van.Eerde (at) hbinc.com
Matthew.van.Eerde wrote:
> Gisle Aas wrote:
>> Andy Lester <[EMAIL PROTECTED]> writes:
>>
>>> [EMAIL PROTECTED] ([EMAIL PROTECTED]) wrote:
>>>> I've cobbled together a "fixed" version of RobotRules.pm
>>>
>>> Send a patc
es to be posted to this list
Here's the patch, for the list.
http://www.geocities.com/mvaneerde/RobotRules.patch.txt
--
Matthew.van.Eerde (at) hbinc.com 805.964.4554 x902
Hispanic Business Inc./HireDiversity.com Software Engineer
-- how can I get it
reviewed and ultimately blessed by the LWP community?
http://www.geocities.com/mvaneerde/RobotRules.pm.txt
--
Matthew.van.Eerde (at) hbinc.com 805.964.4554 x902
Hispanic Business Inc./HireDiversity.com Software Engineer