suggest would be
useful. If you already have something in the works (you mentioned
publishing some code), I'd be interested in learning more.
On 11/23/06, John J Lee <[EMAIL PROTECTED]> wrote:
On Wed, 22 Nov 2006, Christopher Hart wrote:
> Would an "easier" (yet still
On Wed, 22 Nov 2006, Christopher Hart wrote:
Would an "easier" (yet still monumental) starting point be to tackle the DOM
implementation independent of a JS engine?
[...]
This seems like a great open source project - it's way too much to handle
for most individual developers, but I think could
On Wed, 22 Nov 2006, Christopher Hart wrote:
I agree that folks have been talking about JS for a long time, and that it's
frustrating, but what I'm suggesting is that we need to tackle a different
problem first.
[...]
An HTML DOM implemention is a necessary part of JS support, sure (though
St
On Wed, 22 Nov 2006, Christopher Hart wrote:
I'm willing to take a crack at laying out a vision, high level objectives
and some implementation requirements based on my experiences and see how
[...]
Everyone who's seriously interested is willing to do that. Indeed, many
have surely done that
On Wed, 22 Nov 2006, apv wrote:
I've also been interested for a long time and tried to work on
this 2 years ago but didn't get far enough to bother trying
to release anything.
[...]
I would gladly throw down if there was a group effort with a
real plan. I'm not the right hacker to lead this pr
On Wed, 22 Nov 2006, Stefan Seifert wrote:
[...]
I too thought about that. Maybe using the JavaScript or
JavaScript::Spidermonkey module and XML::DOM. I will certainly
experiment around with them, as we need it at work. Doesn't seem to be
Sigh, we've had this same little discussion at least fiv
On Fri, 3 Nov 2006, Christopher Hart wrote:
I know there is a rich history of challenges implementing any kind of
JavaScript interpretation using Mechanize or any other web
scripting/automation utility, but I was wondering if anyone has tried to
focus on "Mechanizing" AJAX?
I realize this would
On Tue, 8 Aug 2006, Churton Budd wrote:
[...]
using LWP for display within this portal. When I get the return of the
PDF, the adobe acrobat plugin pops up but it comes up blank. For multi
page ECG's it comes up with multiple blank pages. I have saved this
blank file and looked at it, it seem
On Mon, 10 Jul 2006, flynfast wrote:
I'm trying to write a script to send a query to the patent and
trademark office webpage and capture the URL's pointing to the patents
identified. The problem is that the results appear on more than one
page (like Google lists its results on multiple pages).
On Thu, 6 Apr 2006, Peter Stevens wrote:
[...]
One typical use of Javascript is to perform argument checking before
posting to the server. The URL you want is probably just buried in
the Javascript function. Do a regular expression match on
| $mech->content()| to find the link that
On Wed, 5 Apr 2006, Mike Schilli wrote:
[...]
As soon as someone gets going and comes up with a reference implementation
(every browser naturally has its own DOM implementation, that's why IE
and Firefox behave differently at times), WWW::Mech is in business.
How cool would that be!
[...]
Sadl
On Sat, 17 Dec 2005, Andy Lester wrote:
> On Sat, Dec 17, 2005 at 12:16:29PM -0500, Christopher Hart ([EMAIL
> PROTECTED]) wrote:
> > There are also JavaScript engines available in C and Java
> > (SpiderMonkey and Rhino, respectively, available on mozilla.org). You
> > may be able to leverage th
On Fri, 18 Nov 2005, Christian Montanari wrote:
[...]
> My Quest has been in the dream of many others already.
> It is all about tackling this javascripting trought WWW::Mechanize but, tell
> me if
> my ideas about this topic is wrong, it seems that no good souls has ever yet
> done it!
[...]
h
I've lost the archives for this list again.
I'm sure somebody has one on the web. Can anybody point me to it? There
are lots of links to old sites that stop in 2001, and GMANE seems to start
in 2005, but I can't find anything between 2001 and 2005.
Cheers
John
On Thu, 28 Jul 2005, Mysql user wrote:
> I'm trying to write a perl program to access the configuration of a VOIP
> telephone through its web interface. The web interface assigns you a
> session id cookie once you've logged in. It works with browsers but not
> with libwww-perl5.803 as shipped with
On Thu, 14 Jul 2005, Peter Stevens wrote:
> Under the heading of small serious amounts of work...
>
> I mentioned previously Win32::IE::Mechanize - does anybody have any
> ideas on how to do the same thing with Firefox under Linux?
[...]
Warning: I'm not up-to-date on this, take what I say with a
[John Lee]
> That's not a small amount of work you've just set Warren to do. :-)
>
> (speaking as somebody who made a semi-serious attempt at it, in Python)
[deborah sciales]
> Well, I guess it depends on his set of needs, and he does have
> tokeparser and treebuilder, etc to use.
>
> If his java
[Warren Pollans]
> The problem I'm running into is "trying to deal with scripts that use
> javascript" - so far, I've had to ignore them or, at least, those
[deborah sciales]
> You might also try writing your own javascript parsing routines?
That's not a small amount of work you've just set Warre
On Mon, 11 Jul 2005, Warren Pollans wrote:
> I've been using WWW::Mechanize to automate testing of cgi scripts -
> works great!
>
> The problem I'm running into is "trying to deal with scripts that use
> javascript" - so far, I've had to ignore them or, at least, those
[...]
> I really like being
On Sat, 12 Mar 2005, Andrew Johnson wrote:
> I've been wrestling with a script to scrape some information off of
[...]
> What else could I try?
[...]
Hi Andrew
Read some past messages on this list from me. I think I've made the same
guesses about fifty times now ;-/ and most of the debugging hin
On Sat, 5 Mar 2005, Robert Barta wrote:
> I am using WWW::Mechanize/LWP and some of their subclasses now for
> several things and I see an architectural problem I will be facing in
> some future:
>
> For downstream developers (and for me) I need to offer a facility
> to choose a user agent which
On Tue, 1 Mar 2005, Andrea Setti wrote:
> Thank you for the answer.
>
> I had a look th WWW::Mechanize and it does almost everything that i need.
>
> The only thing i cannot understand is: how can i forward the cookie to the
> real browser?
> I need to fetch it from the real login page and then
On Tue, 1 Mar 2005, Peter Stevens wrote:
> HTTP::Cookies has two submodules. one for Mozilla browsers and one for
> Microsoft browsers. Unfortunately the MS version does not support saving
> the cookies. (BTW - everybody knows, Firefox is the better browser ;-) ).
[...]
Those are only needed if
On Fri, 25 Feb 2005, Martin Kos wrote:
> hi john
>
> > It wants this header (or similar, but this is a minimal one):
> > Accept: text/html
> i have added this header and it just works!!! thanks a LOT!
>
> > Maybe mechanize should sent an Accept header by default?
> i think that would be a good i
On Tue, 22 Feb 2005, Martin Kos wrote:
[...]
> i try to login to the page http://mymobile.sunrise.ch/ but it seems like
> mechanize is not doing the redirect that is on the start site... if i
> try with my browser or wget i get redirect to a page like
> http://mymobile.sunrise.ch/portal/res/gues
On Tue, 18 Jan 2005, Edward Peschko wrote:
> hey all,
>
> I've got a data retrieval problem - I need to get data from a secure
> website (ie: https) which has forms using javascript.
>
> What base technology can I use to do this? Will LWP suffice?
>
> I can't believe this isn't a FAQ - I searched
On Wed, 19 Jan 2005, Peter Stevens wrote:
> Edward Peschko wrote:
[...]
> >If there was an integration between LWP and seamonkey, what form
> >of integration would people feel would be most useful?
[...]
> I think seamonkey integration would be a good thing and see it as an
> alternative to mech. E
On Thu, 30 Dec 2004, [EMAIL PROTECTED] wrote:
[...]
> Why does WWW::Mechanize get directed to cookieerror.htm?
Beats me too. I get the same problem with this site using Python's
urllib2 &c. I tried near-identical headers to Firefox, and got the error
page.
Another guess: Sounds odd, I know, but
On Thu, 30 Dec 2004, [EMAIL PROTECTED] wrote:
>
> I am trying to determine why the following commands to WWW::Mechanize::Shell
> result as they do:
>
> [EMAIL PROTECTED] trwww]$ perl -MWWW::Mechanize::Shell -e 'shell'
> >get https://www.setsivr.odjfs.state.oh.us/Login.asp
> Retrieving https://www
On Mon, 27 Sep 2004, Ed Avis wrote:
[...]
> It's difficult to produce a self-contained test case since this
> program spends its time hitting someone else's website. I hope that
[...]
Easy but helpful would be to turn on HTTP::Cookies' debugging and post the
output (censored if necessary).
John
On Mon, 13 Sep 2004, Daniel E. Doherty wrote:
[...]
> Here is the javascript function that gets invoked:
> function FormSubmit(objForm)
> {
> var strVersion = new String(navigator.appVersion);
> var arrVersion = strVersion.split(" ");
> var intV
On Wed, 8 Sep 2004, Daniel E. Doherty wrote:
> I hit a page on the FDIC website that allows me to download Bank
> Performance Reports, so-called "Call Reports." I can fill in the fields
> on the page, but the button that kicks off the file transfer is
> generated by an HTML tag like this:
>
>
[
On Wed, 8 Sep 2004, Joseph Alotta wrote:
[...]
> > thing, so I don't read any impoliteness into your request, but only
> > because of conscious effort not to.
[...]
> He did say "please". I think his request was polite and to the point.
>
> You are asking too much from non-native speakers. Let'
On Wed, 8 Sep 2004, Suryya Ghosh wrote:
> How to simulate https secured login using lwp?
[...]
> I have open ssl installed in my system and Net ssleay 2.25 installed in
> my system , but while loging in i am getting arespnse of 302 Moved
> Temporarily
Doesn't sound like an SSL problem. Does it re
On Mon, 9 Aug 2004, Andy Lester wrote:
[...]
> LWP is RFC-compliant. Gisle has done a marvelous job of making sure it
> does just what the RFC says.
>
> WWW::Mechanize is a subclass and superset of LWP that does more
> "browser-like" stuff. Mechanize is meant as a browser in an object,
> whereas
[Juan]
> Why Cookie2: $Version="1" is still sent by default by LWP?
> No browser sends that header by default, neither MSIE, Mozilla nor Konqueror.
>
> I suggest to remove it (at least by default).
> I find much more useful to make LWP masquerade as MSIE instead
> following an RFC nobody follows.
[
On Thu, 5 Aug 2004, Joseph Alotta wrote:
[...]
> I think
> there is something going on in the java code in the first part.
[...]
That sounds like a fair bet . Your next step is to figure out what
that something is.
(Actually, it's JavaScript code. Java != JavaScript -- the two are quite
differe
On Mon, 9 Aug 2004, Richard Lawson wrote:
[...]
> I saw your post at libwww mail list but no answer.
> It's an old post but it's the closest thing in the archives to my issue.
> I have a similar problem where the page sets a client cookie and I need to
> set it in LWP, but I can't seem to confirm t
Juan asked me to forward this to this list. (just this once, Juan; get
yourself a free email account to post from -- eg. fastmail.fm)
John
-- Forwarded message --
Date: Mon, 09 Aug 2004 21:27:18 GMT
From: JUANMARCOSMOREN <[EMAIL PROTECTED]>
To: John J Lee <[EMAIL
bruce, please don't cross-post unless you have some valid reason for it.
On Mon, 28 Jun 2004, bruce wrote:
[...]
> however, if you examine the headers between the server/browser app, you can
> more or less.. see what's being transfered back/forth... in this case, the
> content/post data is availab
This is nothing to do with win32, so I've cut that list from the To: line.
On Sun, 27 Jun 2004, bruce wrote:
[...]
> i was under the impression that if i concatenated the url and the
> content/query from the headers, that i'd be able to "simulate" the submit
What do you mean by "the content/query
On Sat, 29 May 2004, bruce wrote:
> hi...
Hi
[...]
> basically, i'm looking to be able to get class schedule information from the
> http://lca.lehman.cuny.edu/dept/registrar/schedule/coursefinder.asp site.
[...]
#!/usr/bin/perl -w
use WWW::Mechanize;
my $b = WWW::Mechanize->new();
$b->get("htt
On Thu, 13 May 2004, Tim Brody wrote:
[...]
> To the best of my knowledge there aren't any other standards for the
> transport of bibliographic data through URIs.
Sounds fair enough to me.
> Besides that, OpenURL is
> likely to become the standard method of linking within the multi-billion
> dol
On Thu, 1 Apr 2004, JUANMARCOSMOREN wrote:
[...]
> > >Aleksandr Guidrevitch wrote:
> > >>We've found that LWP incorrectly handles cookies
> > >>containing ';' in the cookie value.
> > >>The patch (test case and fix) is attached
[...]
> So, why do you want ';' in cookies if they are not handled
> co
[yeah, I know this is a Perl list, but I thought people here might be
interested, since I like to follow the Perl list]
A new list for discussion of anything related to either web-client
software or URL-processing / -fetching software written in Python.
This includes, but is not limited to, the s
On Sun, 25 Jan 2004, Philippe 'BooK' Bruhat wrote:
> Le dimanche 25 janvier 2004 à 21:22, John J Lee écrivait:
> >
> > > In fact, you can already use HTTP::Proxy to see inside a HTTPS connection:
[...]
> > Any recommendations for a specific one?
>
> Well,
On Sun, 25 Jan 2004, Philippe 'BooK' Bruhat wrote:
> Le dimanche 25 janvier 2004 à 15:46, John J Lee écrivait:
> >
> > BTW, anybody have any tips on software / usage thereof for HTTPS proxying,
> > for debugging purposes, and how to set up with LWP? I'v
Attempting to look at the network traffic generated by a Perl program that
uses LWP for doing HTTPS POSTs, I put this in the driver script:
use LWP::Debug qw(conns);
But, though I see some debugging messages, I don't actually see the HTTP
headers or body data. Same happens with plain HTTP (no S
On Fri, 23 Jan 2004, Gedanken wrote:
> On Fri, 23 Jan 2004, John J Lee wrote:
>
> Yuck. Does it also work if you wave a dead chicken at it? ;-)
>
> Why not check the HTTP headers to find out what's going wrong?
>
> the headers are identical as far as i can tell. aft
On Fri, 23 Jan 2004, Gedanken wrote:
> On Fri, 23 Jan 2004, John J Lee wrote:
>
> Yuck. Does it also work if you wave a dead chicken at it? ;-)
>
> Why not check the HTTP headers to find out what's going wrong?
>
> the headers are identical as far as i can tell. afte
On Fri, 23 Jan 2004, bzzt wrote:
> I'm trying to log on to this site (www.thecityvibe.com/forum/) with the
> followin script but doesn't seem to succeed. Anyone knows what the problem
> might be?
(without reading your script): no cookie jar?
I don't recall if WWW::Mechanize makes one by default
On Fri, 23 Jan 2004, Justin Cook wrote:
[...HTML saved from browser and fetched with LWP appear different...]
> going on here? Is it the difference between a dynamic page and a static
> page being posted to?
No.
> Am I not recieving all the chunks of response in
> time to get the transaction id?
On Fri, 23 Jan 2004, Gedanken wrote:
[...]
> basically i manually set the form action... to the same thing it was
> already set to. and voila, stuff starts working. Ill edit your version
> below to show you what i mean.
Yuck. Does it also work if you wave a dead chicken at it? ;-)
Why not che
On Fri, 23 Jan 2004, Philippe 'BooK' Bruhat wrote:
[...]
> Maybe the transaction is put in the page by some javascript
> (document.print?). Your browser saves the resulting page, while
> WWW::Mechanize works on what the server sends.
No, browsers always save the original document. At least, that'
On Tue, 23 Dec 2003, Gedanken wrote:
[...]
> because, unbeknownst to me, the action for that form happened to have the
> phrase '&lang=FR' in it. well apparently &lang has a special meaning, as
> i can see from my request object that it has been encoded into an escape
> sequence against my will =)
On Wed, 3 Dec 2003, John J Lee wrote:
[...]
> Not in KDE 3.2: it decompresses automatically, so when you save or open
> with KWrite, it's just 200_gzip.xml.
...and I'd take a guess that's because Safari (Apple's browser based on
Konqueror) does the same, because 3.2 ap
On Wed, 3 Dec 2003, Gisle Aas wrote:
> [EMAIL PROTECTED] writes:
[...]
> > http://diveintomark.org/tests/client/http/200_gzip.xml
> >
> > IE "just does it".
>
[...]
> Konqueror suggest saving or opening the file in an
> external app, but the file saved or given to an external app is still
> gzipped
I think there might be a problem with _normalize_path, from HTTP::Cookies.
I'll explain what happens with my Python port, because I have no idea how
Perl and unicode interact: a unicode URI got passed to my equivalent of
_normalize_path() (a unicode string is a separate type from an ordinary
byte-s
On Wed, 19 Nov 2003, John J Lee wrote:
[...]
> The Yahoo email login page is full of Javascript code doing complicated
[...]
BTW, as I must have said here before, the first thing everybody seems to
do is to try to automate their Yahoo email account, so I'm sure there's
lots of free
I was about to say "search the archives", but I can't find them. Surely
they exist?? There are several places that have archives years out of
date, and one with a couple of messages from 2003 and nothing else.
Can the real libwww-perl archive stand up, please?
On Sun, 16 Nov 2003, tv fw wrote:
On Tue, 18 Nov 2003, Brian Spiegel wrote:
[...]
> The launched browser, if the login was successful, should take me to my
> inbox. However, I get a page stating that my browser doesn't allow cookies.
> Has anyone attempted logins with Yahoo or any of these other services? Is
> there something in
On Sun, 16 Nov 2003, John J Lee wrote:
> On Sat, 15 Nov 2003, allan juul wrote:
[...]
> > no - sorry,i didn't mean kill in that unix sense - i close the program
> > with an exit or die or nothing more to do, then restart the program a
> > bit later and at that point i ha
On Sat, 15 Nov 2003, allan juul wrote:
> On Saturday, Nov 15, 2003, at 16:36 Europe/Copenhagen, John J Lee wrote:
[...]
> > How did you kill the process? If you kill -kill it in Unix, then Perl
> > won't have a chance to run the code to save your cookies.
> >
> &g
Somebody is subscribed with an old address, apparently. Every time I post
here, I get this:
-- Forwarded message --
Date: Sat, 15 Nov 2003 23:37:50 +0800
From: Postmaster <[EMAIL PROTECTED]>
To: John J Lee <[EMAIL PROTECTED]>
Subject: AutoReply Reminding Mess
On Sat, 15 Nov 2003, allan juul wrote:
[...]
> i have tried something like
>
> $robot->cookie_jar( {
> autosave => 1,
> file => 'cookie.lwp'
> }
> );
>
>
> then i have tried to print out the cookies i get before i kill the
> robot process and after i re-start the robot an
On Wed, 12 Nov 2003, Haroon Rafique wrote:
[...]
> To have SSL capability you need either one of the following 2 modules on
> your Windows 2003 Server machine:
>
> Crypt::SSLeay
> IO::Socket::SSL
or the stuff from Johnny Lee (no, not me, we just happen to have similar
names), which only depends on
On Sat, 8 Nov 2003, Alexandre Loureiro wrote:
[...]
> I´ve talked to support of the site and they gave some hints..
[...]
> - Then I´m redirected to a Login.php Script that gets my information in a
> LDAP Server
If that really is the information you need, why not start and end right
there? U
On Wed, 5 Nov 2003, David Busby wrote:
[...]
> works on another similar site. So the question is not how to handle this in
> LWP but what other methods would these folks use for redirecting me that
> will work in IE and Netscape but not LWP.
Embedded script (JavaScript, usually) or Refresh redire
On Sun, 2 Nov 2003, Alexandre Loureiro wrote:
> I´m new to perl and using some books I´ve done some simple scripts to
> make things easier. But now I´m having some problems login into a web
> site ( secure using php login). I can log into the site but to navigate
> futher i need a cookie to send s
...just to add: I wasn't implying that Rod *is* doing something he
shouldn't. I have no knowledge about the site in question or the
motivations of the people involved.
John
On Thu, 30 Oct 2003, Poly wrote:
> This reminds me of a script that was supposed to post news articles
> somewhere until the site decided to keep hackers and automatic scripts
> out by inserting a session ID in an intermediary form...
Most of the time this stuff isn't an attempt to keep people ou
On Thu, 30 Oct 2003, Roderick A. Anderson wrote:
[...]
> There is a intermediate document being returned with an onLoad attribute
> in the body tag to automagically submit the new form.
>Needless to say this causes my script to fail and the as_string method
> doesn't include the original form a
On Thu, 23 Oct 2003, Abhishek jain wrote:
[...]
> I am a B.Tech student and as a part of college project I have to make a
> program that is able to send sms using www.sms.ac website.I tried to
> make the project myself but I am having some cookies problem. The site
> is not accepting the cookies ge
On Fri, 17 Oct 2003, Roderick A. Anderson wrote:
[...]
> my $res = LWP::UserAgent->new->request($form->click);
>
> Are there any methods to search $res (which contains another form) to pull
> out specific inputs that have been returned?
[...]
Just do another HTML::Form->parse() on the response dat
On Wed, 15 Oct 2003, Roderick A. Anderson wrote:
> Back again. Getting more things solved but I can't find _anything_ on how
> to build a HTML document in memory. All I really need is a method to POST
> to a page with data I already have collected. I know the form inputs
[...]
HTTP POST does n
On Wed, 15 Oct 2003, Gisle Aas wrote:
[...]
>2
[...]
> Of the browsers I have here Mozilla displays "2" for the second value
> while konqueror shows "x". I guess my question is what MSIE shows?
2! Damn.
(For IE 5 -- has there been any standards-compliance effort with IE 6? I
certainly doub
On Wed, 15 Oct 2003, John J Lee wrote:
[...]
> seen them 'in the wild', though :-( Probably I should strip tags even for
> OPTION element contents...
[...]
Hmm, I guess both our parsers do that naturally, anyway. :-)
John
On Wed, 14 Oct 2003, Gisle Aas wrote:
> John J Lee <[EMAIL PROTECTED]> writes:
[...]
> I could potentially let there be multiple 'value_names' for a single
> value, but I could also just let an explicit label override the option
> as per spec. Is this something that r
On Tue, 14 Oct 2003, John J Lee wrote:
[...]
> OK. Did you notice that both the value and the label of OPTION default to
> the contents (eg. Female here), according to the HTML 4 spec? In my
[...]
Just to be clear, OPTION actually has a label attribute, unlike INPUT
(which needs a special
On Tue, 14 Oct 2003, Gisle Aas wrote:
> John J Lee <[EMAIL PROTECTED]> writes:
[...]
> Yes. If a form contains:
>
>
>Female
>Male
>Unknown
>
>
> Then the values that this field might take becomes "F", "M" a
On Tue, 14 Oct 2003, Gisle Aas wrote:
[...]
> HTML::Form's dump now also print alternative value names.
What does 'alternative value name' mean? Is this something to do with
OPTION element contents (this bit) and labels?
> HTML::Form will now pick up the phrase after a
> or and us
On Mon, 13 Oct 2003, Roderick A. Anderson wrote:
[...]
> Well I figured WWW:Mechanize was the ticket but I am now up against the
> wall. I can't figure out how to take the returned page and get just that
> field's value. All the examples I've found of using Mech are looking for
> non-form informa
On Fri, 10 Oct 2003, Thurn, Martin wrote:
> What's probably happening is that you have cookies enabled, but you're not
> sending any. You have to GET the cookie from the search FORM page, in order
> to SEND the cookie back with your POST of the query.
Probably right.
Jonathan: remember that M
On Wed, 24 Sep 2003, Christophe Chisogne wrote:
> Just the same patch as in my previous post, but with a more
> correct mime type ;-)
>
> I feel like excite.com sends bad cookies.
> "Set-Cookie: uu=i=213.193.180.194-1064396543121MJ;; ..."
> the double ';' is preceded by 2 '=' in the same
> 'name=v
On 19 Sep 2003, Gisle Aas wrote:
[...]
> The current behaviour is based on what made sense to me, not on how
> stuff actually works in other apps on Windows. Anybody know a place
> that describes the de-factor rules for file: URLs on Windows?
[...]
Probably a useless snippet: apparently both ':'
On Wed, 3 Sep 2003, wendy soros wrote:
[...]
> What I am trying to do is very similar to the
> ABEBooks.com example in Burke (p.74): use the POST
> method to submit some parameters to a form and save
> the response to a local file. As done in the example,
> I first got the name-value pairs of the f
On 25 Aug 2003, Gisle Aas wrote:
> Mark Stosberg <[EMAIL PROTECTED]> writes:
>
> > I would like to see an extension to this part of the interface which
> > allows one to treat single and multiple SELECT lists the same way. In
> > the current situation calling the same command can result in dealing
On Thu, 21 Aug 2003, Patrick Collins wrote:
[...]
> If you control the webserver you could try upgrading to Apache2 and
> using mod_dav. The Webdav protocol allows you to get parsable directory
> listings from which you can then download whatever files you choose.
[...]
And no doubt there's a modu
On Thu, 7 Aug 2003, [EMAIL PROTECTED] wrote:
> Having a potential TreeBuilder memory problem when using it to parse
> through a large HTML table (> 2K rows) where the memory allocation grows to
> about 20M on my server and never goes down even after finishing with the
> HTML and TreeBuilder struct
On Tue, 5 Aug 2003, Andrea Tasso wrote:
[...]
> and lynx is short and with a
On Mon, 21 Jul 2003 [EMAIL PROTECTED] wrote:
[...]
> I'm setting the HTTPS_CA_DIR and HTTPS_CA_FILE environment
> variables as described in the documentation.
> - Something else strange - when I don't(!) set the two
> environment variables, then I can access both sites(!!). The
> warning "C
On Mon, 14 Jul 2003, Octavian Rasnita wrote:
[...]
> After downloading an HTML page, what modules can I use to read the cell 4
> from the fifth row of a table, if that table is placed in another table in
> the cell 2 of the row 3?
[...]
http://theoryx5.uwinnipeg.ca/CPAN/data/HTML-TableExtract/HTML
On Thu, 10 Jul 2003, John J Lee wrote:
[...]
> Another way is to use libtidy (a new shared library-ized HTMLTidy, with
[...]
Actually, it's called tidylib, not libtidy:
http://tidy.sourceforge.net/libintro.html
John
On Wed, 9 Jul 2003, Reinier Post wrote:
> On Tue, Jul 08, 2003 at 03:34:12PM +0100, Richard Lamb wrote:
> > working out a means of stripping HTML tags (via the DOM interface, which
> > [...]
> I have only tried HTML::TreeBuilder (not DOM, but the same principle;
> uses heuristic HTML parsing and p
On Wed, 9 Jul 2003, Jonathan Daigle wrote:
[...]
> $inref->{dbh}->prepare(qq| SELECT * from affiliate_account|); # for
[...]
This looks like web server code. This list is for discussion of web
client code.
John
On Wed, 9 Jul 2003, Terry wrote:
[...]
> If a page has frames, the main page gets returned, not
> the frames. Another request has to be made or
> something to get the contents of the frame. How do I
> go about that?
[...]
The same way you grabbed the main page? Just parse out the URLs, and
fetc
On Tue, 8 Jul 2003, Terry wrote:
> I am using HTTP::WebTest to do site logon simulation.
> However, there are browser checks on some websites.
> For example, some require that the user-agent (client)
> supports frames, tables, and vbscript, is there a way
> to 'trick' the server into thinking my c
On Mon, 23 Jun 2003, Alan Olegario wrote:
> I tried checking what headers are being sent with ethereal, but it looks
> like I can't get the info since it's going over https and being
> encrypted.
[...]
There are several solutions to that. Look at the message I posted here a
week or two ago for d
On Sat, 21 Jun 2003, Matthew Darwin wrote:
> The LWP behaviour looks like a security problem to me.
>
> For example, davin.ottawa.on.ca is not related to flora.ottawa.on.ca
> So if one sets a cookie the other site can get it?
> Very bad.
>
> Canadian domains are in the form ...ca
> or ..ca or .ca
On Fri, 20 Jun 2003, Alan Olegario wrote:
[...]
> HTTP::Cookies::extract_cookies: Set cookie SMSESSION => [cookie info]
> HTTP::Cookies::extract_cookies: Set cookie FORMCRED =>
> HTTP::Cookies::extract_cookies: Set cookie EntFXSessionR => [cookie info]
> HTTP::Cookies::extract_cookies: Set cookie L
1 - 100 of 138 matches
Mail list logo