Re: Help---Trouble Installing HTML-Parser-3.58

2008-11-27 Thread Reinier Post
w the answer to, but the defaults will usually work, and there's an option to keep it from asking. > -- > Radek -- Reinier Post TU Eindhoven

Re: UNIX - Installing Crypt::SSLeay

2008-02-15 Thread Reinier Post
On Wed, Feb 13, 2008 at 12:46:15PM -0500, David Moreno wrote: > I think that, from the paths he pasted, it's a Sun4 Solaris. In that case, the answer is probably to install a C compiler. Last time I used Solaris, it didn't come with a C compiler, but Sun offered CDs with optional additional packa

Re: About LWP in general??

2006-09-12 Thread Reinier Post
e any of them at the moment, so I cannot give you more information. > thanks > Chris -- Reinier Post TU Eindhoven

Re: [Crypt::SSLeay] compile problems on Solaris

2005-11-29 Thread Reinier Post
On Wed, Nov 23, 2005 at 11:31:06AM +0100, Barden, Tino wrote: > Hello, > > I have tried to compile Crypt-SSLeay-0.51 on a Solaris 9 machine and got the > following errors: > > UZKT3 # perl Makefile.PL > Found OpenSSL (version OpenSSL 0.9.8) installed at /usr/local/ssl > Which OpenSSL build path

Re: HTML::Parser bug

2005-03-21 Thread Reinier Post
On Sun, Mar 20, 2005 at 01:51:25PM -0800, Bill Moseley wrote: > On Sun, Mar 20, 2005 at 06:02:26PM +0300, [EMAIL PROTECTED] wrote: > > Hello libwww, > > > > using it to parse html-forms etc... > > noticed, that it recognizes strange comment > > like

Re: WWW::Mechanize & caching

2005-03-09 Thread Reinier Post
On Fri, Feb 25, 2005 at 10:42:43AM +1000, Robert Barta wrote: > On Thu, Feb 24, 2005 at 11:07:00PM +0100, Reinier Post wrote: > > On Mon, Feb 21, 2005 at 08:27:38AM +1000, Robert Barta wrote: > > > Hi all, > > > > > > I hope I did not miss an obvious solution t

Re: WWW::Mechanize & caching

2005-02-24 Thread Reinier Post
On Mon, Feb 21, 2005 at 08:27:38AM +1000, Robert Barta wrote: > Hi all, > > I hope I did not miss an obvious solution to the following: > > I want a *caching* version of WWW::Mechanize. Why don't you just use a caching proxy server? Squid? -- Reinier Post

Re: HTML::TreeBuilder/HTML::Parser - problem parsing tables

2004-04-06 Thread Reinier Post
all (newspaper and magazine) > sites tested have this problem, so that it renders the script virtually > useless. Can you post a *minimal* HTML fragment that exhibits the problem? > Greetings > > Neven -- Reinier Post TU Eindhoven

Re: Can't locate object method "host" via package "URI::_foreign"

2003-09-03 Thread Reinier Post
On Tue, Sep 02, 2003 at 12:29:16PM +0400, Siddhartha Jain(IT) wrote: > Sorry, the input being given to the $uri-host method was erroneous. > > Again, sorry for the false alert!! Comment: URIs exist that are valid, but do not have a host part, and you will have this problem then, so it is a good i

Re: TreeBuilder cgi memory problems

2003-08-14 Thread Reinier Post
On Fri, Aug 08, 2003 at 12:43:16AM +0100, John J Lee wrote: > On Thu, 7 Aug 2003, [EMAIL PROTECTED] wrote: > > > Having a potential TreeBuilder memory problem when using it to parse > > through a large HTML table (> 2K rows) where the memory allocation grows to > > about 20M on my server and never

Re: HTML-Parser module issue

2003-07-15 Thread Reinier Post
On Mon, Jul 14, 2003 at 05:55:09PM -0700, Hieu Nguyen wrote: > my opertion system # sun solaris version 9 > perl version: v5.6.1 built for sun4-solaris-64int (come w/ op system) > gcc compiler version: gcc-2.95.3-sol8-sparc-local > my quetion is when I tried to install the perl module > (HTML-Parse

Re: Help needed

2003-07-15 Thread Reinier Post
On Mon, Jul 14, 2003 at 11:10:26PM +0200, Carsten Kruse wrote: > Hi Teddy, > > if you know about the structure of the html page you should > try the functions of HTML:TokeParser package. [very nice example omitted here for brevity] I have used HTML::TreeBuilder in some of my scripts. If the HT

Re: Help needed

2003-07-14 Thread Reinier Post
On Mon, Jul 14, 2003 at 03:43:28PM +0530, Alpesh Thakar wrote: > Hi, > > I would like to read HTML file (which includes tabular formate) using > perl and i would like to extract the field from it. > > So would you please give me the link or the material which shows some > examples that perl scr

Re: HTML parsing

2003-07-09 Thread Reinier Post
On Tue, Jul 08, 2003 at 03:34:12PM +0100, Richard Lamb wrote: > Hi folks, > I'm Richard Lamb, and I'm a Perl virgin. Just getting to know the > language. I'm in the midst of an MSc in Computing in Manchester (UK), > working out a means of stripping HTML tags (via the DOM interface, which > I'm tryi

Re: URI Bug?

2003-03-04 Thread Reinier Post
On Sun, Nov 24, 2002 at 02:56:56PM -0800, Randal L. Schwartz wrote: > > "Gisle" == Gisle Aas <[EMAIL PROTECTED]> writes: > > >> I'd be happy to learn of a better way to do this, and I'm sure my list > >> is not complete anyway. > > Gisle> I suggest either: > > Gisle> $url->host(ip_addres

Re: Help! how is this called?

2002-12-02 Thread Reinier Post
successful most of > the time. > > Other than issues with the URI above, you should have minimal problems. Right now I have the problem that Apache 2 won't feed URLs to script.php (in my case it's a PHP script) if they have an extra path. But this is just one of my regular quarrels with the Apache configuration file mess, I expect it can be done somehow. -- Reinier Post TU Eindhoven

Re: Cygwin: name of an lwp-request script (HEAD) clashes with /bin/head

2002-09-30 Thread Reinier Post
On Fri, Sep 27, 2002 at 06:31:41PM -0400, Igor Pechtchanski wrote: > Hi, > > When installing LWP through CPAN, the installation creates the GET, HEAD > and POST scripts in /usr/bin. That's why it explicitly asks you if you want to install them. > So, to accommodate the users of Cygwin and other

Re: can't load host method in URI package

2002-09-01 Thread Reinier Post
On Wed, Aug 28, 2002 at 01:08:41PM -0400, Thurn, Martin (Intranet) wrote: > > > LWP::RobotUA=HASH(0x83d7994) GET 1 ...Can't locate object method "host" > > > via package > > > "URI::_generic" (perhaps you forgot to load "URI::_generic"?) at > > That's the message you get when your URL does not ha

Re: can't load host method in URI package

2002-08-28 Thread Reinier Post
On Fri, Aug 09, 2002 at 03:01:54PM +0500, Ken Munro wrote: > Hi. > > I am trying to write a simple robot that reads urls from a text file. > The source is listed below. I am getting an error that says: > > LWP::RobotUA=HASH(0x83d7994) GET 1 ...Can't locate object method "host" > via package > "U

Re: Fw: Can't navigate to URL after login

2002-08-22 Thread Reinier Post
On Tue, Aug 06, 2002 at 10:26:01AM -0500, Kenny G. Dubuisson, Jr. wrote: > Tried that (referer => ...) with no luck. I did find that I can navigate > several pages in using sequential $browser->request calls but the page that > finally fails has the hyperlinks to the next page calling javascript

Re: libwww only as root

2002-08-01 Thread Reinier Post
On Wed, Jul 17, 2002 at 12:44:39PM -0600, Keary Suska wrote: > on 7/17/02 12:13 PM, [EMAIL PROTECTED] purportedly said: > > > If you're having a problem with running your scripts from cron, the answer > > is usually in your PATH environment variable or working directory - cron > > tends to run wi

Re: Fetching big files

2002-06-03 Thread Reinier Post
On Wed, May 29, 2002 at 06:20:58PM +0300, evgeny tsurkin wrote: > > Hi! > The problem I have: > I am fetchng big files from the web that are created on the fly. > Befor actually fetching them I would like to know what is the size > it is going to be. > I am not sure that is possible ,but if

Re: HTTP status line

2002-04-27 Thread Reinier Post
On Thu, Apr 25, 2002 at 02:41:09PM -0700, Vicki Brown wrote: > I finally got plain text to work but I still can't get JPEGs to work - the > bit stream I get back is what I expect it to be... I am setting the > Content-Type header (easier said than done). I am sending a few other > Content- header

Re: Authentication

2002-03-29 Thread Reinier Post
On Fri, Mar 22, 2002 at 05:05:49PM -0600, Damian Kohlfeld wrote: > I have a situation where I have a webpage that has a list of links to > other web sites. They do the following: > > Login to my website. > My website sends assigns them a cookie using libwww perl. > They see the list of links and

Re: hi

2002-03-06 Thread Reinier Post
On Tue, Mar 05, 2002 at 06:54:41AM -0800, Randal L. Schwartz wrote: > >>>>> "Reinier" == Reinier Post <[EMAIL PROTECTED]> writes: > > Reinier> On Mon, Mar 04, 2002 at 04:33:37PM +0530, kavitha malar wrote: > >> I want to search a text in

Re: hi

2002-03-05 Thread Reinier Post
On Mon, Mar 04, 2002 at 04:33:37PM +0530, kavitha malar wrote: > I want to search a text in a website how to do that through perl. perl -MLWP::Simple -e \ 'getprint http://www.google.com/search?q=$word+site:$site' I'm serious. (This is what I use to find my own pages.) -- Reinier

Re: hi

2002-03-05 Thread Reinier Post
On Mon, Mar 04, 2002 at 02:57:30PM +0530, kavitha malar wrote: > perl -MLWP::Simple -e 'getprint "http://www.yahoo.com";' > 400 Bad Request http://www.yahoo.com> > > anybody knows why this error is happening . It isn't here. Try setting $http_proxy or something. > --jude -- Reinier

"it is not working" (was: hi)

2002-02-20 Thread Reinier Post
On Wed, Feb 20, 2002 at 03:44:35PM +0530, kavitha malar wrote: > perl -MLWP::Simple -e 'getprint "http://www.compaq.co > m"' > > I have executing this command in Linux machine .But it is not working . Any one >knows why it is happening . It works for me. It helps to post the error message.

Re: installing libwww on solaris

2002-02-15 Thread Reinier Post
On Thu, Feb 14, 2002 at 02:47:19PM +0200, Afgin Shlomit wrote: > > I try to install libwww on solaris and first the 'make test' dont past > okay - I get : > robot/uaPerl lib version (v5.6.1) doesn't match executable > version ( > 5.00503) at /usr/local/lib/perl5/5.6.1/sun4-solaris/Con

Re: More active list?

2002-02-15 Thread Reinier Post
On Tue, Feb 12, 2002 at 01:58:36PM -0500, Eric Frazier wrote: > I don't want to sound demanding, but I am getting desparate to solve this > problem and move on to some other horrors :) Have you tried the comp.lang.perl.* newsgroups? -- Reinier

Re: Need Help On Using LWP::Simple

2002-02-01 Thread Reinier Post
On Wed, Jan 23, 2002 at 02:39:54PM -0500, Weifeng Wang wrote: > Hi There, > > Thank you very much for all the responses on my previous question on > installation of libwww module. It looks like that it is hard to compile a > module without cc compiler when the original Perl was built with cc. I

Re: LWP and different ports

2002-01-16 Thread Reinier Post
On Wed, Jan 16, 2002 at 03:07:27PM -0800, Daniel Harrison wrote: [...] > It no longer worked. I added a line to print out the value of $raw_url > and it was in the correct format - (192.168.11.122:8000/whatever.html) 192.168.11.122:8000/whatever.html is not a URL. > Any suggestions? Try p

Re: Double slash in a URI: legal or not?

2002-01-06 Thread Reinier Post
On Sun, Jan 06, 2002 at 08:41:54AM -0800, Randal L. Schwartz wrote: > > "Hans" == Hans De Graaff <[EMAIL PROTECTED]> writes: > > Hans> RFC 2396 seems to indicate that in path segments only a single slash > Hans> is legal, > > I'm not sure where you get that. My reading of the BNF: > >

Re: Fixing opening/closing tags.

2002-01-06 Thread Reinier Post
arser solution because this will be running under mod_perl so > I'd like to avoid brining in the HTML::Parser module(s) to save memory. > Minor issue, though. This also depends on how clean your input HTML is. Running it through HTML::Parser/::TreeBuilder will enfocre *their* ideas of what HTML should look like. You may also want to have a look at HTML::PrettyPrint for related code. > Thanks, > > -- > Bill Moseley > mailto:[EMAIL PROTECTED] -- Reinier Post TU Eindhoven

Re: HTML::Parser question

2001-10-30 Thread Reinier Post
On Mon, Oct 29, 2001 at 10:00:23PM -0600, ADJE WebMail Technical Support Team wrote: > Question: How do I extract the plain text from an HTML file, or, put > another way, how do I remove the html markups, just leaving the plain > text? I have looked at the example provided in HTML::Parser, in > p

Re: Minor bug in request()

2001-10-22 Thread Reinier Post
> >Why? I know this has been argued extensively elsewhere; see e.g. > > > > http://pppwww.ph.gla.ac.uk/~flavell/www/post-redirect.html > > > >Possibility 1 mentioned there is common enough to add support for it. > > > >-- > >Reinier > > > This link goes nowhere -- is the site down? The corre

Re: Minor bug in request()

2001-10-17 Thread Reinier Post
On Fri, Oct 12, 2001 at 02:21:09PM -0600, Sean M. Burke wrote: > At 09:30 PM 2001-10-09 +0200, Eyal wrote: > >[...] > >In reality (which means - how IE does it, can't argue with the > >majority), after receiving a redirection to a POST, > >IE issues a GET to the new location. > >[...] > > I hereb

Re: ODBC to MS SQL 7/2000

2001-09-24 Thread Reinier Post
On Thu, Sep 20, 2001 at 05:50:39PM -0400, Hawk wrote: > Hi, > > I have been assigned the task of writing perl scripts from a Linux box to > connected to a MS SQL 7/2000 server. Are there routines and modules > already built for this? Yes. % perl -eshell -MCPAN cpan shell -- CPAN exploration

Re: problems installing the modules

2001-08-24 Thread Reinier Post
On Thu, Aug 23, 2001 at 01:46:47PM +0300, Yair Lapin wrote: > Hi, > > I'm trying to install the libwww modules in a sparc server with solaris 2.8 > and the most of them I can't compile I get the following > Error message: > > cc -c -xO3 -xdepend-DVERSION=\"3.25\" -DXS_VERSION=\"3.25\" -K

Re: HTML::Parser - Extracting out the text from

2001-07-09 Thread Reinier Post
On Mon, Jul 02, 2001 at 11:17:00AM -0700, Bill Moseley wrote: > Hello, > > I need to extract text out of html docs to do search word highlighting in > context. (You know, like google's output.) > > So, is there a "fastest" method to do this -- better than just using > HTML::Parser, setting a fl

Re: redirects and javascript

2001-07-09 Thread Reinier Post
On Mon, Jul 02, 2001 at 08:51:42AM -0400, fred whitridge wrote: > I have inelegantly solved my problem by loading the page with the > javascript reference into Excel and then snagging the executed result. > There has to be a better way to do this, altho' this one works. LWP doesn't support Javasc

Re: LWP::RobotUA-recurse?

2001-07-09 Thread Reinier Post
On Fri, Jun 29, 2001 at 10:44:20PM +0200, Simon Dang wrote: > Hi, I am a newbie with LWP. > > Does LWP::RobotUA run recursively by default? > > If not, is there method that I can call within UA to > set this to run recursively? > > I have searched the docs within LWP::RobotUA, but > there is

Re: Question

2001-07-08 Thread Reinier Post
On Sun, Jul 08, 2001 at 04:03:46PM -0700, Jason Whitlow wrote: > I am trying to get one of my apps to display only 5 records at a time. With > Perl attaching to a mysql database. Does anyone have any good Ideas of how > to do this. Yes, Perl can do this (check the DBI documentation), and the SQL

Re: Automated FORM posting

2001-05-16 Thread Reinier Post
forms have to be without Javascript. > > target="_top"ACTION="http://someserver.com/SendForm.htm";> > > > > > > > > -- Reinier Post

how to disable automatic redirect (was: Newbie Question)

2001-05-16 Thread Reinier Post
On Tue, May 15, 2001 at 11:48:54AM -0400, Jean Zoch wrote: > Hello all, > > I am developing a utility that needs to grab the HTML code from web pages. > To do this I am using: > > my $url = 'http://www.theURLiWant.com'; > > use LWP::Simple; > $content = get($url); > > This works great, but I a

Re: LWP::RobotUA problem

2001-04-24 Thread Reinier Post
On Tue, Apr 24, 2001 at 09:47:21AM -0700, Gisle Aas wrote: > > 234c234,235 > > < my $netloc = $request->url->host_port; > > --- > > > my $ru = $request->url; > > > my $netloc = $ru->can('host_port') ? $ru->host_port : $ru->host; > > Not all URIs have a 'host' method either. I think

LWP::RobotUA problem

2001-04-24 Thread Reinier Post
Hello Gisle and all, I use the following local patch to LWP::RobotUA : % perldoc -m LWP::RobotUA.pm | diff - lib/LWP/RobotUA.pm 1c1 < # $Id: RobotUA.pm,v 1.17 2000/04/09 11:21:11 gisle Exp $ --- > # $Id: RobotUA.pm,v 1.171 2000/08/09 11:25:23 cvs Exp $ 7c7 < $VERSION = sprintf("%d.%02d", q$Rev

Re: WARNING: default LWP install on Mac OS X is bad, Bad, BAD

2001-04-19 Thread Reinier Post
On Thu, Apr 19, 2001 at 09:21:27AM +0200, Reinier Post wrote: > On Tue, Apr 17, 2001 at 06:42:00PM -0700, Joseph N. Hall wrote: > > A default install of LWP on the currently shipping Mac OS X places HEAD in > > /usr/bin/HEAD. Unfortunately this is a case-insensitive filesystem. S

Re: WARNING: default LWP install on Mac OS X is bad, Bad, BAD

2001-04-19 Thread Reinier Post
On Tue, Apr 17, 2001 at 06:42:00PM -0700, Joseph N. Hall wrote: > A default install of LWP on the currently shipping Mac OS X places HEAD in > /usr/bin/HEAD. Unfortunately this is a case-insensitive filesystem. So long > /usr/bin/head. I discovered this as I was preparing to install Xfree86 and

Re: considering HTML::Element's $tree->extract_links

2001-02-27 Thread Reinier Post
On Sat, Feb 24, 2001 at 05:11:02PM -0700, Sean M. Burke wrote: > Some clever person wrote me earlier this month and suggested adding a > feature to HTML::Element's extract_links method; and I want to > run it past people who actually use the current method's behavior. Count me in. > What the per

Re: / and DirectoryIndex

2001-02-21 Thread Reinier Post
On Wed, Feb 21, 2001 at 04:42:20PM +0700, John Indra wrote: > Hi all... > > How do I tell my user-agent (an LWP::UserAgent object) to NOT download both > / and index.html or whatever remote sites DirectoryIndex set to? > Example, my user-agent sees 2 link: > - http:://www.domain.com/ This :: not

Re: Off topic question

2001-01-23 Thread Reinier Post
On Mon, Jan 22, 2001 at 08:57:20AM -0800, [EMAIL PROTECTED] wrote: > I know this is off topic, but can some perhaps point me to a > resource online that shows how you can load a perl module into > your local cgi-bin and use it locally. I'm running into a case of a host > admin that refuses to i

Re: Install, Again

2001-01-11 Thread Reinier Post
On Tue, Jan 09, 2001 at 09:00:58AM -0500, Alliance Support wrote: > # perl -e "use LWP::Proxy" > Can't locate LWP/Proxy.pm in @INC (@INC contains: > /usr/local/lib/perl5/5.00502/sun4-solaris > /usr/local/lib/perl5/5.00502 > /usr/local/lib/perl5/site_perl/5.005/sun4-solaris > /usr/local/lib/perl5

Re: problems with LWP::UserAgent

2000-12-07 Thread Reinier Post
or redirect_ok. > If the number is something else > than 0 then the page should simply be returned as now. It would be nice to also have the option to have it refreshed anyway. It would even be possible to refresh after the specified # of seconds, with sleep(). refresh_ok ? refresh_immediately_if_faster_than(10) ? > --Gisle -- Reinier Post TU Eindhoven

Re: problems with LWP::UserAgent

2000-12-06 Thread Reinier Post
ike in follow HTML example? > > > > > > Contrary to what you seem to believe, this is not a HTTP redirect. It isn't handled by the redirect_ok setting. I don't think LWP offers supportfor automatic refreshes. -- Reinier Post

HTML::TreeBuilder moves initial 'special' elements to the end

2000-11-30 Thread Reinier Post
it?) that declarations in the preamble (before the "html" start-tag) end up actually under the "html" element. As noted in the code, this also applies to store_comments(). And yes, I need it fixed! A haphazard patch is attached. I am now working on HTML::Pre

Re: Javascript interpreter?

2000-11-28 Thread Reinier Post
that doesn't support user interaction and only allows very simple manipulations on values; this could be as declarative as HTML and it would probably be useful for most of the Javascript out there. -- Reinier Post [EMAIL PROTECTED]

Re: URI::Heuristic

2000-11-27 Thread Reinier Post
oes another stupid thing ... > it tries: > 1. rdsnet.ro ... fails > 2. www.rdsnet.ro.com ... fails > 3. www.rdsnet.ro.org/net/mil ... etc ... all of them fail ... > 4. host not found ... > (but OFC, www.rdsnet.ro exists and is up and alive, too bad no one ask for > his name ...) > > what do you say ... ? Send a bug report to lynx-dev :) -- Reinier Post TU Eindhoven

Re: Links not extracted!

2000-11-27 Thread Reinier Post
On Wed, Nov 22, 2000 at 04:29:45PM +0800, Tan Joo Geok wrote: > > Hello, > > I am using the lwp-request sample program that comes with the libwww distribution to >parse the content of various web sites. I noticed that for certain web sites, the >links fail to be extracted by lwp-request, e.g:

Re: Error downloading file.

2000-11-27 Thread Reinier Post
On Mon, Nov 20, 2000 at 04:08:56PM +, Stuart Grimshaw wrote: > I'm trying to download a file from an FTP server using libwww, but when I > actually try and get the file I get the error "cannot chdir to /home" > > The file I'm trying to download is /home/stuart/logs/today.log > > If I log in

Re: libwww-perl install

2000-11-15 Thread Reinier Post
On Wed, Nov 15, 2000 at 09:30:12AM +0100, Bence Fejervari wrote: > > Hi! > > Yesterday I tried to install the libwww-perl package from .tar.gz file, > but when I made make test, it gave me 16 errors out of 22. > I attached all the output information. LWP depends on some other libraries that you

Re: Howto upload a file?

2000-11-13 Thread Reinier Post
On Fri, Nov 10, 2000 at 07:03:36PM +0100, Tobias Manthey wrote: > Hi all, > I am trying desperately but I dont get it. > Who can provide me a simple example how to upload a file? I use the HTTP::File upload() method. It works well. -- Reinier

Re: LWP - HELP !!

2000-11-07 Thread Reinier Post
On Tue, Nov 07, 2000 at 11:43:55AM +0530, Ravi wrote: > Hello Friends, > > Can, anybody Please tell me why this error message is coming when I try to run my > perl file. > > Can't locate HTML/HeadParser.pm in @INC (@INC contains: /usr/lib/perl5/5.00503/i > 386-linux /usr/lib/perl5/5.00503 /usr/l

Re: Perl script

2000-11-06 Thread Reinier Post
ny others are denied as well. See your webserver software's documentation for details. -- Reinier post

Re: People Online

2000-11-06 Thread Reinier Post
On Mon, Nov 06, 2000 at 10:20:35AM -0800, Jason Whitlow wrote: > How are ya, > > I just need a little assistance. I need to know how it tell who is > currently longed on to my site. There is no such thing as being 'logged on to a site'. HTTP is stateless: it operates by fetching URLs from webs

Re: One Doubt !!!

2000-11-06 Thread Reinier Post
y modules > ... useing which , we can do this job ... Definitely; I use LWP and Image::Magick to dynamically manipulate images at the time they are requested by the browser. You'll need a detailed plan on how your software is going to work though, and that is something you'd better design by

Re: LWP::Simple

2000-10-26 Thread Reinier Post
On Thu, Oct 26, 2000 at 10:02:22AM -0400, [EMAIL PROTECTED] wrote: > I am new to the list and relatively new to Perl. I joined the list > with a specific question in mind. Here it is: > > I have been using LWP::Simple to download web pages. I recently moved > my scripts to a new serv

Re: Logging on to a Website using libwww

2000-10-26 Thread Reinier Post
On Thu, Oct 26, 2000 at 11:30:09AM +0200, Dirk Treusch wrote: > I have disabled cookies in Internet Explorer just for a test and I > was still able to log in to the website. However I will try to enable > my code for cookies. I hope I will succeed. As you say, prhaps you should find out if th

Re: Logging on to a Website using libwww

2000-10-26 Thread Reinier Post
e forms on > some websites. However I have not been able to log in to > http://www.finanztreff.de Your Perl code looks fine, but when I look att he source of this page, I see: I haven't studied the details, but you probably need to add cookie handling on the LWP end. This is supported. -- Reinier Post

Re: filtering uploaded files

2000-10-19 Thread Reinier Post
On Thu, Oct 19, 2000 at 11:34:31AM -0700, Gisle Aas wrote: > > Does LWP also support that other method (multi-part/form-data, every form > > attribute being a separate part)? > > I don't understand. This is multi-part/form-data. What is wrong with > it? My testing :-( You're right, it works

Re: filtering uploaded files

2000-10-19 Thread Reinier Post
TTP::File::upload at the receiving end, bcause the file content is transmitted in a different way. Does LWP also support that other method (multi-part/form-data, every form attribute being a separate part)? > Regards, > Gisle -- Reinier Post [EMAIL PROTECTED]

Re: Extending HTML-Parser

2000-10-10 Thread Reinier Post
% } else { %> >false > <% } %> I use HTML::Mason to do this, would it be possible to unite forces? Having one parser for this mechanism could benefit both development (you) and users (me). -- Reinier Post

Re: Redirects with javascript

2000-10-03 Thread Reinier Post
On Thu, Sep 28, 2000 at 06:07:45PM -0300, Anderson Marcelo wrote: > Please, > > How make for the "result.html" content the > redirect of the "index.html" ?? > > The page (index.html) content this: > window.location="/test.shtml" LWP doesn't include a Javascript engine; you'll need one to get

Re: HTTP redirects

2000-09-07 Thread Reinier Post
On Thu, Sep 07, 2000 at 04:07:31PM +, Jarrett Carver wrote: > Is there a way to tell if your request has been redirected? i.e is_redirect? perldoc HTTP::Response Look at the previous() method. HTTP return codes are defined in http://www.w3.org/Protocols/rfc2068/rfc2068 -- Reinier

Re: Rget/Parse Implementation?

2000-08-22 Thread Reinier Post
ough I'm not sure where its maintainers are. I posted a patch to this list recently and never got a response. -- Reinier Post

Re: how to convert

2000-08-11 Thread Reinier Post
l -0pe 's/\r\n/\n/g' filename > converted -- Reinier Post

Re: Why does LWP freeze with this URL ?

2000-08-08 Thread Reinier Post
Error: 500 Cache Detected Error So it has nothing to do with LWP. -- Reinier Post

WWW::Robot only retrieves HTML documents

2000-08-07 Thread Reinier Post
need every document, not just HTML. The attached patch adds three Boolean flags to turn off these hardcoded filters. With all three turned on, traversal/retrieval is only limited by the robot exclusion protocol. -- Reinier Post [EMAIL PROTECTED] --- Rob

Re: problem with installing libwww-perl.5.48

2000-08-07 Thread Reinier Post
perl -MCPAN -eshell it handles dependencies for you. -- Reinier Post

Re: WWW::Robot crashing upon encountering file:// URLs (correction)

2000-08-06 Thread Reinier Post
gt;"%s\n",$ag->request(new >HTTP::Request(GET,"file://localhost/home/rp/.cshrc/"))->content' should have been % perl -MLWP::RobotUA -e 'my $ag=new LWP::RobotUA("bugexposer/0.1","reinpos t\@win.tue.nl");$ag->delay(0);printf "%s\n",$ag->request(new HTTP::Request(GET, "file://localhost/home/rp/.cshrc/"))->content' -- Reinier Post

WWW::Robot crashing upon encountering file:// URLs

2000-08-06 Thread Reinier Post
g to figure out why WWW::Robot will only produce HTML URLs despite what the documentation suggests), but this is a clear problem, that seems to call for a patch. A stopgap patch is attached. -- Reinier Post [EMAIL PROTECTED] --- RobotUA.pm.or

Re: "last_modified" problem, help needed

2000-08-03 Thread Reinier Post
> but, if it's a ".php" file, $res->last_modified get > nothing, what's wrong? A .php URL typically points to a PHP script that generates its output on the fly; therefore, a Last-Modified header would be quite useless. -- Reinier Post

Re: Newbie to the list

2000-08-01 Thread Reinier Post
be looking at the WWW::Robot CPAN module right now, and my questions are similar. Can you say how your spider compares to WWW::Robot? Is it more advanced? -- Reinier Post

Re: HTML::Entities module

2000-06-22 Thread Reinier Post
On Thu, Jun 22, 2000 at 09:52:18AM +, marc-andre sauve wrote: > Hi, > > Looking for HTML::Entities perl module % perl -e shell -M CPAN cpan> install HTML::Entities cpan> quit % perldoc HTML::Entities -- Reinier

HTML queries (was: HTML::Element screen-scraping)

2000-05-31 Thread Reinier Post
ith one text node (consisting of cuisine) as a child, and > then looking at its right sister node, which should be text... Yes, but it would be nice to use a HTML-like notation, while employing the benefits of real HTML parsing. > -- > Sean M. Burke[EMAIL PROTECTED]http://www.spinn.net/~sburke/ -- Reinier Post [EMAIL PROTECTED]

Re: handling refreshes

2000-05-23 Thread Reinier Post
On Tue, May 23, 2000 at 04:30:50PM -0600, Steve Rasmussen wrote: > Reinier Post wrote: > ... > > > Do refreshes always have to appear in the html page and the http > > > header, or can they be in either? > > > > Like all tags, only in the . > > > &

Re: handling refreshes

2000-05-23 Thread Reinier Post
ment headers from tags such as this one. > Do refreshes always have to appear in the html page and the http > header, or can they be in either? Like all tags, only in the . -- Reinier Post

Re: Problem deleting nodes with HTML::Element

2000-05-18 Thread Reinier Post
a one-line change in &traverse) but the maintainer didn't accept it, and also completely reimplemented &traverse; I haven't studied if it would be as easy to change in the new implementation. So mark or collect the nodes for deletion, delete them in a second pass. -- Reinier Post

Re: Help!

2000-05-04 Thread Reinier Post
> Question: > Is there any way I can set the absolute url of the response to > http://165.21.42.93/7000OneNumber so that the redirection is successful. Yes, use the abs() method, described in perldoc URI -- Reinier

Re: FRAME support in HTML::TreeBuilder

2000-04-30 Thread Reinier Post
> >How easy would it be to change HTML::TreeBuilder to preserve > >the structure of framed pages? > > Not terribly, but I'll give it a try. In the meantime, yes, try moving > things around to repair the tree. Presumably it's just a matter of finding > the body, finding the frameset under that,

FRAME support in HTML::TreeBuilder

2000-04-27 Thread Reinier Post
is assumption? particularly if anyone points out a case where this is troublesome for them. Obviously, for a proxy, this is a major problem: all framed pages turn out empty, I haven't looked at the code yet; is this coercion invertible? How easy would it be to chang

Re: libwww-perl

2000-04-18 Thread Reinier Post
> perl Makefile.PL > Testing alignment requirements for U32... ld: fatal: library -ldb: not > found > ld: fatal: File processing errors. No output written to u32align > Can't compile test program > > What am I missing? Berkeley DB. Get it from http://www.sleepycat.com/ -- Reinier Post

Re: breaking encapsulation on HTML::TreeBuilder|Element objects

2000-03-27 Thread Reinier Post
On Mon, Mar 27, 2000 at 08:30:12PM -0700, Sean M. Burke wrote: > At 06:41 PM 2000-03-27 +0200, Reinier Post wrote: > >[...] > >| * Does anyone write applications using HTML::Element that break > >| encapsulation on HTML::Element objects? That is, by accessing object > &g

some limitations in using HTML::TreeBuilder to transform HTML

2000-03-27 Thread Reinier Post
of any real drawbacks. In the average case, having to copy the array won't induce much overhead. 3) what's the matter with framesets? s are wrapped within , which causes framed pages to turn up empty, at least in my version of Netscape. Why does this happen? Is it a design decision or ju

Re: patch for HTML::Parser 3.06 to fix declaration commenthandling

2000-03-09 Thread Reinier Post
On Wed, Mar 08, 2000 at 10:21:17PM -0500, la mouton wrote: > this is what I experienced also. Comments like "" get treated > like comments by browsers and HTML::Parser should behave the same way. In other words, HTML::Parser should parse not HTML, but what some browsers think HTML is. -- Reini

Re: MULTI FORM submission

2000-01-24 Thread Reinier Post
module: LWP. -- Reinier Post