Re: how to get source of of web page which is hosted on the server (which I dont have access).

2007-10-30 Thread Tom Phoenix
On 10/29/07, Siva Prasad [EMAIL PROTECTED] wrote:

 I am opening a web page using win32::IEAutomation and clicking each
 link on the page and reading data.(This page has 177 links).

I hope you have permission of whoever owns the web server.

 I tried LWP::Simple and LWP::UserAgent to read the content, But the it
 throws me unable to get the source of the page.

If you're doing what a browser would do, the server would be doing
what the server should do. You might be able to do what you want with
WWW::Mechanize; its FAQ has a section on this problem:


http://search.cpan.org/dist/WWW-Mechanize/lib/WWW/Mechanize/FAQ.pod#My_Mech_program_doesn

 Is there any other way to read the source of the page from the server on
 which I don't have a account?

If the server won't give it to you when you ask, wouldn't taking it be stealing?

Cheers!

--Tom Phoenix
Stonehenge Perl Training

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/




Re: how to get source of of web page which is hosted on the server (which I dont have access).

2007-10-30 Thread Matthew Whipple
Siva Prasad wrote:

 HI Gurus,

  

  

 I am opening a web page using win32::IEAutomation and clicking each
 link on the page and reading data.(This page has 177 links).

  

 The above processes occupy more memory,

Make sure that you're not trying to read all 177 links (especially if
you're descending further down those links) at a time and that memory is
freed or reused upon each return to the top page.  I'd probably also
look for some recursive wget style app to get the source first since
that's quite a bit of network traffic. 

 After some time the page is unable to load and the script gives me
 READY STATE error.

  

 I have used WaitforDone(); but still no luck.

  

 I tried LWP::Simple and LWP::UserAgent to read the content, But the it
 throws me *“unable to get the source of the page”.*

 * *

 I don’t have access to the server on which pages are hosted.

  

 Is there any other way to read the source of the page from the server
 on which I don’t have a account?

  

  

  

  

 Thanks,

 PP

  

  



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/




how to get source of of web page which is hosted on the server (which I dont have access).

2007-10-29 Thread Siva Prasad
 

HI Gurus,

 

 

I am opening a web page using win32::IEAutomation and clicking each link on
the page and reading data.(This page has 177 links).

 

The above processes occupy more memory, After some time the page is unable
to load and the script gives me READY STATE error.

 

I have used WaitforDone(); but still no luck.

 

I tried LWP::Simple and LWP::UserAgent to read the content, But the it
throws me unable to get the source of the page.

 

I don't have access to the server on which pages are hosted.

 

Is there any other way to read the source of the page from the server on
which I don't have a account?

 

 

 

 

Thanks,

PP

 

 

image001.gif