Re: How to get the text of web framed pages?

2005-05-24 Thread Eric Chatonet
Hi Stephen, Thanks for your input. At the moment, I have 2 efficient functions with 2 different approaches for extracting all urls from the text of a framed page: The first one from Ken Ray uses regex with machText and another one I wrote uses items with quote as the item delimiter. Ken's sol

Re: How to get the text of web framed pages?

2005-05-24 Thread Stephen Barncard
I would look for the word frameset in a tag inside a page, then get all the valid URLS inside the frame. Then I would check each URL for size, and pick the largest file, or the number of lines. That will be where the main content is. At 2:10 PM +0200 5/24/05, Eric Chatonet wrote: You are ri

Re: How to get the text of web framed pages?

2005-05-24 Thread Eric Chatonet
Hi Ken, Your mail confirms without any doubt some steps made in this direction by testing with simple logic. As usual, absolutely clear :-) Many thanks! I have now to write a good function to parse the text and return the list of all found urls ;-) Best regards from Paris, Eric Chatonet.

Re: How to get the text of web framed pages?

2005-05-24 Thread Ken Ray
On 5/24/05 7:10 AM, "Eric Chatonet" <[EMAIL PROTECTED]> wrote: > You are right about some frames that can be dowloaded by parsing the > But you are not sure to get the whole contents :-( > I shall give you an example. > The following url: http://www.major-k.de/revstart.html (BTW great > stuff :-)

Re: How to get the text of web framed pages?

2005-05-24 Thread xbury . cs
Eric, I wont give you the answer (too easy) but the discrete browser shows you how in terms of resolving links from the downloaded html to the right clickeable url... ;) Im sure you know where to find the discrete browser (now known as DeepCyberSpace under the TAOO agents section... ;) cheer

Re: How to get the text of web framed pages?

2005-05-24 Thread Eric Chatonet
Hello Xavier, Le 24 mai 05 à 13:48, [EMAIL PROTECTED] a écrit : Hi Eric, Either it's in between tags or ... Note some frames can be downloaded like Hope it's what you meant... You are right about some frames that can be dowloaded by parsing the But you are not sure to get the whole con

Re: # POSSIBLY SPAM #::How to get the text of web framed pages?

2005-05-24 Thread xbury . cs
Hi Eric, Either it's in between tags or ... Note some frames can be downloaded like Hope it's what you meant... cheers Xavier On 24.05.2005 13:39:34 use-revolution-bounces wrote: >Hi everyone, > >I am finishing a plugin that imports web pages as .txt files. >So it can search through the file

How to get the text of web framed pages?

2005-05-24 Thread Eric Chatonet
Hi everyone, I am finishing a plugin that imports web pages as .txt files. So it can search through the files any expression and display the corresponding web page in any browser. Very simple and efficient :-) But I wonder how to retrieve the effective text from framed pages? Any idea? Thanks,