Hi Stephen,
Thanks for your input.
At the moment, I have 2 efficient functions with 2 different
approaches for extracting all urls from the text of a framed page:
The first one from Ken Ray uses regex with machText and another one I
wrote uses items with quote as the item delimiter.
Ken's sol
I would look for the word frameset in a tag inside a page, then get
all the valid URLS inside the frame. Then I would check each URL for
size, and pick the largest file, or the number of lines. That will be
where the main content is.
At 2:10 PM +0200 5/24/05, Eric Chatonet wrote:
You are ri
Hi Ken,
Your mail confirms without any doubt some steps made in this
direction by testing with simple logic.
As usual, absolutely clear :-)
Many thanks!
I have now to write a good function to parse the text and return the
list of all found urls ;-)
Best regards from Paris,
Eric Chatonet.
On 5/24/05 7:10 AM, "Eric Chatonet" <[EMAIL PROTECTED]>
wrote:
> You are right about some frames that can be dowloaded by parsing the
> But you are not sure to get the whole contents :-(
> I shall give you an example.
> The following url: http://www.major-k.de/revstart.html (BTW great
> stuff :-)
Eric,
I wont give you the answer (too easy) but the discrete browser shows you
how
in terms of resolving links from the downloaded html to the right
clickeable url... ;)
Im sure you know where to find the discrete browser (now known as
DeepCyberSpace
under the TAOO agents section... ;)
cheer
Hello Xavier,
Le 24 mai 05 à 13:48, [EMAIL PROTECTED] a écrit :
Hi Eric,
Either it's in between tags or ...
Note some frames can be downloaded like
Hope it's what you meant...
You are right about some frames that can be dowloaded by parsing the
But you are not sure to get the whole con
Hi Eric,
Either it's in between tags or ...
Note some frames can be downloaded like
Hope it's what you meant...
cheers
Xavier
On 24.05.2005 13:39:34 use-revolution-bounces wrote:
>Hi everyone,
>
>I am finishing a plugin that imports web pages as .txt files.
>So it can search through the file
Hi everyone,
I am finishing a plugin that imports web pages as .txt files.
So it can search through the files any expression and display the
corresponding web page in any browser.
Very simple and efficient :-)
But I wonder how to retrieve the effective text from framed pages?
Any idea?
Thanks,