Reproducing a web page and add own content to it.

2008-04-08 Thread LaundroMat
Hi -

I'm working on a Django powered site where one of the required
functionalities is the possibility of displaying the content of
external pages, with an extra banner at the top where specific
information is displayed. In other words, I'm looking for a way to
reproduce an existing web page and add some HTML code to it. (I can't
think of an example right now, but the idea is similar to sites that
let you see an external page and have some site-specific text above it
(often stating that the content below is not part of the site the user
comes from)).

To test this, I've been downloading an external page, adding some text
to it and re-opening it in a browser (with the help of built-in
modules such as urllib2 etc). This works of course, but the external
page's links such as img src=hello.png, or a href=help.html
are evidently no longer correct.

Apart from parsing the whole file and trying to inject the external
site's domain in links such as the above (with the added inconvenience
of having to store the external page locally), is there an easier way
of accomplishing what I want?

Thanks,

Mathieu

-- 
http://mail.python.org/mailman/listinfo/python-list


Re: Reproducing a web page and add own content to it.

2008-04-08 Thread Diez B. Roggisch
LaundroMat wrote:

 Hi -
 
 I'm working on a Django powered site where one of the required
 functionalities is the possibility of displaying the content of
 external pages, with an extra banner at the top where specific
 information is displayed. In other words, I'm looking for a way to
 reproduce an existing web page and add some HTML code to it. (I can't
 think of an example right now, but the idea is similar to sites that
 let you see an external page and have some site-specific text above it
 (often stating that the content below is not part of the site the user
 comes from)).
 
 To test this, I've been downloading an external page, adding some text
 to it and re-opening it in a browser (with the help of built-in
 modules such as urllib2 etc). This works of course, but the external
 page's links such as img src=hello.png, or a href=help.html
 are evidently no longer correct.
 
 Apart from parsing the whole file and trying to inject the external
 site's domain in links such as the above (with the added inconvenience
 of having to store the external page locally), is there an easier way
 of accomplishing what I want?

Using a frame?

Diez
-- 
http://mail.python.org/mailman/listinfo/python-list


Re: Reproducing a web page and add own content to it.

2008-04-08 Thread LaundroMat
On Apr 8, 2:04 pm, Diez B. Roggisch [EMAIL PROTECTED] wrote:
 LaundroMat wrote:
  Hi -

  I'm working on a Django powered site where one of the required
  functionalities is the possibility of displaying the content of
  external pages, with an extra banner at the top where specific
  information is displayed. In other words, I'm looking for a way to
  reproduce an existing web page and add some HTML code to it. (I can't
  think of an example right now, but the idea is similar to sites that
  let you see an external page and have some site-specific text above it
  (often stating that the content below is not part of the site the user
  comes from)).

  To test this, I've been downloading an external page, adding some text
  to it and re-opening it in a browser (with the help of built-in
  modules such as urllib2 etc). This works of course, but the external
  page's links such as img src=hello.png, or a href=help.html
  are evidently no longer correct.

  Apart from parsing the whole file and trying to inject the external
  site's domain in links such as the above (with the added inconvenience
  of having to store the external page locally), is there an easier way
  of accomplishing what I want?

 Using a frame?

 Diez

Ack. I was too focused on importing the external web page and
redisplaying the information (I've just been reading up on
BeautifulSoup) instead of looking for an HTML based approach.

Thanks!
-- 
http://mail.python.org/mailman/listinfo/python-list


Re: Reproducing a web page and add own content to it.

2008-04-08 Thread Steve Holden
LaundroMat wrote:
 On Apr 8, 2:04 pm, Diez B. Roggisch [EMAIL PROTECTED] wrote:
 LaundroMat wrote:
 Hi -
 I'm working on a Django powered site where one of the required
 functionalities is the possibility of displaying the content of
 external pages, with an extra banner at the top where specific
 information is displayed. In other words, I'm looking for a way to
 reproduce an existing web page and add some HTML code to it. (I can't
 think of an example right now, but the idea is similar to sites that
 let you see an external page and have some site-specific text above it
 (often stating that the content below is not part of the site the user
 comes from)).
 To test this, I've been downloading an external page, adding some text
 to it and re-opening it in a browser (with the help of built-in
 modules such as urllib2 etc). This works of course, but the external
 page's links such as img src=hello.png, or a href=help.html
 are evidently no longer correct.
 Apart from parsing the whole file and trying to inject the external
 site's domain in links such as the above (with the added inconvenience
 of having to store the external page locally), is there an easier way
 of accomplishing what I want?
 Using a frame?

 Diez
 
 Ack. I was too focused on importing the external web page and
 redisplaying the information (I've just been reading up on
 BeautifulSoup) instead of looking for an HTML based approach.
 
 Thanks!

You could also look at adding a base tag to your generated page's 
head section.

regards
  Steve
-- 
Steve Holden+1 571 484 6266   +1 800 494 3119
Holden Web LLC  http://www.holdenweb.com/

-- 
http://mail.python.org/mailman/listinfo/python-list


Re: Reproducing a web page and add own content to it.

2008-04-08 Thread LaundroMat
On Apr 8, 4:11 pm, Steve Holden [EMAIL PROTECTED] wrote:
 LaundroMat wrote:
  On Apr 8, 2:04 pm, Diez B. Roggisch [EMAIL PROTECTED] wrote:
  LaundroMat wrote:
  Hi -
  I'm working on a Django powered site where one of the required
  functionalities is the possibility of displaying the content of
  external pages, with an extra banner at the top where specific
  information is displayed. In other words, I'm looking for a way to
  reproduce an existing web page and add some HTML code to it. (I can't
  think of an example right now, but the idea is similar to sites that
  let you see an external page and have some site-specific text above it
  (often stating that the content below is not part of the site the user
  comes from)).
  To test this, I've been downloading an external page, adding some text
  to it and re-opening it in a browser (with the help of built-in
  modules such as urllib2 etc). This works of course, but the external
  page's links such as img src=hello.png, or a href=help.html
  are evidently no longer correct.
  Apart from parsing the whole file and trying to inject the external
  site's domain in links such as the above (with the added inconvenience
  of having to store the external page locally), is there an easier way
  of accomplishing what I want?
  Using a frame?

  Diez

  Ack. I was too focused on importing the external web page and
  redisplaying the information (I've just been reading up on
  BeautifulSoup) instead of looking for an HTML based approach.

  Thanks!

 You could also look at adding a base tag to your generated page's
 head section.

 regards
   Steve
 --
 Steve Holden+1 571 484 6266   +1 800 494 3119
 Holden Web LLC  http://www.holdenweb.com/

True, but I suppose that users would no longer see the top banner
added by me when they click on one of the links on the external site's
page. I'm a bit hesitant about using frames however, but reading up on
them makes me think the application I have in mind for them might be
the generally accepted exception to the rule that frames are bad :)

Anyway. Thanks for the help!
-- 
http://mail.python.org/mailman/listinfo/python-list