Re: Simple elementtree question

Stefan Behnel Thu, 30 Aug 2007 12:43:39 -0700

IamIan wrote:
> This is in Python 2.3.5. I've had success with elementtree and other
> RSS feeds, but I can't get it to work with this format:
> 
> <?xml version="1.0"?><rdf:RDF
>  xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#";
>  xmlns:dc="http://purl.org/dc/elements/1.1/";
>  xmlns:fr="http://ASPRSS.com/fr.html";
>  xmlns:pa="http://ASPRSS.com/pa.html";
>  xmlns="http://purl.org/rss/1.0/";>
> <channel rdf:about="http://www.sample.com";>
> <title>Example feed</title>
[...]
> </rdf:RDF>
> 
> What I want to extract is the text in the title and link tags for each
> item (eg. <title>First story</title> and <link>http://www.sample.com/
> news/20000/news.htm</link>). Starting with the title, my test script
> is:
> 
> import sys
> from urllib import urlopen
> 
> import elementtree.ElementTree as ET
> 
> news = urlopen("http://www.sample.com/rss/rss.xml";)
> nTree = ET.parse(news)
> for item in nTree.getiterator("title"):
>   print item.text
> 
> Whether I try this for title or link, nothing is printed.


Your document uses namespaces. What you are looking for is not the tag "title"
without a namespace, but the tag "{http://purl.org/rss/1.0/}title"; with the
default namespace.

http://effbot.org/zone/element.htm#xml-namespaces

Stefan
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Simple elementtree question

Reply via email to