subject:"Re\: \[Tutor\] Using Beautiful Soup to extract tag names"

Re: [Tutor] Using Beautiful Soup to extract tag names

2006-03-14 Thread Kent Johnson

Ed Singleton wrote:
 I have (unfortunately) received some data in XML format.  I need to
 use it in Python, preferably as a list of dictionaries.  The data is a
 flat representation of a table, in the style:
 
 tablename
 fieldname1Some Data/fieldname1
 fieldname2Some Data/fieldname
 ...
 /tablename
 tablename
 fieldname1Some Data/fieldname1
 fieldname2Some Data/fieldname
 ...
 
 and so on (where tablename is always the same in one file).

ElementTree makes short work of this:

from elementtree import ElementTree

xml = '''
datatablename
fieldname1Some Data1/fieldname1
fieldname2Some Data2/fieldname2
/tablename
tablename
fieldname3Some Data3/fieldname3
fieldname4Some Data4/fieldname4
/tablename
/data'''

doc = ElementTree.fromstring(xml)
# use ElementTree.parse() to parse a file

for table in doc.findall('tablename'):
 for field in table.getchildren():
 print field.tag, field.text


prints:
fieldname1 Some Data1
fieldname2 Some Data2
fieldname3 Some Data3
fieldname4 Some Data4

If speed is an issue then look at cElementTree which has the same 
interface and is blazingly fast.
http://effbot.org/zone/element.htm

Kent

___
Tutor maillist  -  Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor

Re: [Tutor] Using Beautiful Soup to extract tag names

2006-03-14 Thread Ed Singleton

As always Kent, you're amazing.

That will do perfectly.  (Though the ElementTree documentation seems a
bit difficult to get through.  I'm sure I'll get through it
eventually).

Thanks

Ed

On 14/03/06, Kent Johnson [EMAIL PROTECTED] wrote:
 Ed Singleton wrote:
  I have (unfortunately) received some data in XML format.  I need to
  use it in Python, preferably as a list of dictionaries.  The data is a
  flat representation of a table, in the style:
 
  tablename
  fieldname1Some Data/fieldname1
  fieldname2Some Data/fieldname
  ...
  /tablename
  tablename
  fieldname1Some Data/fieldname1
  fieldname2Some Data/fieldname
  ...
 
  and so on (where tablename is always the same in one file).

 ElementTree makes short work of this:

 from elementtree import ElementTree

 xml = '''
 datatablename
 fieldname1Some Data1/fieldname1
 fieldname2Some Data2/fieldname2
 /tablename
 tablename
 fieldname3Some Data3/fieldname3
 fieldname4Some Data4/fieldname4
 /tablename
 /data'''

 doc = ElementTree.fromstring(xml)
 # use ElementTree.parse() to parse a file

 for table in doc.findall('tablename'):
  for field in table.getchildren():
  print field.tag, field.text


 prints:
 fieldname1 Some Data1
 fieldname2 Some Data2
 fieldname3 Some Data3
 fieldname4 Some Data4

 If speed is an issue then look at cElementTree which has the same
 interface and is blazingly fast.
 http://effbot.org/zone/element.htm

 Kent

 ___
 Tutor maillist  -  Tutor@python.org
 http://mail.python.org/mailman/listinfo/tutor

___
Tutor maillist  -  Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor

Re: [Tutor] Using Beautiful Soup to extract tag names

2006-03-14 Thread Kent Johnson

Ed Singleton wrote:
 As always Kent, you're amazing.

Thank you!
 
 That will do perfectly.  (Though the ElementTree documentation seems a
 bit difficult to get through.  I'm sure I'll get through it
 eventually).

Unfortunately I have to agree with you on this one. ET is going to be 
part of the standard lib in Python 2.5, I hope the docs get a bit of work.

Kent

___
Tutor maillist  -  Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor

Re: [Tutor] Using Beautiful Soup to extract tag names

Re: [Tutor] Using Beautiful Soup to extract tag names

Re: [Tutor] Using Beautiful Soup to extract tag names

3 matches

Site Navigation

Mail list logo

Footer information