Greetings,

Just installed plucker and plucker-desktop; was surprised how full of
holes the latter seems to be.  But now I've figured out how plucker
itself works and am rolling my own.  I'm very impressed, in general!

Tried to load the default home.html; while some pages are now dead (eg
nytimes link), I couldn't get the parser to process the plucker home
page or my home page for that matter ... am I doing something wrong>
Many pages seem to work fine (tho cbc.ca/mobile does not oddly, since
they built it for PDA syncing.)

I enclose a log resulting from trying to parse my home page.
Any ideas what I can do?  Thanks in advance for any tips.


===

Initializing Plucker spidering engine...
 
-----------------------------------------------------------
Updating channel: my home page...
-----------------------------------------------------------
Pluckerdir is '/home/migod/.plucker'...
Using proxy '' with authentication for user ''...
---- 0 collected, 1 to do ----
Processing http://plg.uwaterloo.ca/~migod...
  Retrieved ok.
Error:  Unknown error parsing document http://plg.uwaterloo.ca/~migod:
Traceback (innermost last):
  File "/usr/local/PyPlucker/Parser.py", line 27, in generic_parser
    parser = TextParser.StructuredHTMLParser (url, data, headers,
config, attributes)
  File "/usr/local/PyPlucker/TextParser.py", line 899, in __init__
    self.feed (text)
  File "/usr/lib/python1.5/site-packages/xml/parsers/sgmllib.py", line
465, in finish_endtag
    self.handle_endtag(tag, method)
  File "/usr/local/PyPlucker/TextParser.py", line 1005, in handle_endtag
    if method: sgmllib.SGMLParser.handle_endtag(self, tag, method)
  File "/usr/lib/python1.5/site-packages/xml/parsers/sgmllib.py", line
476, in handle_endtag
    method()
  File "/usr/local/PyPlucker/TextParser.py", line 1406, in end_font
    self._doc.unset_forecolor (forecolor)
  File "/usr/local/PyPlucker/TextParser.py", line 515, in
unset_forecolor
    if self._attributes.pop_forecolor (value):
  File "/usr/local/PyPlucker/TextParser.py", line 251, in pop_forecolor
    foreres = self._forecolor[-1] != self._forecolor[-2]
IndexError: list index out of range
  Parsing failed.
---- all 0 pages retrieved and parsed ----
Writing out collected data...
Writing document 'my home page' to file
/home/migod/.plucker/channels/myhomepage/myhomepage.pdb
Traceback (innermost last):
  File "/usr/local/bin/plucker-build", line 1532, in ?
    sys.exit(realmain())
  File "/usr/local/bin/plucker-build", line 1524, in realmain
    retval = main (config, exclusion_lists)
  File "/usr/local/bin/plucker-build", line 1046, in main
    mapping = writer.write (verbose=verbosity, alias_list=alias_list)
  File "/usr/local/PyPlucker/Writer.py", line 520, in write
    result = Writer.write (self, verbose, alias_list=alias_list)
  File "/usr/local/PyPlucker/Writer.py", line 339, in write
    raise RuntimeError("The collection process failed to generate a
'home' document")
RuntimeError: The collection process failed to generate a 'home'
document
Installing channel output to destinations...
Setting channels new due date
Tasks completed for all channels.


_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list

Reply via email to