Hi, I'm using the new version of the Plucker Desktop, and I'm trying to Pluck an article that spans several pages, but I'm not having much luck, could someone point out what I'm doing wrong?
The article is at: "The Next Ages of Game Development" http://avault.com/developer/getarticle.asp?name=bsawyer1 and each page is linked thus: http://avault.com/developer/getarticle.asp?name=bsawyer1&page=2 http://avault.com/developer/getarticle.asp?name=bsawyer1&page=3 etc etc. I set http://avault.com/developer/getarticle.asp?name=bsawyer1 as the URL, and Maximum Depth to 22(!), "Ignore links to a server that is different that is different from a starting page's server" to TRUE and used an URL pattern filter as ".*avault.com/developer/getarticle.asp?name=bsawyer1.*". The output is as follows: <snip> Initializing Plucker spidering engine... ----------------------------------------------------------- Updating channel: Ages of Development... ----------------------------------------------------------- Pluckerdir is 'E:\Plucker-Desktop'... Using proxy '' with authentication for user ''... ZLib compression turned on Using exclusion list E:\Plucker-Desktop\exclusionlist.txt Using exclusion list E:\Plucker-Desktop\exclusionlist.txt Regexp pattern is '.*avault.com/developer/getarticle.asp?name=bsawyer1.*' ---- 0 collected, 1 to do ---- Processing http://avault.com/developer/getarticle.asp?name=bsawyer1... Retrieved ok. Not fetching image http://avault.com/images/layout/avault.gif Not fetching image http://avault.com/images/layout/page_developer.gif Not fetching image http://avault.com/images/layout/spacer.gif Not fetching image http://avault.com/images/layout/menu_logo-anim.gif Not fetching image http://avault.com/images/layout/menu_sections.gif Not fetching image http://avault.com/images/layout/menu_inside.gif Not fetching image http://avault.com/images/layout/menu_site.gif Not fetching image http://avault.com/images/layout/spacer.gif Not fetching image http://avault.com/developer/images/bsawyer11a.jpg Not fetching image http://avault.com/images/layout/next.gif Not fetching image http://avault.com/images/layout/spacer.gif Parsed ok; added 1 document link. ---- 1 collected, 1 to do ---- Processing mailto:[EMAIL PROTECTED]... Retrieved ok. Parsed ok. ---- all 2 pages retrieved and parsed ---- Writing out collected data... Writing document 'Ages of Development' to file E:\Plucker-Desktop\channels/AgesofDevelopment/AgesofDevelopment.pdb Converting mailto:[EMAIL PROTECTED]... Converted 11: mailto:[EMAIL PROTECTED] Converting http://avault.com/developer/getarticle.asp?name=bsawyer1... Converted 2: http://avault.com/developer/getarticle.asp?name=bsawyer1 Default charset is MIBenum 4 (ISO-8859-1) New document <PluckerIndexDocument 'plucker:/~special~/index' at 12000940> added Converted 1: plucker:/~special~/index New document <PluckerMetadataDocument 'plucker:/~special~/metadata' at 12000492> added Converted 5: plucker:/~special~/metadata Wrote 1 <= plucker:/~special~/index Wrote 2 <= http://avault.com/developer/getarticle.asp?name=bsawyer1 Wrote 5 <= plucker:/~special~/metadata Wrote 11 <= mailto:[EMAIL PROTECTED] Done! Installing channel output to destinations... Setting channels new due date Tasks completed for all channels. </snip> I'm sure its something to do with the regex - can anyone help? Thanks for your time, and my apologies for the length of the email. Cheers, Ian -- fortune says: In an orderly world, there's always a place for the disorderly. ~~~~~~~~~~~ Made in Ireland using GNU Emacs ~~~~~~~~~~~ Ian Swainson Kia Ora! [EMAIL PROTECTED] ~~~~~~~~~~~~~~~~ http://www.clients.ie ~~~~~~~~~~~~~~~~ _______________________________________________ plucker-list mailing list [EMAIL PROTECTED] http://lists.rubberchicken.org/mailman/listinfo/plucker-list

