Patrick Ohly

> I have already verified manually that it is possible to replace
> the <a href="javascript"></a> with a <form>...</form>, but this
> doesn't really help I guess. Is there any way how I can cause
> any of the plucker frontends to trigger a POST operation and then
> include the result in the pdb file? 

Python can handle posts.  (url-fetching functions take an optional
third argument of "data".)  PyPlucker has some support for it, but
not quite.  (It may be urltext_key).  To my knowledge, you would
probably have to make code changes.

Part of the catch is that post actions are supposed to involve
the user saying "yes, I really want to do this", but get actions
can be automatic.  This means that at a properly designed site,
GET is safe, but POST might order thousands of dollars worth
of junk.  You don't want that happening by accident.

> Ideally, I'd let JPluck transform the starting page with XSLT
> automatically. As I have never used XSLT before, I am looking for
> an easier way to debug a stylesheet than running JPluck on
> a page and looking at the resulting pdb: in particular, it would
> be nice to get a serial dump of the parse tree that the stylesheet
> is applied to and the result of the transformation.

I agree, but can't offer much help just yet.  I have heard that IE 
can use the same XSLT, which might speed your testing.
 
"David A. Desrosiers" <[EMAIL PROTECTED]>
>       Using javascript to navigate between links is not HTML, 
> and Plucker shouldn't be expected to handle that. Plucker 
> also doesn't handle forms

Is there anything that Plucker *should* handle?

>From a user's perspective, we want to see the information on
a web page, and our browser (including plucker) ought to
handle any fetching for us - including authentication,
cookies, and forms.  It doesn't quite yet, but it is getting
closer (particularly with jpluck) and should continue to 
improve.

> Convert the links back to their native HTML equivalents, 
> and you'll see better results. 

There may not be one, if the post data is essential.  (And if
it weren't, GET would probably have worked.)

> Perhaps the - --url-pattern options would help here. Apply 
> a regex to convert the invalid Javascript code to HTML, and 
> parse again.

Hunh?  Have you seen url-pattern doing something more than
filtering which urls to fetch?  Transformations (including regex)
are good, but I haven't seen it in PyPlucker yet, and I haven't
seen it in JPluck without XSLT (which he is still debugging).

If you think it is there, please tell me where/how and save me
some coding that I wouldn't get to this month anyhow.

-jJ
_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list

Reply via email to