Re: [HACKERS] Patch Review: Bugfix for XPATH() if text or attribute nodes are selected

Radosław Smogura Tue, 12 Jul 2011 03:58:31 -0700

On Tue, 12 Jul 2011 11:45:59 +0200, Florian Pflug wrote:

On Jul12, 2011, at 11:00 , Radosław Smogura wrote:
On Sun, 10 Jul 2011 17:06:22 -0500, Robert Haas wrote:
Unless I am missing something, Florian  is clearly correct here.
For me not, because this should be fixed internally by making xmltype sefe
Huh??. Making the xml type safe is *exactly* what I'm trying to dohere...
currently xml type may be used to keep proper XMLs and any kind ofdata, as well.
As I pointed out before, that simply isn't true. Try storing
non-well-formed data into an XML column (there *are* ways to do
that, i.e. there are bugs, one if which I'm trying to fix here!)
and then dump and (try to) reload your database. Ka-wooooom!
If I ask, by any means select xpath(/text(...))..... I want to gettext.
And I want '3' || '4' to return the integer 34. Though luck! The factthatXPATH() is declared to return XML, *not* TEXT means you don't getwhat youwant. Period. Feel free to provide a patch that adds a functionXPATH_TEXT
if you feel this is an issue.
XML *is* *not* simply an alias for TEXT! It's a distinct type, whichitsdown distinct rules about what constitutes a valid value and whatdoesn't.
1) How I should descape node in client application (if it's part ofxml I don't have header), bear in mind XML must give support forstreaming processing too.
Huh?
2) Why I should differntly treat text() then select from varchar inboth I ask for xml, driver can't make this, because it doesn't know ifit gets scalar, text, comment, element, or maybe document.
3) What about current applications, folks probably uses this and arehappy they get text, and will not see, that next release of PostgreSQLwill break their applications.
That, and *only* that, I recognize as a valid concern. However, and*again*as I have pointer out before a *multiple* of times, backwardscompatibilityis no excuse not to fix bugs. Plus, there might just as well beapplicationswhich feed the contents of XML columns directly into a XML parser (astheyhave every right to!) and don't expect that parser to throw an error.Which,as it stands, we cannot guarantee. Having to deal with an error thereis akin
to having to deal with integer columns containing 'foobar'!

Bugs must be resolved in smart way, especially if they changesbehaviour, with consideration of impact change will produce, removingsupport for xml resolves this bug as well. I've said problem should beresolved in different way.

There is of course disadvantage of current behaviour as it may leadto inserting badly xmls (in one case), but I created example when autoescaping will create double escaped xmls, and may lead to insertinproper data (this is about 2nd patch where Florian add escaping,too).
SELECT XMLELEMENT(name root, XMLATTRIBUTES(foo.namespace AS sth))FROM (SELECT(XPATH('namespace-uri(/*)', x))[1] AS namespace FROM (VALUES(XMLELEMENT(name"root", XMLATTRIBUTES('<n' AS xmlns, '<v' AS value),'<t'))) v(x)) asfoo;
      xmlelement
-------------------------
<root sth="&amp;lt;n"/>
Radosław, you've raised that point before, and I refuted it. Thecrucialdifference is that double-escaped values are well-formed, where asun-escaped
ones are not.
Again, as I said before, the double-escaping done by XMLATTRIBUTESthere isnot pretty. But its *not* XPATH()'s fault!. To see that, simplyreplace your
XPATH() expression with '&lt;n'::xml to see that.

And in fact
It can't be resolved without storing type in xml or adding xmltextor adding pseudo xmlany element, which will be returned by xpath.
Huh?
Frankly, Radosław, I get the feeling that you're not trying tounderstand myanswers to your objections, but instead keep repeating the sameassertions
over and over again. Even though at least some of them, like XML
being able to
store arbitrary values, are simply wrong! And I'm getting pretty
tired of this...
So far, you also don't seem to have taken a single look at the actual
implementation of the patch, even though code review is an supposedto be an
integral part of the patch review process. I therefore don't believe
that we're
getting anywhere here.

So far, you don't know if I taken a single look, your suspicious arewrong, and You try to blame me. All of your sentences about "do notunderstanding" I may sent to you, and blame you with your words.

So please either start reviewing the actual implementation, and leave
the considerations about whether we want this or not to the eventual
committer.
Or, if you don't want to do that for one reason or another, pleaserconsiderletting somebody else take over this review, i.e. consider removingyour name
from the "Reviewer" field.

If I do review I may put my comments, but "I get the feeling thatyou're not trying to understand myanswers to your objections, but instead keep repeating the sameassertions over and over again." - and in patch there is review of code.So please either "stop" responding to my objections "and leave theconsiderations about whether we want this or not to the eventualcommitter."


And consider this new assertions...

I store XML document in XML (escaped text, why not?), then recreate itby xpath and insert, with Your patch, I will not insert document butescaped text, which is wrong XML, and ALL your objections about currentbuggy not-escaping will FULLY apply to this situation.

It's ping-pong, and this patch moves problem from one corner to other.

For me this discussion is over. I putted my objections and suggestions.Full review is available in archives, and why to not escape is putted inreview of your 2nd patch, about scalar values.


Regards,
Radek


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Patch Review: Bugfix for XPATH() if text or attribute nodes are selected

Reply via email to