Re: Major problem with SQLite binding handling

H William Wellliver III Wed, 24 May 2017 11:15:02 -0700

> The "datatype" that comes closest to being a suitable native bytearray

    > these days is Stdio.Buffer.

Is this true? If the binding were to interpret any Buffer object asbinary data, would that cause surprise? My reading suggests that it is agrey area and I'd rather solve the problem unambiguously if it'spossible.




   > I presume you mean SQLite with typed queries here?

I'm not sure what a typed query is; this has to do with using bindingparameters and the way the pike glue to SQLite binds a given parameter.

> doing this and hasn???t run into this problem?), as any 8-bitstrings> that weren???t binary data would be stored as blobs, even if theywere> in a text typed field. These records would need to be re-storedwith> the proper text type, which could be done with a query to updatethe> table. My sense is that this is the proper thing to do, as blobfields> should be reserved for data that???s actually binary data (asopposed

    > to text).

> - The contortions you describe to get the queries right with thecurrent> behaviour would indicate that anyone who would have tried to dothe same> would likely have ended up complaining here while trying to get itright.

I have a hard time imagining that anyone has tried it. Not only does itcause all kinds of complications within pike, it also means that queriesexecuted directly against the sqlite command would fail in similar ways,as there could be mixed types of data within a given column and writingsome string as a blob literal means converting it to hex data first,which isn't something many people can do on the fly.

> - In general I've noticed that very few, if any, people areactually using

    >  *typed* queries from Sql.Sql.

> So that would suggest that your change would be beneficial, andwould> (for safety) require a bit of compat code to get it right foranyone

    > unfortunate enough to rely on older behaviour.

I would argue the opposite: the existing behavior is so broken thattrying to come up with compat for it would just make existing code morebrittle because the effect of the current code is to effectively corruptthe data in a column.


Example:

a column of type text may have individual elements that are text or blob(or number even) values depending on whether the data was inserted usinga text literal 'some text', a blob literal X'14EC24', a bindingparameter from pike that happened to be a wide string (stored as text)or a narrow string (stored as binary). So any query run against such acolumn would likely return incorrect results unless the values and thequery values were always cast to one or the other.

I can't think of a scenario where this is desirable. Even if you ignoreapplications written for ASCII code only, I imagine there are lots ofnarrow values in languages that also have wide characters.

My proposal would be to fix this so that all strings are stored as textand that all values of some other object type (Stdio.Buffer orpreferably some Sql datatype wrapper for bytestrings) are stored asblobs. A reasonable workaround for anyone crazy enough to use theexisting broken functionality can just wrap their narrow strings withthe object mentioned above and have that value bound as a blob... theperfect use of a release note.

I'd also argue that this ought to be fixed in 8.0 as well... thebehavior is so bad that I honestly can't imagine anyone has used itsuccessfully.


Bill

Re: Major problem with SQLite binding handling

Reply via email to