> Would the Arrow team welcome a pull request that enhances
ValidateFull() to
> validate that utf8-column values are well-formed UTF-8 byte sequences?

We already have a UTF-8 validation function, but it's not hooked into
ValidateFull().  So, yes, that seems desirable to me.  Can you open a
JIRA and perhaps a PR?

> Another validation we've added to Workbench is in column *names*. In
> Arrow's IPC layer, `FieldFromFlatbuffer()` validates that column names are
> not null. But it doesn't validate that column names are well-formed UTF-8.
> The Flatbuffers spec says strings should be valid UTF-8. Should
> `FieldFromFlatbuffer()` check?

I have no idea.  I'll let others comment.

Regards

Antoine.

Reply via email to