Ma77Ball opened a new pull request, #4995:
URL: https://github.com/apache/texera/pull/4995
### What changes were proposed in this PR?
ArrowUtils.fromTexeraSchema now tags ANY attributes with texera_type=ANY
metadata on the Arrow field, and toTexeraSchema reads that tag back. This
mirrors the existing LARGE_BINARY mechanism. Without it, ANY round-trips
silently became STRING because both types share the same Arrow
representation (Utf8).
### Any related issues, documentation, or discussions?
Closes: #4762
### How was this PR tested?
Updated ArrowUtilsSpec (in common/workflow-core): replaced the test that
pinned the bug ("lose the ANY distinction") with one that asserts ANY is
preserved through a round-trip, and added a test that the texera_type=ANY
metadata is attached only to ANY fields. Ran both WorkflowCore (27/27) and
WorkflowOperator (14/14) ArrowUtilsSpec suites — all pass.
### Was this PR authored or co-authored using generative AI tooling?
Co-Authored with Claude Opus 4.7 in compliance with ASF
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]