Yicong-Huang opened a new pull request, #4758:
URL: https://github.com/apache/texera/pull/4758
### What changes were proposed in this PR?
Adds scalatest coverage for
`common/workflow-core/src/main/scala/org/apache/texera/amber/util/ArrowUtils.scala`.
The Arrow ↔ Texera type / schema converters had no dedicated spec.
The `getTexeraTuple` / `setTexeraTuple` / `appendTexeraTuple` row-level
helpers are intentionally out of scope for this PR — they need a real
`VectorSchemaRoot` and substantial Arrow scaffolding, and warrant a dedicated
PR.
### Any related issues, documentation, discussions?
Closes #4751.
Two contract divergences are pinned in the spec with explanatory comments
and filed separately as Bug issues: (1) `toAttributeType`'s catch-all `case 64
| _ => LONG` collapses every non-{16,32}-bit width onto LONG (including widths
like 8 that are clearly not LONG), and (2) `fromAttributeType` collapses
`STRING`, `LARGE_BINARY`, and `ANY` onto the same `Utf8` Arrow type —
`LARGE_BINARY` is recovered via field metadata, but `ANY` loses its distinction
entirely on round-trip and surfaces as `STRING`.
### How was this PR tested?
```
sbt scalafmtCheckAll
sbt "WorkflowCore/testOnly org.apache.texera.amber.util.ArrowUtilsSpec"
```
### Was this PR authored or co-authored using generative AI tooling?
Generated-by: Claude Code (claude-opus-4-7)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]