[PR] test(amber): add unit tests for ArrowUtils [texera]

via GitHub Sat, 02 May 2026 18:35:16 -0700


Yicong-Huang opened a new pull request, #4758:
URL: https://github.com/apache/texera/pull/4758


   ### What changes were proposed in this PR?
   
   Adds ScalaTest coverage for `common/workflow-core/src/main/scala/org/apache/texera/amber/util/ArrowUtils.scala`. The Arrow ↔ Texera type and schema converters previously had no dedicated spec.
   
   The row-level helpers `getTexeraTuple`, `setTexeraTuple`, and `appendTexeraTuple` are intentionally out of scope for this PR: they need a real `VectorSchemaRoot` and substantial Arrow scaffolding, and warrant a dedicated PR.
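
   For a sense of the scaffolding involved, here is a minimal sketch of building a one-row `VectorSchemaRoot` with the Arrow Java API and reading it back. It does not call any `ArrowUtils` helper; the object name, column names, and `roundTripOneRow` are hypothetical and chosen only for illustration.

   ```scala
   import java.nio.charset.StandardCharsets
   import java.util.Arrays

   import org.apache.arrow.memory.RootAllocator
   import org.apache.arrow.vector.{IntVector, VarCharVector, VectorSchemaRoot}
   import org.apache.arrow.vector.types.pojo.{ArrowType, Field, Schema}

   // Hypothetical illustration, not part of the PR.
   object VectorSchemaRootSketch {

     // Writes one (id, name) row into a fresh VectorSchemaRoot and reads it back.
     def roundTripOneRow(id: Int, name: String): (Int, String) = {
       val allocator = new RootAllocator(Long.MaxValue)
       try {
         // Assumed two-column schema: a signed 32-bit int and a UTF-8 string.
         val schema = new Schema(
           Arrays.asList(
             Field.nullable("id", new ArrowType.Int(32, true)),
             Field.nullable("name", ArrowType.Utf8.INSTANCE)
           )
         )
         val root = VectorSchemaRoot.create(schema, allocator)
         try {
           root.allocateNew()
           val idVector = root.getVector("id").asInstanceOf[IntVector]
           val nameVector = root.getVector("name").asInstanceOf[VarCharVector]
           idVector.setSafe(0, id)
           nameVector.setSafe(0, name.getBytes(StandardCharsets.UTF_8))
           root.setRowCount(1)
           (idVector.get(0), new String(nameVector.get(0), StandardCharsets.UTF_8))
         } finally root.close() // release vector buffers before the allocator
       } finally allocator.close()
     }
   }
   ```

   Every row-level test would need this allocator and vector lifecycle handled correctly, which is why such tests are better kept in their own spec.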
   
   ### Any related issues, documentation, discussions?
   
   Closes #4751.
   
   Two contract divergences are pinned in the spec with explanatory comments and filed separately as Bug issues:

   1. `toAttributeType`'s catch-all `case 64 | _ => LONG` maps every bit width other than 16 and 32 to `LONG`, including widths such as 8 that are clearly not `LONG`.
   2. `fromAttributeType` maps `STRING`, `LARGE_BINARY`, and `ANY` to the same `Utf8` Arrow type. `LARGE_BINARY` is recovered via field metadata, but `ANY` loses its distinction entirely on round-trip and surfaces as `STRING`.
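
   The two divergences can be reproduced in a small stand-alone model. This is a sketch only: the types below are simplified stand-ins for Arrow's `ArrowType` and Texera's `AttributeType`, the metadata key `texera_type` is hypothetical, and the mapping of 16-bit and 32-bit widths to `INTEGER` is an assumption rather than something stated in this PR.

   ```scala
   // Stand-alone model of the behavior described above; not the real ArrowUtils code.
   object ArrowDivergenceSketch {
     sealed trait AttributeType
     case object INTEGER extends AttributeType
     case object LONG extends AttributeType
     case object STRING extends AttributeType
     case object LARGE_BINARY extends AttributeType
     case object ANY extends AttributeType

     // Simplified stand-ins for Arrow types and fields.
     sealed trait ArrowTypeModel
     final case class IntModel(bitWidth: Int) extends ArrowTypeModel
     case object Utf8Model extends ArrowTypeModel
     final case class FieldModel(arrowType: ArrowTypeModel, metadata: Map[String, String])

     val TypeKey = "texera_type" // hypothetical metadata key

     def toAttributeType(t: ArrowTypeModel): AttributeType = t match {
       case IntModel(16) | IntModel(32) => INTEGER // assumed target for 16/32 bits
       case IntModel(_)                 => LONG    // catch-all: 8, 64, and anything else
       case Utf8Model                   => STRING
     }

     def toField(a: AttributeType): FieldModel = a match {
       case INTEGER      => FieldModel(IntModel(32), Map.empty)
       case LONG         => FieldModel(IntModel(64), Map.empty)
       case LARGE_BINARY => FieldModel(Utf8Model, Map(TypeKey -> "LARGE_BINARY"))
       case STRING | ANY => FieldModel(Utf8Model, Map.empty) // ANY carries no marker
     }

     def fromField(f: FieldModel): AttributeType =
       if (f.metadata.get(TypeKey).contains("LARGE_BINARY")) LARGE_BINARY
       else toAttributeType(f.arrowType)

     def main(args: Array[String]): Unit = {
       assert(toAttributeType(IntModel(8)) == LONG)               // divergence (1)
       assert(fromField(toField(LARGE_BINARY)) == LARGE_BINARY)   // recovered via metadata
       assert(fromField(toField(ANY)) == STRING)                  // divergence (2)
     }
   }
   ```

   Pinning the current behavior with assertions like these keeps the spec green today while making any later fix show up as a deliberate test change.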
   
   ### How was this PR tested?
   
   ```
   sbt scalafmtCheckAll
   sbt "WorkflowCore/testOnly org.apache.texera.amber.util.ArrowUtilsSpec"
   ```
   
   ### Was this PR authored or co-authored using generative AI tooling?
   
   Generated-by: Claude Code (claude-opus-4-7)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

