L. C. Hsieh created SPARK-57112:
-----------------------------------

             Summary: Add putNotNulls case to WritableColumnVectorBulkFillBenchmark
                 Key: SPARK-57112
                 URL: https://issues.apache.org/jira/browse/SPARK-57112
             Project: Spark
          Issue Type: Improvement
          Components: Tests
    Affects Versions: 4.3.0
            Reporter: L. C. Hsieh
            Assignee: L. C. Hsieh


SPARK-57042 added WritableColumnVectorBulkFillBenchmark covering the
  constant-value bulk-fill APIs on WritableColumnVector. The benchmark
  does not yet cover putNotNulls(rowId, count), which is the inverse of
  putNulls and runs once per batch from WritableColumnVector.reset()
  when numNulls > 0.

  This change adds a putNotNulls case mirroring the existing putNulls
  case. It seeds one null into the column vector at setup so that
  putNotNulls' `if (!hasNull()) return;` early-out does not skip the
  fill during measurement.
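
  The reason for seeding a null can be sketched with a toy model. The
  class below is a simplified, hypothetical stand-in for the null
  tracking of an on-heap column vector, not Spark's actual
  WritableColumnVector. It assumes that hasNull() is equivalent to
  numNulls > 0, that putNotNulls itself leaves numNulls unchanged, and
  that reset() clears the counter after the fill. The boolean return
  value of putNotNulls exists only to make the early-out observable here.

```java
public class PutNotNullsSketch {
    /** Toy stand-in for a column vector's null tracking (hypothetical, not Spark code). */
    static final class ToyVector {
        final byte[] nulls;   // 1 = null, 0 = not null
        int numNulls = 0;

        ToyVector(int capacity) {
            this.nulls = new byte[capacity];
        }

        boolean hasNull() {
            return numNulls > 0;
        }

        void putNull(int rowId) {
            nulls[rowId] = (byte) 1;
            numNulls++;
        }

        /** Returns true if the fill ran, false if the early-out skipped it. */
        boolean putNotNulls(int rowId, int count) {
            if (!hasNull()) return false;   // the early-out quoted in the description
            for (int i = 0; i < count; i++) {
                nulls[rowId + i] = (byte) 0;
            }
            return true;                    // assumption: numNulls is left unchanged here
        }

        /** Per-batch reset: clears null flags only when some were set. */
        void reset() {
            if (numNulls > 0) {
                putNotNulls(0, nulls.length);
                numNulls = 0;               // assumption: the caller clears the counter
            }
        }
    }

    public static void main(String[] args) {
        int rows = 4096;

        // Without a seeded null, the measured call returns before filling anything.
        ToyVector unseeded = new ToyVector(rows);
        System.out.println("unseeded fill ran: " + unseeded.putNotNulls(0, rows));

        // Seeding one null at setup keeps hasNull() true across measured iterations.
        ToyVector seeded = new ToyVector(rows);
        seeded.putNull(0);
        boolean ranEveryTime = true;
        for (int iter = 0; iter < 3; iter++) {
            ranEveryTime &= seeded.putNotNulls(0, rows);
        }
        System.out.println("seeded fill ran every iteration: " + ranEveryTime);
    }
}
```

  Under these assumptions a single seeded null is enough: the counter
  stays positive because the measured loop calls putNotNulls directly
  rather than going through reset().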

  The benchmark addition lands separately so that an upstream-master
  baseline exists before SPARK-57111 (which switches putNotNulls to
  Arrays.fill / Platform.setMemory) is measured. With this case in
  master, the GHA `Run benchmarks` workflow will produce a like-for-like
  before/after comparison when SPARK-57111 is benchmarked.



