> Patch optimizes Float16 to integral conversion operations. Currently, its a > two step process where by first a Float16 value is > converted to a single precision floating point value followed by a conversion > to an integral value. > > x86 targets supporting AVX512-FP16 feature (Intel Sapphire Rapids+ and > upcoming AMD Zen6) provides direct instruction to convert a Float16 value to > integral value. > > Following are the performance numbers of micro benchmark included with the > patch on Granite Rapids with and without auto-vectorization. > > <img width="1125" height="636" alt="image" > src="https://github.com/user-attachments/assets/ca6e6757-1579-475f-8307-9454c7c025c1" > /> > > Kindly review and share your feedback. > > Best Regards, > Jatin > > --------- > - [x] I confirm that I make this contribution in accordance with the [OpenJDK > Interim AI Policy](https://openjdk.org/legal/ai).
Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision: Review comments resolution ------------- Changes: - all: https://git.openjdk.org/jdk/pull/30928/files - new: https://git.openjdk.org/jdk/pull/30928/files/bf170d93..da2ff1b0 Webrevs: - full: https://webrevs.openjdk.org/?repo=jdk&pr=30928&range=01 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=30928&range=00-01 Stats: 6 lines in 3 files changed: 0 ins; 0 del; 6 mod Patch: https://git.openjdk.org/jdk/pull/30928.diff Fetch: git fetch https://git.openjdk.org/jdk.git pull/30928/head:pull/30928 PR: https://git.openjdk.org/jdk/pull/30928
