> Patch optimizes Float16 to integral conversion operations. Currently, its a 
> two step process where by first a Float16 value is
> converted to a single precision floating point value followed by a conversion 
> to an integral value.
> 
> x86 targets supporting AVX512-FP16 feature (Intel Sapphire Rapids+ and 
> upcoming AMD Zen6) provides direct instruction to convert a Float16 value to 
> integral value.
> 
> Following are the performance numbers of micro benchmark included with the 
> patch on Granite Rapids with and without auto-vectorization.
> 
> <img width="1125" height="636" alt="image" 
> src="https://github.com/user-attachments/assets/ca6e6757-1579-475f-8307-9454c7c025c1";
>  />
> 
> Kindly review and share your feedback.
> 
> Best Regards,
> Jatin
> 
> ---------
> - [x] I confirm that I make this contribution in accordance with the [OpenJDK 
> Interim AI Policy](https://openjdk.org/legal/ai).

Jatin Bhateja has updated the pull request incrementally with one additional 
commit since the last revision:

  Review comments resolution

-------------

Changes:
  - all: https://git.openjdk.org/jdk/pull/30928/files
  - new: https://git.openjdk.org/jdk/pull/30928/files/bf170d93..da2ff1b0

Webrevs:
 - full: https://webrevs.openjdk.org/?repo=jdk&pr=30928&range=01
 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=30928&range=00-01

  Stats: 6 lines in 3 files changed: 0 ins; 0 del; 6 mod
  Patch: https://git.openjdk.org/jdk/pull/30928.diff
  Fetch: git fetch https://git.openjdk.org/jdk.git pull/30928/head:pull/30928

PR: https://git.openjdk.org/jdk/pull/30928

Reply via email to