dependabot[bot] opened a new pull request, #1046: URL: https://github.com/apache/opennlp/pull/1046
Bumps `onnxruntime.version` from 1.25.1 to 1.26.0. Updates `com.microsoft.onnxruntime:onnxruntime` from 1.25.1 to 1.26.0 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/microsoft/onnxruntime/releases">com.microsoft.onnxruntime:onnxruntime's releases</a>.</em></p> <blockquote> <h2>1.26.0</h2> <p>n.b. The following was generated via LLM from Git history. Only the contributor list has been verified.</p> <h1>ONNX Runtime Release 1.26.0</h1> <h2>Announcement - Breaking Changes</h2> <ul> <li><strong>Support for CUDA 12 will be removed in 1.27.0.</strong> <ul> <li>CUDA 13 will continue to be published as <code>onnxruntime-&lt;os&gt;-&lt;arch&gt;-gpu_cuda13-&lt;version&gt;.&lt;ext&gt;</code></li> </ul> </li> <li>CUDA runtime will be moving soon to a dedicated Execution Provider (EP) instead of a published package from ORT core.</li> </ul> <h2>Highlights</h2> <ul> <li>Added optional memory mapping for <code>.ort</code> model loads (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28164">#28164</a>).</li> <li>Added RISC-V Vector (RVV) support for CPU EP (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28261">#28261</a>).</li> <li>OpenVINO EP upgraded for 1.26.0 development release (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28297">#28297</a>).</li> <li>WebGPU gained GridSample support (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28264">#28264</a>) and Split-K improvements (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28151">#28151</a>).</li> <li>CUDA plugin EP gained graph support (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28002">#28002</a>), profiling API (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28216">#28216</a>).</li> </ul> <h2>Security and Reliability Hardening</h2> <ul> <li>Replaced unrestricted Python <code>setattr</code> configuration with an allowlist (<a 
href="https://redirect.github.com/microsoft/onnxruntime/pull/28083">#28083</a>).</li> <li>Hardened multiple OOB and overflow scenarios across ML and core ops: <ul> <li>Attention mask index OOB write (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27789">#27789</a>).</li> <li>MaxPoolGrad indices bounds validation (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27903">#27903</a>).</li> <li>SVM and TreeEnsemble bounds/security fixes (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27950">#27950</a>, <a href="https://redirect.github.com/microsoft/onnxruntime/pull/27951">#27951</a>, <a href="https://redirect.github.com/microsoft/onnxruntime/pull/27952">#27952</a>, <a href="https://redirect.github.com/microsoft/onnxruntime/pull/27989">#27989</a>).</li> <li>RNN sequence_lens OOB read and integer overflow handling (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28052">#28052</a>, <a href="https://redirect.github.com/microsoft/onnxruntime/pull/28003">#28003</a>).</li> <li>GroupQueryAttention seqlens_k bounds validation and compatibility follow-up (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28031">#28031</a>, <a href="https://redirect.github.com/microsoft/onnxruntime/pull/28259">#28259</a>).</li> <li>MatMulBnb4 and ML coefficient SafeInt checks (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27995">#27995</a>, <a href="https://redirect.github.com/microsoft/onnxruntime/pull/28001">#28001</a>).</li> <li>CUDA Gather int32 overflow fix (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28108">#28108</a>).</li> <li>GridSample float->int64 cast hardening for NaN/Inf/out-of-range coords (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28302">#28302</a>).</li> </ul> </li> <li>Fixed session logger use-after-free during EP teardown under verbose logging (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28274">#28274</a>).</li> </ul> 
<h2>CUDA, Attention, and MLAS</h2> <ul> <li>Filled CUDA opset/operator gaps and extended support: <ul> <li>Transpose opset 23 -> 25 (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27740">#27740</a>).</li> <li>QuantizeLinear/DequantizeLinear opset 25 (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28046">#28046</a>).</li> <li>CUDA TopK INT8/INT16/UINT8 support (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27862">#27862</a>).</li> <li>LabelEncoder CUDA support for numeric types (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28045">#28045</a>).</li> </ul> </li> <li>Attention/GQA improvements: <ul> <li>Fixed ONNX Attention min-bias alignment crash on SM<80 and masked-batch NaN behavior (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27831">#27831</a>).</li> <li>Added FP32 QK accumulation path for unfused GQA attention (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28198">#28198</a>).</li> <li>Added CUDART_VERSION reduction compatibility in GQA attention (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28296">#28296</a>).</li> <li>Fixed CUDA 13 build error in GQA unfused attention (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28309">#28309</a>).</li> <li>PagedAttention fallback for SM<80 fp16 (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28200">#28200</a>).</li> </ul> </li> <li>MLAS updates: <ul> <li>FP16 Gelu enablement (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/26815">#26815</a>).</li> <li>Arm64 BF16 fast-math conv kernels for NCHW/NCHWc paths (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27878">#27878</a>).</li> </ul> </li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... 
(truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="https://github.com/microsoft/onnxruntime/commit/8c546c37b43caaca1fa25db430dab94b901cf277"><code>8c546c3</code></a> 1.26.0 - cherry-pick for RC2 (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/28347">#28347</a>)</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/55c5c822a02e6237a667c13cd5ed008cd55bb459"><code>55c5c82</code></a> GridSample: harden float->int64 casts against NaN/Inf/out-of-range coords (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/2">#2</a>...</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/60ce9ccd65ebb1794110c4dddf2e110cf223e9f9"><code>60ce9cc</code></a> Relax GQA seqlens_k shape validation for backward compat with older models (#...</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/d02a0fd536d7cf6b88bfd268932d2933a1c3305c"><code>d02a0fd</code></a> Fix DoubleQDQPairsRemover adding spurious dimension to scalar scale/zero-poin...</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/9b30f306e40683df362249b7c139177cc83e352f"><code>9b30f30</code></a> remove weights_are_all_positive_ from TreeEnsemble (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/27552">#27552</a>)</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/5f2f848a6874d390c94502b3bb424447b119b975"><code>5f2f848</code></a> fix(ci): incorrect relative template includes for setup-feeds (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/28312">#28312</a>)</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/de2bc90ce09d8bab419e07e1db741f445a27f721"><code>de2bc90</code></a> Add QNN Plugin EP repo link to README (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/28225">#28225</a>)</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/8dd4a06811179a7c04aea0a97c65f45a3dc8c6af"><code>8dd4a06</code></a> Include license file in built 
distributions (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/27783">#27783</a>)</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/6e19374beb24cc2fa0e8262c6d51404af87aefc0"><code>6e19374</code></a> Fix CUDA 13 build error in gqa_unfused_attention.cu (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/28309">#28309</a>)</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/d6c363c0d192bf2205155a18f01a62efa6b34da1"><code>d6c363c</code></a> [OVEP] OpenVINO EP 1.26.0 Development Release Updates (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/28297">#28297</a>)</li> <li>Additional commits viewable in <a href="https://github.com/microsoft/onnxruntime/compare/v1.25.1...v1.26.0">compare view</a></li> </ul> </details> <br /> Updates `com.microsoft.onnxruntime:onnxruntime_gpu` from 1.25.1 to 1.26.0 <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/microsoft/onnxruntime/releases">com.microsoft.onnxruntime:onnxruntime_gpu's releases</a>.</em></p> <blockquote> <h2>1.26.0</h2> <p>n.b. The following was generated via LLM from Git history. 
Only the contributor list has been verified.</p> <h1>ONNX Runtime Release 1.26.0</h1> <h2>Announcement - Breaking Changes</h2> <ul> <li><strong>Support for CUDA 12 will be removed in 1.27.0.</strong> <ul> <li>CUDA 13 will continue to be published as <code>onnxruntime-&lt;os&gt;-&lt;arch&gt;-gpu_cuda13-&lt;version&gt;.&lt;ext&gt;</code></li> </ul> </li> <li>CUDA runtime will be moving soon to a dedicated Execution Provider (EP) instead of a published package from ORT core.</li> </ul> <h2>Highlights</h2> <ul> <li>Added optional memory mapping for <code>.ort</code> model loads (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28164">#28164</a>).</li> <li>Added RISC-V Vector (RVV) support for CPU EP (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28261">#28261</a>).</li> <li>OpenVINO EP upgraded for 1.26.0 development release (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28297">#28297</a>).</li> <li>WebGPU gained GridSample support (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28264">#28264</a>) and Split-K improvements (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28151">#28151</a>).</li> <li>CUDA plugin EP gained graph support (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28002">#28002</a>), profiling API (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28216">#28216</a>).</li> </ul> <h2>Security and Reliability Hardening</h2> <ul> <li>Replaced unrestricted Python <code>setattr</code> configuration with an allowlist (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28083">#28083</a>).</li> <li>Hardened multiple OOB and overflow scenarios across ML and core ops: <ul> <li>Attention mask index OOB write (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27789">#27789</a>).</li> <li>MaxPoolGrad indices bounds validation (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27903">#27903</a>).</li> <li>SVM and TreeEnsemble 
bounds/security fixes (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27950">#27950</a>, <a href="https://redirect.github.com/microsoft/onnxruntime/pull/27951">#27951</a>, <a href="https://redirect.github.com/microsoft/onnxruntime/pull/27952">#27952</a>, <a href="https://redirect.github.com/microsoft/onnxruntime/pull/27989">#27989</a>).</li> <li>RNN sequence_lens OOB read and integer overflow handling (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28052">#28052</a>, <a href="https://redirect.github.com/microsoft/onnxruntime/pull/28003">#28003</a>).</li> <li>GroupQueryAttention seqlens_k bounds validation and compatibility follow-up (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28031">#28031</a>, <a href="https://redirect.github.com/microsoft/onnxruntime/pull/28259">#28259</a>).</li> <li>MatMulBnb4 and ML coefficient SafeInt checks (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27995">#27995</a>, <a href="https://redirect.github.com/microsoft/onnxruntime/pull/28001">#28001</a>).</li> <li>CUDA Gather int32 overflow fix (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28108">#28108</a>).</li> <li>GridSample float->int64 cast hardening for NaN/Inf/out-of-range coords (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28302">#28302</a>).</li> </ul> </li> <li>Fixed session logger use-after-free during EP teardown under verbose logging (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28274">#28274</a>).</li> </ul> <h2>CUDA, Attention, and MLAS</h2> <ul> <li>Filled CUDA opset/operator gaps and extended support: <ul> <li>Transpose opset 23 -> 25 (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27740">#27740</a>).</li> <li>QuantizeLinear/DequantizeLinear opset 25 (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28046">#28046</a>).</li> <li>CUDA TopK INT8/INT16/UINT8 support (<a 
href="https://redirect.github.com/microsoft/onnxruntime/pull/27862">#27862</a>).</li> <li>LabelEncoder CUDA support for numeric types (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28045">#28045</a>).</li> </ul> </li> <li>Attention/GQA improvements: <ul> <li>Fixed ONNX Attention min-bias alignment crash on SM<80 and masked-batch NaN behavior (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27831">#27831</a>).</li> <li>Added FP32 QK accumulation path for unfused GQA attention (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28198">#28198</a>).</li> <li>Added CUDART_VERSION reduction compatibility in GQA attention (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28296">#28296</a>).</li> <li>Fixed CUDA 13 build error in GQA unfused attention (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28309">#28309</a>).</li> <li>PagedAttention fallback for SM<80 fp16 (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/28200">#28200</a>).</li> </ul> </li> <li>MLAS updates: <ul> <li>FP16 Gelu enablement (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/26815">#26815</a>).</li> <li>Arm64 BF16 fast-math conv kernels for NCHW/NCHWc paths (<a href="https://redirect.github.com/microsoft/onnxruntime/pull/27878">#27878</a>).</li> </ul> </li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... 
(truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="https://github.com/microsoft/onnxruntime/commit/8c546c37b43caaca1fa25db430dab94b901cf277"><code>8c546c3</code></a> 1.26.0 - cherry-pick for RC2 (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/28347">#28347</a>)</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/55c5c822a02e6237a667c13cd5ed008cd55bb459"><code>55c5c82</code></a> GridSample: harden float->int64 casts against NaN/Inf/out-of-range coords (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/2">#2</a>...</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/60ce9ccd65ebb1794110c4dddf2e110cf223e9f9"><code>60ce9cc</code></a> Relax GQA seqlens_k shape validation for backward compat with older models (#...</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/d02a0fd536d7cf6b88bfd268932d2933a1c3305c"><code>d02a0fd</code></a> Fix DoubleQDQPairsRemover adding spurious dimension to scalar scale/zero-poin...</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/9b30f306e40683df362249b7c139177cc83e352f"><code>9b30f30</code></a> remove weights_are_all_positive_ from TreeEnsemble (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/27552">#27552</a>)</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/5f2f848a6874d390c94502b3bb424447b119b975"><code>5f2f848</code></a> fix(ci): incorrect relative template includes for setup-feeds (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/28312">#28312</a>)</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/de2bc90ce09d8bab419e07e1db741f445a27f721"><code>de2bc90</code></a> Add QNN Plugin EP repo link to README (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/28225">#28225</a>)</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/8dd4a06811179a7c04aea0a97c65f45a3dc8c6af"><code>8dd4a06</code></a> Include license file in built 
distributions (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/27783">#27783</a>)</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/6e19374beb24cc2fa0e8262c6d51404af87aefc0"><code>6e19374</code></a> Fix CUDA 13 build error in gqa_unfused_attention.cu (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/28309">#28309</a>)</li> <li><a href="https://github.com/microsoft/onnxruntime/commit/d6c363c0d192bf2205155a18f01a62efa6b34da1"><code>d6c363c</code></a> [OVEP] OpenVINO EP 1.26.0 Development Release Updates (<a href="https://redirect.github.com/microsoft/onnxruntime/issues/28297">#28297</a>)</li> <li>Additional commits viewable in <a href="https://github.com/microsoft/onnxruntime/compare/v1.25.1...v1.26.0">compare view</a></li> </ul> </details> <br /> Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) </details>
