Skip to content

⬆️ Update dependency onnxruntime-web to ^1.24.2#1204

Draft
renovate[bot] wants to merge 1 commit intomainfrom
renovate/onnxruntime-web-1.x
Draft

⬆️ Update dependency onnxruntime-web to ^1.24.2#1204
renovate[bot] wants to merge 1 commit intomainfrom
renovate/onnxruntime-web-1.x

Conversation

@renovate
Copy link
Contributor

@renovate renovate bot commented Feb 9, 2026

This PR contains the following updates:

Package Change Age Confidence
onnxruntime-web ^1.22.0^1.24.2 age confidence

Release Notes

Microsoft/onnxruntime (onnxruntime-web)

v1.24.2: ONNX Runtime v1.24.2

Compare Source

This is a patch release for ONNX Runtime 1.24, containing several bug fixes, security improvements, and execution provider updates.

Bug Fixes

  • NuGet: Fixed native library loading issues in the ONNX Runtime NuGet package on Linux and macOS. (#​27266)
  • macOS: Fixed Java support and Jar testing on macOS ARM64. (#​27271)
  • Core: Enable Robust Symlink Support for External Data for Huggingface Hub Cache. (#​27374)
  • Core: Added boundary checks for SparseTensorProtoToDenseTensorProto to improve robustness. (#​27323)
  • Security: Fixed an out-of-bounds read vulnerability in ArrayFeatureExtractor. (#​27275)

Execution Provider Updates

  • MLAS: Fixed flakiness and accuracy issues in Lut GEMM (MatMulNBitsLutGemm). (#​27216)
  • QNN: Enabled 64-bit UDMA mode for HTP target v81 or above. (#​26677)
  • WebGPU:
    • Used LazyRelease for prepack allocator. (#​27077)
    • Fixed ConvTranspose bias validation in both TypeScript and C++ implementations. (#​27213)
  • OpenVINO (OVEP): Patch to reduce resident memory by reusing weight files across shared contexts. (#​27238)
  • DNNL: Fixed DNNL build error by including missing files. (#​27334)

Build and Infrastructure

  • CUDA:
    • Added support for CUDA architecture family codes (suffix 'f') introduced in CUDA 12.9. (#​27278)
    • Fixed build errors and warnings for various CUDA versions (12.8, 13.0, 13.1.1). (#​27276)
    • Applied patches for Abseil CUDA warnings. (#​27096, #​27126)
  • Pipelines:
    • Fixed Python packaging pipeline for Windows ARM64 and release. (#​27339, #​27350, #​27299)
    • Fixed DirectML NuGet pipeline to correctly bundle x64 and ARM64 binaries for release. (#​27349)
    • Updated Microsoft.ML.OnnxRuntime.Foundry package for Windows ARM64 support and NuGet signing. (#​27294)
  • Testing: Updated BaseTester to support plugin EPs with both compiled nodes and registered kernels. (#​27176)
  • Telemetry: Added service name and framework name to telemetry events for better usage understanding on Windows. (#​27252, #​27256)

Full Changelog: v1.24.1...v1.24.2

Contributors

@​tianleiwu, @​hariharans29, @​edgchen1, @​xiaofeihan1, @​adrianlizarraga, @​angelser, @​angelserMS, @​ankitm3k, @​baijumeswani, @​bmehta001, @​ericcraw, @​eserscor, @​fs-eire, @​guschmue, @​mc-nv, @​qjia7, @​qti-monumeen, @​titaiwangms, @​yuslepukhin

v1.24.1: ONNX Runtime v1.24.1

Compare Source

📢 Announcements & Breaking Changes

Platform Support Changes
  • Python 3.10 wheels are no longer published — Please upgrade to Python 3.11+
  • Python 3.14 support added
  • Free-threaded Python (PEP 703) — Added support for Python 3.13t and 3.14t in Linux (#​26786)
  • x86_64 binaries for macOS/iOS are no longer provided and minimum macOS is raised to 14.0
API Version

✨ New Features

🤖 Execution Provider (EP) Plugin API

A major infrastructure enhancement enabling plugin-based EPs with dynamic loading:

  • Initial kernel-based EP support (#​26206)
  • Weight pre-packing support for plugin EPs (#​26754)
  • EP Context model support (#​25124)
  • Control flow kernel APIs (#​26927)
  • OrtKernelInfo APIs for kernel-based plugin EPs (#​26803)
🔧 Core APIs
  • OrtApi::CreateEnvWithOptions() and OrtEpApi::GetEnvConfigEntries() (#​26971)
  • EP Device Compatibility APIs (#​26922)
  • External Resource Importer API for D3D12 shared resources (#​26828)
  • Session config access from KernelInfo (#​26589)
📊 Dependencies & Integration

🖥️ Execution Provider Updates

NVIDIA
  • CUDA EP: Flash Attention updates, GQA kernel fusion, BF16 support for MoE/qMoE/MatMulNBits, CUDA 13.0 support
  • TensorRT EP: Upgraded to TensorRT 10.14, automatic plugin loading, NVFP4 custom ops
  • TensorRT RTX EP: RTX runtime caching, CUDA graph support, BFloat16, memory-mapped engines
Qualcomm QNN EP
  • QNN SDK upgraded to 2.42.0 with new ops (RMSNorm, ScatterElements, GatherND, STFT, RandomUniformLike)
  • Gelu pattern fusion, LPBQ quantization support, ARM64 wheel builds, v81 device support
Intel & AMD
  • OpenVINO EP: Upgraded to 2025.4.1
  • VitisAI EP: External EP loader, compiled model compatibility API
  • MIGraphX EP: QuickGelu, multihead attention, QLinear pooling ops
ArmNN EP

Arm is formally deprecating the Arm NN Execution Provider (EP) in ONNX Runtime. The Arm NN EP is still experimental and depends on technology that is no longer actively maintained. Keeping it available now only adds complexity and potential confusion for users.

What to expect:

  • Effective immediately, the Arm NN EP is deprecated and will no longer be maintained
  • All build options, documentation, and examples referencing ArmNN will be removed once the upstream change merges; the removal will appear in the first ONNX Runtime release that includes that change. We will confirm the release number as soon as it is known
  • Builds that still rely on Arm NN-specific options (for example --use_armnn) will fail after the change lands, so please adjust configurations in advance

🌐 Web & JavaScript

  • WebGPU EP: Flash Attention optimizations, graph capture, Split-K MatMul, qMoE support, WGSL templates
  • WebNN EP: GQA local attention, GatherBlockQuantized, ConvInteger/MatMulInteger
  • Node.js/React Native: Node.js v22, JSI for React Native, JSPI build support

🧠 CPU Improvements

  • KleidiAI: SME1/SME2 Convolution and SGemm kernels, FP32 Gemv, Windows/Arm support
  • New ops: MoE/qMoE kernels, RotaryEmbeddings opset 23, LayerNorm/RMSNorm broadcasting
  • Platform support: S390x SIMD, LoongArch64 4-bit quantization, FP16 inference improvements
  • ARM NCHWc layout support: NCHWc layout support for potential performance improvement of Conv models. Needs building from source with --enable_arm_neon_nchwc to enable this feature (#​25580 #​26838 #​26691 #​26171). This feature may be turned ON by default in a future release based on community feedback.
  • ARM perf improvements: Dedicated depthwise conv kernel (#​26688) and SiLU activation perf improvement (#​26753)

🔌 Language Bindings

C#
Python
  • add_external_initializers_from_files (#​26012)
Java

🐛 Bug Fixes

Critical Fixes
  • DoS vulnerability in FuseReluClip (#​26878)
  • Security issue loading arbitrary files as external data (#​26776)
  • Memory leak fix for KernelContext_GetAllocator (#​26883)
  • Local Attention off-by-1 bug (#​25927)
EP-Specific Fixes
  • [QNN] Clip op with min/max from QDQ (#​26601)
  • [CoreML] Gather fp16 support (#​26442)

🙏 Contributors

Thanks to our 170 contributors for this release!

@​fs-eire, @​tianleiwu, @​edgchen1, @​qjia7, @​yuslepukhin, @​hariharans29, @​Honry, @​qti-yuduo, @​adrianlizarraga, @​snnn, @​eserscor, @​vraspar, @​xiaofeihan1, @​guschmue, @​daijh, @​quic-muchhsu, @​qti-jkilpatrick, @​tirupath-qti, @​Jiawei-Shao, @​qti-hungjuiw, @​quic-ashwshan, @​titaiwangms, @​qti-mattsinc, @​chilo-ms, @​jchen10, @​xhcao, @​skottmckay, @​quic-calvnguy, @​JonathanC-ARM, @​Rohanjames1997, @​sushraja-msft, @​jambayk, @​adrastogi, @​xenova, @​quic-tirupath, @​justinchuby, @​HectorSVC, @​kunal-vaishnavi, @​wenqinI, @​prathikr, @​baijumeswani, @​preetha-intel, @​jatinwadhwa921, @​umangb-09, @​qti-ashwshan, @​carzh, @​bachelor-dou, @​ranjitshs, @​gedoensmax, @​xadupre, @​nenad1002, @​TedThemistokleous, @​keshavv27, @​zpye, @​jnagi-intel, @​jiafatom, @​mingyueliuh, @​Colm-in-Arm, @​borg323, @​chunghow-qti, @​Craigacp, @​BODAPATIMAHESH, @​AlekseiNikiforovIBM, @​hans00, @​thevishalagarwal, @​MaanavD, @​qti-kromero, @​damdoo01-arm, @​BoarQing, @​naomiOvad, @​yuhuchua-qti, @​hadiFute, @​vishalpandya1990, @​rivkastroh, @​minfhong-qti, @​kuanyul-qti, @​xieofxie, @​ankitm3k, @​RyanMetcalfeInt8, @​MayureshV1, @​bopeng1234, @​vthaniel, @​mdvoretc-intel, @​ericcraw, @​javier-intel, @​saurabhkale17, @​sfatimar, @​Kotomi-Du, @​intbf, @​n1harika, @​TejalKhade28, @​gupta-pallavi, @​cbourjau, @​nieubank, @​r-devulap, @​wszqkzqk, @​sanketkaleoss, @​amancini-N, @​fanchenkong1, @​meakbiyik, @​hisham-hchowdhu, @​shaoboyan091, @​Stonesjtu, @​qwu16, @​wangw-1991, @​bonktree, @​naetherm, @​nikhilfujitsu, @​Panxuefeng-loongson, @​selenayang888, @​moyo1997, @​chwarr, @​patryk-kaiser-ARM, @​fdwr, @​SavaLione, @​shiyi9801, @​mcost45, @​aciddelgado, @​prudhvi-qti, @​Jonahcb, @​lifang-zhang, @​zhaoxul-qti, @​gaugarg-nv, @​cocotdf, @​WangFengtu1996, @​orlmon01, @​weidu-tpvision, @​theHamsta, @​kevinch-nv, @​XXXXRT666, @​movedancer, @​melkap01-Arm, @​KingSora, @​urpetkov-amd, @​junchao-loongson, @​jixiongdeng, @​wcy123, @​GrigoryEvko, @​anujj, @​peishenyan, @​quic-ankus, @​jchen351, @​yihonglyu, @​satyajandhyala, @​co63oc, @​mschofie, @​quic-ashigarg, @​asoldano, @​nproshun, @​jiangzhaoming, @​seungtaek94, @​liqunfu, @​jaholme, @​hanbitmyths, @​quic-boyuc, @​rM-planet, @​qti-vaiskv, @​AndreyOrb, @​pkubaj, @​xhan65, @​Jaswanth51, @​quic-hungjuiw, @​jywu-msft, @​mklimenk, @​derdeljan-msft, @​ianfhunter, @​NingW101, @​feich-ms, @​Akupadhye, @​wschin


Full Changelog: v1.23.2...rel-1.24.1

v1.23.2: ONNX Runtime v1.23.2

Compare Source

v1.23.0: ONNX Runtime v1.23.0

Compare Source

Announcements

  • This release introduces Execution Provider (EP) Plugin API, which is a new infrastructure for building plugin-based EPs. (#​24887 , #​25137, #​25124, #​25147, #​25127, #​25159, #​25191, #​2524)

  • This release introduces the ability to dynamically download and install execution providers. This feature is exclusively available in the WinML build and requires Windows 11 version 25H2 or later. To leverage this new capability, C/C++/C# users should use the builds distributed through the Windows App SDK, and Python users should install the onnxruntime-winml package(will be published soon). We encourage users who can upgrade to the latest Windows 11 to utilize the WinML build to take advantage of this enhancement.

Upcoming Changes

  • The next release will stop providing x86_64 binaries for macOS and iOS operating systems.
  • The next release will increase the minimum supported macOS version from 13.4 to 14.0.
  • The next release will stop providing python 3.10 wheels.

Execution & Core Optimizations

Shutdown logic on Windows is simplified

Now on Windows some global object will be not destroyed if we detect that the process is being shutting down(#​24891) . It will not cause memory leak as when a process ends all the memory will be returned to the operating system. This change can reduce the chance of having crashes on process exit.

AutoEP/Device Management

Now ONNX Runtime has the ability to automatically discovery computing devices and select the best EPs to download and register. The EP downloading feature currently only works on Windows 11 version 25H2 or later.

Execution Provider (EP) Updates

ROCM EP was removed from the source tree. Users are recommended to use Migraphx or Vitis AI EPs from AMD.
A new EP, Nvidia TensorRT RTX, was added.

Web

EMDSK is upgraded from 4.0.4 to 4.0.8

WebGPU EP

Added WGSL template support.

QNN EP

SDK Update: Added support for QNN SDK 2.37.

KleidiAI

Enhanced performance for SGEMM, IGEMM, and Dynamic Quantized MatMul operations, especially for Conv2D operators on hardware that supports SME2 (Scalable Matrix Extension v2).

Known Problems

  • There was a change in build.py that was related to KleidiAI that may cause build failures when doing cross-compiling (#​26175) .

Contributions

Contributors to ONNX Runtime include members across teams at Microsoft, along with our community members:

@​1duo, @​Akupadhye, @​amarin16, @​AndreyOrb, @​ankan-ban, @​ankitm3k, @​anujj, @​aparmp-quic, @​arnej27959, @​bachelor-dou, @​benjamin-hodgson, @​Bonoy0328, @​chenweng-quic, @​chuteng-quic, @​clementperon, @​co63oc, @​daijh, @​damdoo01-arm, @​danyue333, @​fanchenkong1, @​gedoensmax, @​genarks, @​gnedanur, @​Honry, @​huaychou, @​ianfhunter, @​ishwar-raut1, @​jing-bao, @​joeyearsley, @​johnpaultaken, @​jordanozang, @​JulienMaille, @​keshavv27, @​kevinch-nv, @​khoover, @​krahenbuhl, @​kuanyul-quic, @​mauriciocm9, @​mc-nv, @​minfhong-quic, @​mingyueliuh, @​MQ-mengqing, @​NingW101, @​notken12, @​omarhass47, @​peishenyan, @​pkubaj, @​qc-tbhardwa, @​qti-jkilpatrick, @​qti-yuduo, @​quic-ankus, @​quic-ashigarg, @​quic-ashwshan, @​quic-calvnguy, @​quic-hungjuiw, @​quic-tirupath, @​qwu16, @​ranjitshs, @​saurabhkale17, @​schuermans-slx, @​sfatimar, @​stefantalpalaru, @​sunnyshu-intel, @​TedThemistokleous, @​thevishalagarwal, @​toothache, @​umangb-09, @​vatlark, @​VishalX, @​wcy123, @​xhcao, @​xuke537, @​zhaoxul-qti


Configuration

📅 Schedule: Branch creation - Between 09:00 PM and 11:59 PM, only on Sunday ( * 21-23 * * 7 ), Between 12:00 AM and 04:59 PM, only on Monday ( * 0-16 * * 1 ) (UTC), Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.

@renovate renovate bot temporarily deployed to #1204: App preview February 9, 2026 02:41 Inactive
@renovate renovate bot temporarily deployed to #1204: Bundle breakdown February 9, 2026 08:54 Inactive
@github-actions
Copy link
Contributor

github-actions bot commented Feb 9, 2026

Bundle Stats

Hey there, this message comes from a GitHub action that helps you and reviewers to understand how these changes affect the size of this project's bundle.

As this PR is updated, I'll keep you updated on how the bundle size is impacted.

Total

Files count Total bundle size % Changed
231 46.78 MB → 52.61 MB (+5.83 MB)
11.58 MB → 12.84 MB (gzip)
+12.45%
Changeset
File Δ Size
(sidepanel)/import/+page.svelte 🆕 +1.63 kB 0 B → 1.63 kB
ort.bundle.min.mjs 📈 +2.81 kB (+0.71%) 393.58 kB → 396.39 kB
node_modules/onnxruntime-web/dist/ort.bundle.min.mjs 📈 +2.26 kB (+0.58%) 387.1 kB → 389.36 kB
piexifjs/piexif.js 📉 -8 B (-0.03%) 30.49 kB → 30.48 kB
parser.js 📉 -8 B (-0.03%) 26.21 kB → 26.2 kB
compiler.js 📉 -4 B (-0.05%) 7.84 kB → 7.83 kB
language.js 📉 -26 B (-0.06%) 41.96 kB → 41.93 kB
storage.ts 📉 -2 B (-0.07%) 2.99 kB → 2.99 kB
resolve-block-scalar.js 📉 -2 B (-0.07%) 2.93 kB → 2.92 kB
whitespace-control.js 📉 -2 B (-0.08%) 2.58 kB → 2.58 kB
stringifyString.js 📉 -4 B (-0.08%) 4.88 kB → 4.87 kB
source-map-[hash].js 📉 -12 B (-0.10%) 11.78 kB → 11.76 kB
browser/index.js 📉 -745 B (-0.10%) 720.81 kB → 720.09 kB
source-node.js 📉 -7 B (-0.18%) 3.7 kB → 3.69 kB
stringifyPair.js 📉 -4 B (-0.19%) 2.03 kB → 2.02 kB
resolve-props.js 📉 -5 B (-0.23%) 2.12 kB → 2.11 kB
javascript-[hash].js 📉 -38 B (-0.24%) 15.39 kB → 15.35 kB
resolve-flow-collection.js 📉 -9 B (-0.25%) 3.49 kB → 3.48 kB
results.ts 📉 -28 B (-0.25%) 10.73 kB → 10.7 kB
resolve-block-map.js 📉 -6 B (-0.28%) 2.08 kB → 2.07 kB
browser.js 📉 -41 B (-0.34%) 11.83 kB → 11.79 kB
handlebars.runtime.js 📉 -4 B (-0.44%) 911 B → 907 B
handlebars.js 📉 -4 B (-0.47%) 847 B → 843 B
server.js 📉 -20 B (-0.65%) 3.03 kB → 3.01 kB
inference_utils.js 📉 -34 B (-0.94%) 3.53 kB → 3.5 kB
each.js 📉 -13 B (-1.02%) 1.24 kB → 1.23 kB
foldFlowLines.js 📉 -15 B (-1.06%) 1.39 kB → 1.37 kB
runtime.js 📉 -74 B (-1.21%) 5.95 kB → 5.88 kB
util.js 📉 -98 B (-2.62%) 3.65 kB → 3.55 kB
routes/(app)/(sidepanel)/import/+page.svelte 🔥 -1.63 kB (-100%) 1.63 kB → 0 B
decorators/inline.js 🔥 -368 B (-100%) 368 B → 0 B
View detailed bundle breakdown

Added

Asset File Size % Changed
_app/immutable/assets/ort-wasm-simd-threaded.jsep.CVw3nYo7.wasm 0 B → 23.77 MB (+23.77 MB)
0 B → 5.55 MB (gzip)
-
_app/immutable/nodes/16.BP6lDlgA.js 0 B → 2.67 kB (+2.67 kB)
0 B → 1.48 kB (gzip)
-

Removed

Asset File Size % Changed
_app/immutable/assets/ort-wasm-simd-threaded.jsep.CLPRrI3A.wasm 20.86 MB → 0 B (-20.86 MB)
4.92 MB → 0 B (gzip)
-100%
_app/immutable/nodes/16.pim-up1R.js 2.67 kB → 0 B (-2.67 kB)
1.48 kB → 0 B (gzip)
-100%

Bigger

Asset File Size % Changed
_app/immutable/workers/assets/ort-wasm-simd-threaded.jsep-[hash].wasm 20.86 MB → 23.77 MB (+2.91 MB)
4.92 MB → 5.55 MB (gzip)
+13.96%
_app/immutable/chunks/Bd7_h5c3.js 389.89 kB → 392.16 kB (+2.26 kB)
107.59 kB → 108.12 kB (gzip)
+0.58%
_app/immutable/workers/start-[hash].js 1.77 MB → 1.77 MB (+1.57 kB)
605.37 kB → 605.76 kB (gzip)
+0.09%

Smaller

No assets were smaller

@renovate renovate bot temporarily deployed to #1204: E2E tests February 9, 2026 09:06 Inactive
@gwennlbh
Copy link
Member

gwennlbh commented Feb 9, 2026

gotta test if using this version fixes eventual bugs with the yolo26 model, if not, dont upgrade cuz of bundle size

@gwennlbh gwennlbh marked this pull request as draft February 9, 2026 10:08
@renovate renovate bot force-pushed the renovate/onnxruntime-web-1.x branch from 14feb4a to 15a3de4 Compare February 12, 2026 10:35
@renovate renovate bot temporarily deployed to #1204: App preview February 12, 2026 10:38 Inactive
@renovate renovate bot temporarily deployed to #1204: Bundle breakdown February 12, 2026 10:45 Inactive
@renovate renovate bot temporarily deployed to #1204: E2E tests February 12, 2026 10:51 Inactive
@renovate renovate bot force-pushed the renovate/onnxruntime-web-1.x branch from 15a3de4 to 1daabef Compare February 17, 2026 16:15
@renovate renovate bot temporarily deployed to #1204: Bundle breakdown February 17, 2026 16:27 Inactive
@renovate renovate bot temporarily deployed to #1204: E2E tests February 17, 2026 16:34 Inactive
@renovate renovate bot force-pushed the renovate/onnxruntime-web-1.x branch from 1daabef to 02bbb74 Compare February 19, 2026 18:23
@renovate renovate bot changed the title ⬆️ Update dependency onnxruntime-web to ^1.24.1 ⬆️ Update dependency onnxruntime-web to ^1.24.2 Feb 19, 2026
@renovate renovate bot temporarily deployed to #1204: App preview February 19, 2026 18:26 Inactive
@renovate renovate bot temporarily deployed to #1204: Bundle breakdown February 19, 2026 18:33 Inactive
@renovate renovate bot temporarily deployed to #1204: E2E tests February 19, 2026 18:52 Inactive
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant

Comments