Add wait to MKL calls #518

michel2323 · 2025-08-22T19:11:08Z

We observed norm()=0 on oneAPI arrays when it shouldn't. Upon review, it seems one needs a wait after each MKL call. They support event dependencies etc.

We have still issues with correctness of axpy in https://github.com/JuliaSmoothOptimizers/KrylovPreconditioners.jl

github-actions · 2025-08-22T19:11:36Z

Your PR requires formatting changes to meet the project's style guidelines.
Please consider running Runic (git runic master) to apply these changes.

Click here to view the suggested changes.

diff --git a/deps/generate_interfaces.jl b/deps/generate_interfaces.jl
index beb87ea..9e7bd3f 100644
--- a/deps/generate_interfaces.jl
+++ b/deps/generate_interfaces.jl
@@ -447,17 +447,17 @@ function generate_cpp(library::String, filename::Vector{String}, output::String;
     write(oneapi_cpp, "extern \"C\" $header {\n")
     if template
       type = version_types[version]
-      !occursin("scratchpad_size", name) && write(oneapi_cpp, "   auto status = oneapi::mkl::$library::$variant$name<$type>($parameters, {});\n   device_queue->val.wait_and_throw();\n")
-      occursin("scratchpad_size", name)  && write(oneapi_cpp, "   int64_t scratchpad_size = oneapi::mkl::$library::$variant$name<$type>($parameters);\n   device_queue->val.wait_and_throw();\n")
-      # !occursin("scratchpad_size", name) && write(oneapi_cpp, "   auto status = oneapi::mkl::$library::$variant$name<$type>($parameters, {});\n")
-      # occursin("scratchpad_size", name)  && write(oneapi_cpp, "   int64_t scratchpad_size = oneapi::mkl::$library::$variant$name<$type>($parameters);\n")
+            !occursin("scratchpad_size", name) && write(oneapi_cpp, "   auto status = oneapi::mkl::$library::$variant$name<$type>($parameters, {});\n   device_queue->val.wait_and_throw();\n")
+            occursin("scratchpad_size", name)  && write(oneapi_cpp, "   int64_t scratchpad_size = oneapi::mkl::$library::$variant$name<$type>($parameters);\n   device_queue->val.wait_and_throw();\n")
+            # !occursin("scratchpad_size", name) && write(oneapi_cpp, "   auto status = oneapi::mkl::$library::$variant$name<$type>($parameters, {});\n")
+            # occursin("scratchpad_size", name)  && write(oneapi_cpp, "   int64_t scratchpad_size = oneapi::mkl::$library::$variant$name<$type>($parameters);\n")
     else
       if !(name ∈ void_output)
         write(oneapi_cpp, "   auto status = oneapi::mkl::$library::$variant$name($parameters, {});\n")
-        occursin("device_queue", parameters) && write(oneapi_cpp, "   device_queue->val.wait_and_throw();\n")
+                occursin("device_queue", parameters) && write(oneapi_cpp, "   device_queue->val.wait_and_throw();\n")
       else
         write(oneapi_cpp, "   oneapi::mkl::$library::$variant$name($parameters);\n")
-        occursin("device_queue", parameters) && write(oneapi_cpp, "   device_queue->val.wait_and_throw();\n")
+                occursin("device_queue", parameters) && write(oneapi_cpp, "   device_queue->val.wait_and_throw();\n")
       end
     end
     if occursin("scratchpad_size", name)

deps/generate_interfaces.jl

codecov · 2025-08-22T20:08:15Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 81.73%. Comparing base (115a10f) to head (2da6dde).
⚠️ Report is 1 commits behind head on master.

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #518   +/-   ##
=======================================
  Coverage   81.73%   81.73%           
=======================================
  Files          44       44           
  Lines        2540     2540           
=======================================
  Hits         2076     2076           
  Misses        464      464

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

maleadt · 2025-09-01T04:55:59Z

Upon review, it seems one needs a wait after each MKL call. They support event dependencies etc.

Can you elaborate? Making these calls synchronous is not something we want, and synchronization should be handled on the Julia side already when accessing the destination oneArray. Doing this eagerly risks introducing execution bubbles and killing performance.

michel2323 · 2025-09-02T15:45:39Z

Oh ok. We observed wrong values in Krylov.jl when multiple MKL calls are called in series. Upon reviewing the Intel MKL examples, it appears that the MKL calls are asynchronous and not ordered in the SYCL queue. You'd need to synchronize with the returned event. And the event is not returned to Julia and can't be passed to the following MKL calls. Do you think we should do that then?

michel2323 · 2025-09-02T16:05:57Z

oneAPI.jl/lib/level-zero/cmdqueue.jl

Line 13 in 48d1750

flags=0,

The default flag flags=0 is out-of-order. Do you think we should change this to in order? @maleadt

maleadt · 2025-09-03T06:46:38Z

enumerator ZE_COMMAND_QUEUE_FLAG_IN_ORDER

To be used only when creating immediate command lists. Commands appended to the immediate command list are executed in-order, with driver implementation enforcing dependencies between them. Application is not required to have the signal event of a given command being the wait event of the next to define an in-order list, and application is allowed to pass signal and wait events to each appended command to implement more complex dependency graphs.

It does seem enticing, but the "To be used only when creating immediate command lists" doesn't apply here, so I'm not sure. Maybe we should ping some people at Intel, or open some issue upstream to figure out what's the best way to emulate CUDA's stream-ordered operations without having to use events everywhere.

cc @kballeda

Add wait to MKL calls

3ebc1df

michel2323 requested a review from amontoison August 22, 2025 19:11

amontoison reviewed Aug 22, 2025

View reviewed changes

deps/generate_interfaces.jl Show resolved Hide resolved

More fixes

2da6dde

amontoison approved these changes Aug 23, 2025

View reviewed changes

amontoison merged commit cf05fb5 into master Aug 23, 2025
2 checks passed

amontoison deleted the ms/wait branch August 23, 2025 07:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add wait to MKL calls #518

Add wait to MKL calls #518

Uh oh!

michel2323 commented Aug 22, 2025

Uh oh!

github-actions bot commented Aug 22, 2025 •

edited

Loading

Uh oh!

Uh oh!

codecov bot commented Aug 22, 2025 •

edited

Loading

Uh oh!

Uh oh!

maleadt commented Sep 1, 2025

Uh oh!

michel2323 commented Sep 2, 2025 •

edited

Loading

Uh oh!

michel2323 commented Sep 2, 2025 •

edited

Loading

Uh oh!

maleadt commented Sep 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add wait to MKL calls #518

Add wait to MKL calls #518

Uh oh!

Conversation

michel2323 commented Aug 22, 2025

Uh oh!

github-actions bot commented Aug 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

codecov bot commented Aug 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

maleadt commented Sep 1, 2025

Uh oh!

michel2323 commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

michel2323 commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

maleadt commented Sep 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

github-actions bot commented Aug 22, 2025 •

edited

Loading

codecov bot commented Aug 22, 2025 •

edited

Loading

michel2323 commented Sep 2, 2025 •

edited

Loading

michel2323 commented Sep 2, 2025 •

edited

Loading