-
Notifications
You must be signed in to change notification settings - Fork 796
[SYCL][COMPAT] Add max_active_work_groups_per_cu #15802
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
I guess previously only being used places where already included
Related functionality already in this file
sycl/test-e2e/syclcompat/util/max_active_work_groups_per_cu.cpp
Outdated
Show resolved
Hide resolved
| size_t max_wgs = kernel.template ext_oneapi_get_info< | ||
| syclex::info::kernel_queue_specific::max_num_work_groups>( | ||
| q, sycl::range<3>{syclcompat::dim3{wg_range}}, local_mem_size); | ||
| assert(max_per_cu == max_wgs / max_compute_units); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do we expect NDEBUG to not to defined during testing ?
Does the CI build and test DPCPP with a specific configuration or is the build type string left empty ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, asserts work as expected during testing. llvm-lit will test on as many backends as are available, AFAIK.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks
Also reorder params to match the rest of syclcompat
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM 🚢
|
@intel/llvm-gatekeepers this is ready to merge 🙏 |
This new API returns the maximum work-groups which can occupy a single compute unit of the given device.