[UR] Stop querying adapter fp16/fp64 support via extension. #15811

aarongreig · 2024-10-22T14:27:13Z

We're trying to move the UR adapters away from returning hard coded OpenCL extension strings to report device capabilities, this is the first change in that direction.

Closes oneapi-src/unified-runtime#1374

instead of checking separately

Not a member of llvm-reviewers-runtime anymore, need someone else re-approve.

aelovikov-intel · 2025-08-01T16:02:44Z

sycl/source/detail/device_impl.hpp

+  // Check if the device supports double precision floating point.
+  bool isFp64Supported() const;
+
+  // Check if the device supports half precision floating point.
+  bool isFp16Supported() const;
+


Please don't add extra methods. Existing interfaces should be enough, just change their implementation to the proper calls to the underlying UR queries.

aelovikov-intel · 2025-08-01T16:04:57Z

sycl/source/detail/device_impl.hpp

-    CASE(fp16) { return has_extension("cl_khr_fp16"); }
-    CASE(fp64) { return has_extension("cl_khr_fp64"); }


This relied on the internal caching of the extensions string, we need to ensure that new underlying query is cached too. See

llvm/sycl/source/detail/device_impl.hpp

Lines 2265 to 2301 in 76a887e

mutable JointCache<

UREagerCache<UR_DEVICE_INFO_TYPE, UR_DEVICE_INFO_USE_NATIVE_ASSERT,

UR_DEVICE_INFO_EXTENSIONS>, //

URCallOnceCache<UR_DEVICE_INFO_NAME,

// USM:

UR_DEVICE_INFO_USM_DEVICE_SUPPORT,

UR_DEVICE_INFO_USM_HOST_SUPPORT,

UR_DEVICE_INFO_USM_SINGLE_SHARED_SUPPORT,

UR_DEVICE_INFO_USM_CROSS_SHARED_SUPPORT,

UR_DEVICE_INFO_USM_SYSTEM_SHARED_SUPPORT,

//

UR_DEVICE_INFO_ATOMIC_64>, //

EagerCache<InfoInitializer>, //

CallOnceCache<InfoInitializer,

ext::oneapi::experimental::info::device::architecture>, //

AspectCache<EagerCache, aspect::fp16, aspect::fp64,

aspect::int64_base_atomics, aspect::int64_extended_atomics,

aspect::ext_oneapi_atomic16>,

AspectCache<

CallOnceCache,

// Slow, >100ns (for baseline cached ~30..40ns):

aspect::ext_intel_pci_address, aspect::ext_intel_gpu_eu_count,

aspect::ext_intel_free_memory, aspect::ext_intel_fan_speed,

aspect::ext_intel_power_limits,

// medium-slow, 60-90ns (for baseline cached ~30..40ns):

aspect::ext_intel_gpu_eu_simd_width, aspect::ext_intel_gpu_slices,

aspect::ext_intel_gpu_subslices_per_slice,

aspect::ext_intel_gpu_eu_count_per_subslice,

aspect::ext_intel_device_info_uuid,

aspect::ext_intel_gpu_hw_threads_per_eu,

aspect::ext_intel_memory_clock_rate,

aspect::ext_intel_memory_bus_width,

aspect::ext_oneapi_bindless_images,

aspect::ext_oneapi_bindless_images_1d_usm,

aspect::ext_oneapi_bindless_images_2d_usm,

aspect::ext_oneapi_is_composite, aspect::ext_oneapi_is_component>>

MCache;

[UR] Stop querying adapter fp16/fp64 support via extension.

32957aa

aarongreig requested review from a team as code owners October 22, 2024 14:27

aarongreig requested a review from bso-intel October 22, 2024 14:27

aarongreig had a problem deploying to WindowsCILock October 22, 2024 14:28 — with GitHub Actions Failure

aarongreig mentioned this pull request Oct 22, 2024

Report device fp support via config rather than extension string. oneapi-src/unified-runtime#2231

Closed

Simplify device info helpers

5e66ecc

aarongreig had a problem deploying to WindowsCILock October 22, 2024 15:52 — with GitHub Actions Failure

Merge branch 'sycl' into aaron/stopReportingFPExtensions

8720fbe

aarongreig had a problem deploying to WindowsCILock October 24, 2024 10:13 — with GitHub Actions Failure

aarongreig temporarily deployed to WindowsCILock October 24, 2024 10:41 — with GitHub Actions Inactive

aarongreig added 2 commits October 28, 2024 10:22

Rely on empty bitfield to report no type support

368a9e8

instead of checking separately

Merge branch 'sycl' into aaron/stopReportingFPExtensions

d789703

aarongreig had a problem deploying to WindowsCILock October 29, 2024 13:55 — with GitHub Actions Error

Revert change made for testing.

5358def

aarongreig temporarily deployed to WindowsCILock October 29, 2024 13:58 — with GitHub Actions Inactive

aarongreig temporarily deployed to WindowsCILock October 29, 2024 14:33 — with GitHub Actions Inactive

bso-intel previously approved these changes Nov 7, 2024

View reviewed changes

Merge branch 'sycl' into aaron/stopReportingFPExtensions

b3b7153

aarongreig temporarily deployed to WindowsCILock January 22, 2025 15:45 — with GitHub Actions Inactive

aarongreig temporarily deployed to WindowsCILock January 22, 2025 16:41 — with GitHub Actions Inactive

aarongreig added 2 commits January 23, 2025 14:17

Clean up some remaining uses of the old extension string.

fba0498

Merge branch 'sycl' into aaron/stopReportingFPExtensions

3049632

aarongreig had a problem deploying to WindowsCILock January 23, 2025 14:19 — with GitHub Actions Failure

aarongreig temporarily deployed to WindowsCILock January 23, 2025 14:49 — with GitHub Actions Inactive

Fix unit tests.

ee0fc6c

aarongreig had a problem deploying to WindowsCILock January 23, 2025 15:56 — with GitHub Actions Failure

Merge branch 'sycl' into aaron/stopReportingFPExtensions

189bf35

aarongreig temporarily deployed to WindowsCILock February 4, 2025 16:07 — with GitHub Actions Inactive

aarongreig temporarily deployed to WindowsCILock February 4, 2025 18:00 — with GitHub Actions Inactive

aarongreig added 2 commits March 19, 2025 14:36

Adjust minimum flags in native cpu and link related issue.

e965b3e

Merge branch 'sycl' into aaron/stopReportingFPExtensions

9ecf00b

aarongreig temporarily deployed to WindowsCILock March 19, 2025 14:38 — with GitHub Actions Inactive

aarongreig temporarily deployed to WindowsCILock March 19, 2025 15:27 — with GitHub Actions Inactive

sarnex requested a review from a team as a code owner May 16, 2025 21:00

Merge branch 'sycl' into aaron/stopReportingFPExtensions

3f13197

aarongreig had a problem deploying to WindowsCILock July 17, 2025 15:01 — with GitHub Actions Failure

aarongreig temporarily deployed to WindowsCILock July 17, 2025 16:54 — with GitHub Actions Inactive

Fix hip build.

f5bcab7

aarongreig had a problem deploying to WindowsCILock July 18, 2025 09:56 — with GitHub Actions Failure

aarongreig temporarily deployed to WindowsCILock July 18, 2025 10:55 — with GitHub Actions Inactive

Fix unit tests and report proper vec widths for hip + cuda.

797dd4c

aarongreig temporarily deployed to WindowsCILock July 21, 2025 10:42 — with GitHub Actions Inactive

aarongreig temporarily deployed to WindowsCILock July 21, 2025 11:09 — with GitHub Actions Inactive

Merge branch 'sycl' into aaron/stopReportingFPExtensions

b9017b9

aarongreig temporarily deployed to WindowsCILock July 21, 2025 14:45 — with GitHub Actions Inactive

aarongreig temporarily deployed to WindowsCILock July 21, 2025 16:24 — with GitHub Actions Inactive

Merge branch 'sycl' into aaron/stopReportingFPExtensions

a9932cc

aarongreig temporarily deployed to WindowsCILock July 29, 2025 15:51 — with GitHub Actions Inactive

aarongreig temporarily deployed to WindowsCILock July 29, 2025 16:50 — with GitHub Actions Inactive

aelovikov-intel reviewed Aug 1, 2025

View reviewed changes

aarongreig mentioned this pull request Aug 5, 2025

[UR][Offload] Tracking issue for features missing in the offload UR adapter #18681

Open

83 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[UR] Stop querying adapter fp16/fp64 support via extension. #15811

[UR] Stop querying adapter fp16/fp64 support via extension. #15811

Uh oh!

aarongreig commented Oct 22, 2024 •

edited

Loading

Uh oh!

aelovikov-intel Aug 1, 2025

Uh oh!

aelovikov-intel Aug 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

		CASE(fp16) { return has_extension("cl_khr_fp16"); }
		CASE(fp64) { return has_extension("cl_khr_fp64"); }

	mutable JointCache<
	UREagerCache<UR_DEVICE_INFO_TYPE, UR_DEVICE_INFO_USE_NATIVE_ASSERT,
	UR_DEVICE_INFO_EXTENSIONS>, //
	URCallOnceCache<UR_DEVICE_INFO_NAME,
	// USM:
	UR_DEVICE_INFO_USM_DEVICE_SUPPORT,
	UR_DEVICE_INFO_USM_HOST_SUPPORT,
	UR_DEVICE_INFO_USM_SINGLE_SHARED_SUPPORT,
	UR_DEVICE_INFO_USM_CROSS_SHARED_SUPPORT,
	UR_DEVICE_INFO_USM_SYSTEM_SHARED_SUPPORT,
	//
	UR_DEVICE_INFO_ATOMIC_64>, //
	EagerCache<InfoInitializer>, //
	CallOnceCache<InfoInitializer,
	ext::oneapi::experimental::info::device::architecture>, //
	AspectCache<EagerCache, aspect::fp16, aspect::fp64,
	aspect::int64_base_atomics, aspect::int64_extended_atomics,
	aspect::ext_oneapi_atomic16>,
	AspectCache<
	CallOnceCache,
	// Slow, >100ns (for baseline cached ~30..40ns):
	aspect::ext_intel_pci_address, aspect::ext_intel_gpu_eu_count,
	aspect::ext_intel_free_memory, aspect::ext_intel_fan_speed,
	aspect::ext_intel_power_limits,
	// medium-slow, 60-90ns (for baseline cached ~30..40ns):
	aspect::ext_intel_gpu_eu_simd_width, aspect::ext_intel_gpu_slices,
	aspect::ext_intel_gpu_subslices_per_slice,
	aspect::ext_intel_gpu_eu_count_per_subslice,
	aspect::ext_intel_device_info_uuid,
	aspect::ext_intel_gpu_hw_threads_per_eu,
	aspect::ext_intel_memory_clock_rate,
	aspect::ext_intel_memory_bus_width,
	aspect::ext_oneapi_bindless_images,
	aspect::ext_oneapi_bindless_images_1d_usm,
	aspect::ext_oneapi_bindless_images_2d_usm,
	aspect::ext_oneapi_is_composite, aspect::ext_oneapi_is_component>>
	MCache;

[UR] Stop querying adapter fp16/fp64 support via extension. #15811

Are you sure you want to change the base?

[UR] Stop querying adapter fp16/fp64 support via extension. #15811

Uh oh!

Conversation

aarongreig commented Oct 22, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aelovikov-intel Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

aelovikov-intel Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

aarongreig commented Oct 22, 2024 •

edited

Loading