You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -253,23 +253,22 @@ Attributes `L1_hint`, `L2_hint`, and `L3_hint` can be applied to prefetch.
253
253
XeGPU.atomic_rmw reuses the arith dialect attribute, ::mlir::arith::AtomicRMWKindAttr.
254
254
In case that certain Xe GPU target does not support atomic operation for a certain data type, the user needs to convert the matrix to the supported datatype to perform the atomic operation.
255
255
256
-
alloc_nbarrier allocates named barriers. Named barrier is workgroup level resource, shared by all subgroups.
256
+
alloc_nbarrier allocates a set of named barriers with the specified number. Named barrier is workgroup level resource, shared by all subgroups.
257
257
```mlir
258
-
XeGPU.alloc_nbarrier %nbarrier_count: i8
258
+
XeGPU.alloc_nbarrier %total_nbarrier_num: i8
259
259
```
260
-
`create_nbarrier` assigns a role for a specific named barrier to be producer and/or consumer. The returned nbarrier object holds a description of the specified barrier, which encodes all the barrier information. It also binds the current thread with the named barrier by holding the returned nbarrier object. Multiple threads may bind to the same nbarrier so that they can sync with each other.
260
+
`init_nbarrier` returns one named barrier with the specified barrier ID to the current thread. Multiple threads may bind to the same named barrier, and the input specifies the number of total participant threads. The returned nbarrier object holds a description of the specified barrier, which encodes all the barrier information.
0 commit comments