Skip to content

Commit 38b2ac3

Browse files
committed
Update documentation
1 parent 014c9c7 commit 38b2ac3

File tree

1 file changed

+10
-1
lines changed

1 file changed

+10
-1
lines changed

include/gemmi.hpp

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -111,7 +111,7 @@ struct MatrixSplit {
111111
}
112112

113113
/* Split the matrix using round-to-nearest. This is an implementation of
114-
* Algorithm 8 in
114+
* Algorithm 8 in:
115115
*
116116
* Uchino Y., Ozaki K., Imamura T. Performance enanchcement of the Ozaki
117117
* scheme on integer matrix multiplication unit. arXiv:2409.13313 [cs.DC]. 2024.
@@ -139,6 +139,15 @@ struct MatrixSplit {
139139
}
140140
}
141141

142+
/* Split the matrix using bit masking, which is equivalent to truncation.
143+
* This is an implementation of Algorithm 4 in:
144+
*
145+
* Ootomo H., Ozaki K., Yokota R. DGEMM on integer matrix multiplication
146+
* unit. Int. J. High Performance Comput. App. 2024;38(4):297-313.
147+
* DOI: 10.1177/10943420241239588
148+
*
149+
* Integer products are accumulated in floating-point arithmetic one by one.
150+
*/
142151
void computeSplitsWithBitMasking() {
143152
this->splitType = splittingStrategy::bitMasking;
144153
// Compute splits one row/column at a time.

0 commit comments

Comments
 (0)