Commit 6e7b191

oneDNN new features for next major version. (#439)
* oneDNN: add fpmath_mode requirements
* oneDNN: move quantization model description to int8 section
* oneDNN: update the int8 quantization model with asymmetric and dynamic quant support
* oneDNN: add matmul to primitives supporting the `any` format.
* oneDNN: add support of broadcast semantics
* oneDNN: split scale and shift flags/parameters in batchnorm
* oneDNN: split scale and shift flags/parameters in layernorm
* oneDNN: split src/dst descriptor for normalization, eltwise, shuffle
* oneDNN: remove logsoftmax and handle it through softmax
* oneDNN: add new eltwise functions and some parametrizations
* oneDNN: add prelu primitive
* oneDNN: add reduction primitive
* oneDNN: new quantization mechanism
  - split quantization parameters as per tensor properties
  - input qparams are now used to compute compensations and to dequantize after the op; output qparams are applied before the write to requantize.
  - all scales/shifts are passed at execution time, which removes the overhead of transferring host scales to the device for implementations. Also allows better reuse of primitives for dynamic quantization.
* oneDNN: add binary post-op
* oneDNN: add AUGRU and LBR-AUGRU cells to RNN primitive
* oneDNN: remove op_desc
* oneDNN: fix outdated doc in header
* fixup! oneDNN: new quantization mechanism
* fixup! oneDNN: update the int8 quantization model with asymmetric and dynamic quant support
* oneDNN: various warning/rendering fixes
  - Use relative paths for includes
  - fix missing replacements
  - fix missing references
  - fix itemizations
  - fix latex rendering
* oneDNN: int8: fix notations
* oneDNN: modify header to match spec changes
* oneDNN: update post-ops support and naming
1 parent 90727ee commit 6e7b191

34 files changed: +3885 -2576 lines changed

source/elements/oneDNN/Doxyfile

Lines changed: 35 additions & 0 deletions
@@ -267,6 +267,41 @@ TAB_SIZE = 4
 
 ALIASES =
 
+ALIASES += src="\f$\src\f$"
+ALIASES += srclayer="\f$\srclayer\f$"
+ALIASES += srclayerattention="\f$\srclayerattention\f$"
+ALIASES += srciter="\f$\srciter\f$"
+ALIASES += srciterc="\f$\srciterc\f$"
+ALIASES += weights="\f$\weights\f$"
+ALIASES += weightslayer="\f$\weightslayer\f$"
+ALIASES += weightsiter="\f$\weightsiter\f$"
+ALIASES += weightspeephole="\f$\weightspeephole\f$"
+ALIASES += weightsprojection="\f$\weightsprojection\f$"
+ALIASES += bias="\f$\bias\f$"
+ALIASES += dst="\f$\dst\f$"
+ALIASES += dstlayer="\f$\dstlayer\f$"
+ALIASES += dstiter="\f$\dstiter\f$"
+ALIASES += dstiterc="\f$\dstiterc\f$"
+ALIASES += diffsrc="\f$\diffsrc\f$"
+ALIASES += diffsrclayer="\f$\diffsrclayer\f$"
+ALIASES += diffsrclayerattention="\f$\diffsrclayerattention\f$"
+ALIASES += diffsrciter="\f$\diffsrciter\f$"
+ALIASES += diffsrciterc="\f$\diffsrciterc\f$"
+ALIASES += diffweights="\f$\diffweights\f$"
+ALIASES += diffweightslayer="\f$\diffweightslayer\f$"
+ALIASES += diffweightsiter="\f$\diffweightsiter\f$"
+ALIASES += diffweightspeephole="\f$\diffweightspeephole\f$"
+ALIASES += diffweightsprojection="\f$\diffweightsprojection\f$"
+ALIASES += diffbias="\f$\diffbias\f$"
+ALIASES += diffdst="\f$\diffdst\f$"
+ALIASES += diffdstlayer="\f$\diffdstlayer\f$"
+ALIASES += diffdstiter="\f$\diffdstiter\f$"
+ALIASES += diffdstiterc="\f$\diffdstiterc\f$"
+ALIASES += diffgamma="\f$\diffgamma\f$"
+ALIASES += diffbeta="\f$\diffbeta\f$"
+ALIASES += workspace="\f$\workspace\f$"
+
+
 # Set the OPTIMIZE_OUTPUT_FOR_C tag to YES if your project consists of C sources
 # only. Doxygen will then generate output that is more tailored for C. For
 # instance, some of the names that are used will be different. The list of all
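
For readers unfamiliar with Doxygen aliases: each `ALIASES += name="value"` line in the hunk above defines a custom command, so writing `@src` (or `\src`) in a documentation comment is replaced by the inline formula `\f$\src\f$`. The sketch below shows how a header comment might use two of the new aliases. The function `relu_forward` is hypothetical and is not a oneDNN API, and the sketch assumes the LaTeX macros `\src` and `\dst` are defined elsewhere in the documentation build, which this hunk does not show.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

/// Hypothetical element-wise ReLU, for illustration only (not a oneDNN API).
///
/// Computes @dst = max(@src, 0) element by element. With the aliases from
/// this Doxyfile, Doxygen would rewrite @src and @dst as inline formulas.
///
/// @param src Input values, documented as @src.
/// @param dst Output values (@dst); must already have the same size as src.
inline void relu_forward(const std::vector<float> &src, std::vector<float> &dst) {
    // Precondition: the caller pre-sizes the destination buffer.
    assert(dst.size() == src.size());
    for (std::size_t i = 0; i < src.size(); ++i)
        dst[i] = src[i] > 0.0f ? src[i] : 0.0f;
}
```

The benefit of routing tensor names through aliases is that the notation lives in one place: changing how the source tensor is typeset only requires editing the alias (or the underlying LaTeX macro), not every comment that mentions it.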
