foss-for-synopsys-dwc-arc-processors
diff --git a/‎doc/documents/mli_api_desc/arithmetic_details.rst‎ renamed to ‎doc/documents/MLI_FP_data_format/MLI_FP_data_format.rst‎
Lines changed: 27 additions & 23 deletions b/‎doc/documents/mli_api_desc/arithmetic_details.rst‎ renamed to ‎doc/documents/MLI_FP_data_format/MLI_FP_data_format.rst‎
diff --git a/‎doc/documents/MLI_helpers/MLI_helpers.rst‎
Lines changed: 16 additions & 0 deletions b/‎doc/documents/MLI_helpers/MLI_helpers.rst‎
diff --git a/‎doc/documents/MLI_helpers/convert_tensor.rst‎
Lines changed: 72 additions & 0 deletions b/‎doc/documents/MLI_helpers/convert_tensor.rst‎
diff --git a/‎doc/documents/MLI_helpers/count_no_elements.rst‎
Lines changed: 51 additions & 0 deletions b/‎doc/documents/MLI_helpers/count_no_elements.rst‎
diff --git a/‎doc/documents/MLI_helpers/get_basic_elem_size.rst‎
Lines changed: 36 additions & 0 deletions b/‎doc/documents/MLI_helpers/get_basic_elem_size.rst‎
@@ -1,16 +1,13 @@
-Arithmetic Details
-------------------
+.. _mli_fpd_fmt:   
+   
+MLI Fixed-Point Data Format
+---------------------------
 
    The MLI Library targets ARCv2DSP-based platforms and relies on
    efficient use of their DSP features. For this reason, basic data types
    and the arithmetic operations on them have some specifics compared
    with operations on floating-point values.
 
-.. _mli_fpd_fmt:   
-   
-MLI Fixed-Point Data Format
-~~~~~~~~~~~~~~~~~~~~~~~~~~~
-
    Default MLI Fixed-point data format (represented by tensors of
    ``MLI_EL_FX_8`` and ``MLI_EL_FX_16`` element types) reflects general signed
    values interpreted by typical Q notation [1,2]. The following
@@ -23,14 +20,23 @@ MLI Fixed-Point Data Format
       non-sign bits are assumed to hold an integer part.
 
 .. note::
-   For more information regarding Q notation, see entries [1] & [2] of :ref:`refs`.
-    
+   For more information regarding Q notation, see 
+  
+   - `Q Notation`_ 
+
+   - `Q Notation tips and tricks`_
+
+.. _Q notation: https://en.wikipedia.org/wiki/Q_(number_format)
+   
+.. _Q Notation tips and tricks: http://x86asm.net/articles/fixed-point-arithmetic-and-tricks/
+
+..
 
 Data storage
-^^^^^^^^^^^^
+~~~~~~~~~~~~
 
    The container of the tensor’s values is always signed two’s
-   complemented integer numbers: 8 bit for ``MLI_EL_FX_8`` (also referred to as ``fx8``) and
+   complemented integer numbers: 8 bit for ``MLI_EL_FX_8`` (also referred to as ``fx8``) and   
    16 bit for ``MLI_EL_FX_16`` (also referred to as ``fx16``). ``mli_tensor`` keeps only number
    of fractional bits (see ``fx.frac_bits`` in :ref:`mli_el_prm_u`),
    which corresponds to the second designation above.
@@ -82,7 +88,7 @@ Data storage
 .. _op_fx_val:
 
 Operations on FX values
-^^^^^^^^^^^^^^^^^^^^^^^
+~~~~~~~~~~~~~~~~~~~~~~~
 
    Arithmetical operations are actually performed on signed integers
    according to the rules for two’s complemented integer numbers. Q
@@ -92,7 +98,7 @@ Operations on FX values
 .. _data_fmt_conv:
 
 Data Format Conversion
-''''''''''''''''''''''
+^^^^^^^^^^^^^^^^^^^^^^
 
    Conversion between real values and fx value might be performed
    according to the following formula:
@@ -163,15 +169,15 @@ Where:
      ``Round(0x24>>(4–1)) = Round(0x24>>3) = (0x24 + (1<<(3-1))) >> 3 = 0x28>>3 = 0x5 in Q.1(2.5)``
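
The rounding shift in the example above can be sketched in C as follows. This is an illustration only: ``fx_shift_right_round`` is a hypothetical helper name, not an MLI function, and the sketch assumes a shift of 1 to 30 bits, a value small enough that adding the rounding term does not overflow, and an arithmetic right shift for negative values.

```c
#include <stdint.h>

/* Hypothetical helper (not part of the MLI API): shift right by `shift`
 * bits with rounding to nearest, by adding half of the discarded range
 * before shifting. Assumes 1 <= shift <= 30 and no overflow on the add. */
static int32_t fx_shift_right_round(int32_t value, unsigned shift)
{
    int32_t half = (int32_t)1 << (shift - 1);
    return (value + half) >> shift;
}
```

With the numbers from the example, ``fx_shift_right_round(0x24, 3)`` adds ``1 << 2`` to get ``0x28`` and then shifts right by 3.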
 
 Addition and Subtraction
-''''''''''''''''''''''''
+^^^^^^^^^^^^^^^^^^^^^^^^
 
    In fixed point arithmetic, addition and subtraction are performed as
    they are for general integer values but only when the input values
    are in the same format. Otherwise, ensure that you perform conversion
    to bring the input values into the same format before operation.
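
One way to bring two values into the same format before adding them is sketched below. The helper name ``fx16_add_aligned`` is hypothetical and not part of the MLI API. The sketch assumes 16-bit inputs whose numbers of fractional bits differ by at most 15, widens them to 32 bits, and scales the operand with fewer fractional bits up to the format of the other one.

```c
#include <stdint.h>

/* Hypothetical helper (not part of the MLI API): add two fx16 values that
 * may have different numbers of fractional bits. The operand with fewer
 * fractional bits is scaled up, so the sum uses the larger number of
 * fractional bits, which is returned through sum_frac. */
static int32_t fx16_add_aligned(int16_t a, int a_frac,
                                int16_t b, int b_frac, int *sum_frac)
{
    int32_t wide_a = a;
    int32_t wide_b = b;

    if (a_frac < b_frac) {
        /* Multiply instead of shifting left so negative values are safe. */
        wide_a *= (int32_t)1 << (b_frac - a_frac);
        *sum_frac = b_frac;
    } else {
        wide_b *= (int32_t)1 << (a_frac - b_frac);
        *sum_frac = a_frac;
    }
    return wide_a + wide_b;
}
```

Scaling up keeps the addition exact at the cost of a wider result; scaling the other operand down with rounding is the alternative when the result must stay in 16 bits.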
 
 Multiplication
-''''''''''''''
+^^^^^^^^^^^^^^
 
    For multiplication input operands do not have to be of the same
    format. The width of the integer part of the result is the sum of 
@@ -203,7 +209,7 @@ Multiplication
    result.
 
 Division
-''''''''
+^^^^^^^^
 
    For division, input operands also do not have to be of the same
    format. The result has a format containing the difference of bits in
@@ -214,8 +220,6 @@ Division
   -  For a dividend ``x`` in Q16.16 format and a divisor y in Q7.10 format,
      the format of the result ``x/y`` is Q(16-7).(16-10), or Q9.6 format.
 
-\
-
   -  For a dividend ``x`` in Q7.8 format and a divisor y in Q3.12 format, the
      format of the result ``x/y`` is in Q4.-4 format.
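
The format rule used in the two examples above can be written as a small C sketch. The ``q_format`` type and the ``q_div_format`` function are hypothetical names introduced only for illustration.

```c
/* Hypothetical description of a Qm.n format: m integer bits, n fractional bits. */
typedef struct {
    int int_bits;
    int frac_bits;
} q_format;

/* Format of x / y: both the integer and the fractional widths are the
 * difference between the dividend and divisor widths. The fractional
 * part may come out negative, as in the Q4.-4 example. */
static q_format q_div_format(q_format dividend, q_format divisor)
{
    q_format result;
    result.int_bits = dividend.int_bits - divisor.int_bits;
    result.frac_bits = dividend.frac_bits - divisor.frac_bits;
    return result;
}
```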
 
@@ -229,7 +233,7 @@ Division
    significant bits) is required.
 
 Accumulation
-''''''''''''
+^^^^^^^^^^^^
 
    Even single addition might result in overflow if all bits of operands
    are used and both of them hold the maximum (or minimum) values. It
@@ -258,14 +262,14 @@ Accumulation
    operation.
 
 ARCv2DSP Implementation Specifics
-^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 
    The MLI Library is designed keeping performance in mind as one of the
    main goals. This section deals with manual model adaptation of MLI
    library.
 
 Bias for MAC-based Kernels
-''''''''''''''''''''''''''
+^^^^^^^^^^^^^^^^^^^^^^^^^^
 
    MAC based kernels (convolutions, fully connected, recurrent, etc)
    typically use several input tensors including input feature map,
@@ -285,7 +289,7 @@ Bias for MAC-based Kernels
    must be less or equal to 10 (since 7+3=10) for correct bias.
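
A model-preparation script could check this constraint before calling a kernel. The sketch below assumes the rule implied by the example (the number of bias fractional bits must not exceed the sum of input and weights fractional bits); ``bias_frac_bits_ok`` is a hypothetical helper, not an MLI function.

```c
#include <stdbool.h>

/* Hypothetical check (not part of the MLI API): returns true when the
 * bias format is compatible with the accumulator format, assumed here
 * to have in_frac + weights_frac fractional bits. */
static bool bias_frac_bits_ok(int in_frac, int weights_frac, int bias_frac)
{
    return bias_frac <= in_frac + weights_frac;
}
```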
 
 Configurability of Output Tensors Fractional Bits 
-''''''''''''''''''''''''''''''''''''''''''''''''''
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
 
    Not all primitives provide possibility to configure output tensor
    format – some of them derive it based on inputs or used algorithm, 
@@ -311,7 +315,7 @@ Configurability of Output Tensors Fractional Bits
    Output configurability is specified in description for each primitive.
 
 Quantization: Influence of Accumulator Bit Depth
-''''''''''''''''''''''''''''''''''''''''''''''''
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
 
    The MLI Library applies neither saturation nor post-multiplication
    shift with rounding in accumulation. Saturation is performed only for
 
@@ -0,0 +1,16 @@
+.. _mli_helpers:
+
+MLI helpers
+===========
+
+   This is a set of utility functions for getting information from data
+   structures and performing various operations on them.
+   
+.. toctree::
+   :maxdepth: 2
+   :caption: Contents:
+   
+   get_basic_elem_size.rst
+   count_no_elements.rst
+   point_to_sub_tensor.rst
+   convert_tensor.rst   
@@ -0,0 +1,72 @@
+..  _conv_tensor:
+
+Convert Tensor
+~~~~~~~~~~~~~~
+
+   This function copies elements from the input tensor to the output tensor,
+   converting data according to the type parameters of the output tensor.
+
+   For example, the function can:
+
+   -  convert data according to new element type: ``fx16`` to ``fx8`` and back
+
+   -  change data according to new data parameter: increase/decrease the
+      number of fractional bits while keeping the same element type for
+      FX data
+
+..
+
+   Conversion is performed using
+
+   -  rounding when the number of significant bits increases.
+
+   -  saturation when the number of significant bits decreases.
+
+..
+
+   This operation does not change the tensor shape; the shape is copied
+   from input to output.
+
+   The kernel can perform in-place computation, but only for conversions
+   that do not increase the data size, so that it does not lead to
+   undefined behavior. Therefore, output and input might point to exactly
+   the same memory (but without shift), except for ``fx8`` to ``fx16``
+   conversion. In-place computation might affect performance on some
+   platforms.
+
+.. _api-18:
+
+API
+^^^
+
++-----------------------+-----------------------+----------------------------------------------+
+| **Prototype**         |.. code:: c                                                           |
+|                       |                                                                      |
+|                       | mli_status mli_hlp_convert_tensor(mli_tensor *in, mli_tensor *out);  |
+|                       |                                                                      |
++-----------------------+-----------------------+----------------------------------------------+
+| **Parameters**        | ``in``                | [IN] Pointer to input                        |
+|                       |                       | tensor                                       |
++-----------------------+-----------------------+----------------------------------------------+
+|                       | ``out``               | [OUT] Pointer to                             |
+|                       |                       | output tensor                                |
++-----------------------+-----------------------+----------------------------------------------+
+| **Returns**           | ``status code``       |                                              |
++-----------------------+-----------------------+----------------------------------------------+
+
+.. _conditions-for-applying-the-function-7:
+
+Conditions for Applying the Function
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+   -  Input must be a valid tensor (see :ref:`mli_tns_struct`).
+
+   -  Before processing, the output tensor must contain a valid pointer to a
+      buffer with sufficient capacity for storing the result
+      (that is, the total number of elements in the input tensor).
+
+   -  The output tensor must also contain a valid element type and its
+      parameter (``el_params.fx.frac_bits``).
+
+   -  Before processing, the output tensor does not have to contain a valid
+      shape and rank; they are copied from the input tensor.
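
The kind of conversion described in this file can be illustrated with a standalone C sketch that works on plain arrays. It is not the implementation of ``mli_hlp_convert_tensor``: the function name ``fx16_to_fx8`` is hypothetical, and the sketch assumes that the fractional-bit counts differ by at most 15 and that right shifts of negative values are arithmetic.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical sketch (not the MLI implementation): convert `count`
 * fx16 values with in_frac fractional bits into fx8 values with
 * out_frac fractional bits, rounding when fractional bits are dropped
 * and saturating to the int8_t range. */
static void fx16_to_fx8(const int16_t *in, int in_frac,
                        int8_t *out, int out_frac, size_t count)
{
    int shift = in_frac - out_frac;

    for (size_t i = 0; i < count; ++i) {
        int32_t v = in[i];
        if (shift > 0) {
            /* Drop fractional bits with rounding to nearest. */
            v = (v + ((int32_t)1 << (shift - 1))) >> shift;
        } else if (shift < 0) {
            /* Gain fractional bits; multiply so negative values are safe. */
            v *= (int32_t)1 << (-shift);
        }
        if (v > INT8_MAX) v = INT8_MAX;   /* saturate high */
        if (v < INT8_MIN) v = INT8_MIN;   /* saturate low  */
        out[i] = (int8_t)v;
    }
}
```

Because each output element is no larger than the input element it is computed from, a loop of this shape could also run with ``out`` aliasing ``in``, which matches the in-place restriction described above.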
+
@@ -0,0 +1,51 @@
+.. _count_no_elem:
+
+Count Number of Elements 
+~~~~~~~~~~~~~~~~~~~~~~~~~
+
+   This function counts the number of elements in a tensor starting from the
+   provided dimension number (dimension numbering starts from 0):
+
+.. math:: num\_of\_elements = shape[start\_dim] \cdot shape[start\_dim + 1] \cdot \ldots \cdot shape[last\_dim]
+
+..
+
+   Where:
+
+   ``num_of_elements`` - Number of counted elements
+
+   ``shape`` - Shape of the tensor
+
+   ``start_dim`` - Start dimension for counting
+
+   ``last_dim`` - Last dimension of the tensor (tensor rank - 1)
+
+   The function calculates the total number of elements when
+   ``start_dim = 0``. It returns 0 if the conditions listed below
+   for applying the function are violated.
+
+.. _api-16:
+
+API
+^^^
+
++-----------------------+-----------------------+-------------------------------------------------+
+| **Prototype**         |.. code:: c                                                              |
+|                       |                                                                         |      
+|                       | uint32_t mli_hlp_count_elem_num(mli_tensor *in, uint32_t start_dim)     |
++-----------------------+-----------------------+-------------------------------------------------+
+| **Parameters**        | ``in``                | [IN] Pointer to input  tensor                   |
++-----------------------+-----------------------+-------------------------------------------------+
+|                       | ``start_dim``         | [IN] Start dimension for counting               |
++-----------------------+-----------------------+-------------------------------------------------+
+| **Returns**           | ``Number of elements``|                                                 |
++-----------------------+-----------------------+-------------------------------------------------+
+
+.. _conditions-for-applying-the-function-5:
+
+Conditions for Applying the Function
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+-  Input must contain a valid rank (less than ``MLI_MAX_RANK``).
+
+-  ``start_dim`` must be less than or equal to the input rank.
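
The counting formula can be sketched in standalone C as follows. The ``sketch_tensor`` type, the ``SKETCH_MAX_RANK`` constant, and the function name are hypothetical stand-ins for ``mli_tensor``, ``MLI_MAX_RANK``, and the library helper; the sketch also assumes that a rank equal to the maximum is accepted, which the text above does not state.

```c
#include <stdint.h>

#define SKETCH_MAX_RANK 4  /* assumption: stand-in for MLI_MAX_RANK */

/* Hypothetical minimal tensor description (not the real mli_tensor). */
typedef struct {
    uint32_t shape[SKETCH_MAX_RANK];
    uint32_t rank;
} sketch_tensor;

/* Multiply shape[start_dim] .. shape[rank - 1]; return 0 when the
 * documented conditions are violated. */
static uint32_t sketch_count_elem_num(const sketch_tensor *in, uint32_t start_dim)
{
    if (in->rank > SKETCH_MAX_RANK || start_dim > in->rank) {
        return 0;
    }
    uint32_t count = 1;
    for (uint32_t dim = start_dim; dim < in->rank; ++dim) {
        count *= in->shape[dim];
    }
    return count;
}
```

For a tensor of shape ``{2, 3, 4, 5}``, starting from dimension 0 gives the total element count and starting from dimension 2 gives the size of one innermost 4 x 5 slice.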
@@ -0,0 +1,36 @@
+.. _get_elm_size:
+
+Get Basic Element Size
+~~~~~~~~~~~~~~~~~~~~~~
+
+   This function returns the size of the tensor basic element in bytes. It
+   returns 0 if the conditions listed below for applying the function are
+   violated.
+
+.. _api-15:
+
+API
+^^^
+
++-----------------------+-----------------------+-----------------------+
+|                       |.. code:: c                                    |
+|                       |                                               |
+| **Prototype**         | uint32_t mli_hlp_get_element_size             |
+|                       | (mli_tensor *in)                              |
+|                       |                                               |
++-----------------------+-----------------------+-----------------------+
+|                       |                       |                       |
+| **Parameters**        | ``in``                | [IN] Pointer to input |
+|                       |                       | tensor                |
++-----------------------+-----------------------+-----------------------+
+| **Returns**           | Size of basic element                         |
+|                       | in bytes                                      |
++-----------------------+-----------------------+-----------------------+
+
+.. _conditions-for-applying-the-function-4:
+
+Conditions for Applying the Function
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+   The input must point to a tensor of a supported element type (see
+   :ref:`mli_elm_enum`).
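
The mapping from element type to size can be sketched in standalone C. The enum and function below are hypothetical stand-ins, not the MLI definitions; the sizes follow the 8-bit and 16-bit containers described for ``MLI_EL_FX_8`` and ``MLI_EL_FX_16``.

```c
#include <stdint.h>

/* Hypothetical stand-in for the MLI element type enumeration. */
typedef enum {
    SKETCH_EL_FX_8,
    SKETCH_EL_FX_16
} sketch_el_type;

/* Return the container size in bytes, or 0 for an unsupported type. */
static uint32_t sketch_element_size(sketch_el_type type)
{
    switch (type) {
    case SKETCH_EL_FX_8:
        return (uint32_t)sizeof(int8_t);
    case SKETCH_EL_FX_16:
        return (uint32_t)sizeof(int16_t);
    default:
        return 0;
    }
}
```

Multiplying this size by the total number of elements gives the minimum buffer capacity in bytes for a tensor of that type.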
+