DPNP_GettingStarted

AvijitBag07 · AvijitBag07 · commit 52eb727e1771 · 2024-10-01T07:11:23.000Z
Signed-off-by: AvijitBag07 <AvijitBag07@intel.com>
diff --git a/DirectProgramming/Python/DPNP_GettingStarted/License.txt b/DirectProgramming/Python/DPNP_GettingStarted/License.txt
@@ -0,0 +1,7 @@
+Copyright Intel Corporation
+
+Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
diff --git a/DirectProgramming/Python/DPNP_GettingStarted/README.md b/DirectProgramming/Python/DPNP_GettingStarted/README.md
@@ -0,0 +1,84 @@
+# Intel® Python Data Parallel Extension for NumPy Getting Started Sample
+
+The `Intel® Python DPNP Getting Started` sample code shows how to solve a system of linear equations with the conjugate gradient method using the Intel Python API powered by the [Intel® Python DPNP - Data Parallel Extension for NumPy](https://github.com/IntelPython/dpnp).
+
+| Area                   | Description
+| :---                   | :---
+| Category               | Getting Started
+| What you will learn    | DPNP programming model for Intel GPU
+| Time to complete       | 60 minutes
+>**Note**: This sample is migrated from a CuPy Python sample. See the [ConjugateGradient](https://github.com/cupy/cupy/blob/main/examples/cg/cg.py) example in the CuPy GitHub repository.
+
+
+## Purpose
+The Data Parallel Extension for NumPy* (dpnp package) is a library that implements a subset of NumPy* that can be executed on any data parallel device. The subset is a drop-in replacement for core NumPy* functions and numerical data types.
+
+DPNP is used to offload Python code to Intel GPUs. Its API is very similar to the CuPy API; see the [comparison table](https://intelpython.github.io/dpnp/reference/comparison.html#).
+
+
+## Prerequisites
+
+| Optimized for           | Description
+| :---                    | :---
+| OS                      | Ubuntu* 22.04 (or newer)
+| Hardware                | Intel® Gen9 <br>Intel® Gen11 <br>Intel® Data Center GPU Max 
+| Software                | Intel® Python Data Parallel Extension for NumPy (DPNP)
+> **Note**: [Intel® Python DPNP - Data Parallel Extension for NumPy](https://github.com/IntelPython/dpnp).
+
+## Key Implementation Details
+
+- This getting-started sample is implemented in Python for Intel GPUs. It assumes that the latest DPNP is installed in the environment, similar to what is delivered with the installation of the [Intel® Distribution for Python*](https://www.intel.com/content/www/us/en/developer/tools/oneapi/distribution-python-download.html).
+  
+## Environment Setup
+
+You will need to download and install the following toolkits, tools, and components to use the sample.
+
+**1. Intel Python**
+
+
+Required Intel Python package: DPNP. To install it, select **Intel® Distribution for Python\*: Offline** on [Get Intel® Distribution for Python*](https://www.intel.com/content/www/us/en/developer/tools/oneapi/distribution-python-download.html).
+
+
+**2. (Offline Installer) Update the Intel Python base environment**
+
+Load the Python environment:
+```
+source $PYTHON_INSTALL/env/vars.sh
+```
+ 
+**3. (Offline Installer) Check the DPNP version**
+
+```
+python -c "import dpnp; print(dpnp.__version__)"
+``` 
+Note: If the version is 0.15.0 or later, continue. Otherwise, upgrade dpnp before running the sample.
+
+**4. Clone the GitHub repository**
+``` 
+git clone https://github.com/oneapi-src/oneAPI-samples.git
+cd oneAPI-samples/DirectProgramming/Python/DPNP_GettingStarted
+```
+
+
+## Run the Sample
+>**Note**: Before running the sample, make sure Intel Python is installed.
+
+1. Change to the sample directory.
+2. Run the program.
+   ```
+   $ python cg.py
+   ```
+
+## License
+
+Code samples are licensed under the MIT license. See
+[License.txt](https://github.com/oneapi-src/oneAPI-samples/blob/master/License.txt)
+for details.
+
+Third-party program licenses can be found here:
+[third-party-programs.txt](https://github.com/oneapi-src/oneAPI-samples/blob/master/third-party-programs.txt)
+
+*Other names and brands may be claimed as the property of others. [Trademarks](https://www.intel.com/content/www/us/en/legal/trademarks.html)
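Review note on README.md: the Purpose section describes dpnp as a drop-in replacement for core NumPy functions. A minimal sketch of that idea follows, assuming the calls used here (`asarray`, `dot`, `linalg.norm`) are available in both libraries, as they are in `cg.py`. The alias `xp`, the helper `residual_norm`, and the fallback to NumPy are illustrative choices, not part of the sample.

```python
# Assumption: use dpnp when it is installed, otherwise fall back to NumPy.
try:
    import dpnp as xp  # arrays live on the default SYCL device
except ImportError:
    import numpy as xp  # same function names, executed on the host


def residual_norm(A, x, b):
    """Return ||b - A x||_2 using whichever array module was imported."""
    return float(xp.linalg.norm(b - xp.dot(A, x)))


# Small symmetric positive-definite system with a known solution.
A = xp.asarray([[4.0, 1.0], [1.0, 3.0]])
b = xp.asarray([1.0, 2.0])
x = xp.asarray([1.0 / 11.0, 7.0 / 11.0])  # exact solution of A x = b

print(residual_norm(A, x, b))  # close to zero, up to rounding
```

Only the import line changes between the two backends here, which is the property the README refers to.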
diff --git a/DirectProgramming/Python/DPNP_GettingStarted/cg.py b/DirectProgramming/Python/DPNP_GettingStarted/cg.py
@@ -0,0 +1,95 @@
+import argparse
+import time
+import dpctl
+import dpctl.tensor as dpt
+import dpnp
+import numpy as np
+
+
+def fit_cpu(A, b, tol, max_iter):
+    # CPU reference implementation: 'A' and 'b' are expected to be NumPy
+    # arrays.
+    x = np.zeros_like(b, dtype=np.float64)
+    r0 = b - np.dot(A, x)
+    p = r0
+    for i in range(max_iter):
+        Ap = np.dot(A, p)
+        # Step length along the current search direction.
+        alpha = np.inner(r0, r0) / np.inner(p, Ap)
+        x += alpha * p
+        r1 = r0 - alpha * Ap
+        if np.linalg.norm(r1) < tol:
+            return x
+        # Weight of the previous direction in the next one.
+        beta = np.inner(r1, r1) / np.inner(r0, r0)
+        p = r1 + beta * p
+        r0 = r1
+    print('Failed to converge. Increase max-iter or tol.')
+    return x
+
+def fit(A, b, tol, max_iter):
+    # Device implementation: 'A' and 'b' are expected to be dpnp arrays.
+    x = dpnp.zeros_like(b, dtype=dpnp.float64)
+    r0 = b - dpnp.dot(A, x)
+    p = r0
+    for i in range(max_iter):
+        Ap = dpnp.dot(A, p)
+        # Step length along the current search direction.
+        alpha = dpnp.inner(r0, r0) / dpnp.inner(p, Ap)
+        x += alpha * p
+        r1 = r0 - alpha * Ap
+        if dpnp.linalg.norm(r1) < tol:
+            return x
+        # Weight of the previous direction in the next one.
+        beta = dpnp.inner(r1, r1) / dpnp.inner(r0, r0)
+        p = r1 + beta * p
+        r0 = r1
+    print('Failed to converge. Increase max-iter or tol.')
+    return x
+
+
+def run(gpu_id, tol, max_iter):
+    """dpnp conjugate gradient example
+
+    Solve simultaneous linear equations, Ax = b.
+    'A' and 'x' are created randomly and 'b' is computed by 'Ax' at first.
+    Then, 'x' is computed from 'A' and 'b' in two ways, namely with CPU and
+    GPU. To evaluate the accuracy of computation, the Euclidean distances
+    between the answer 'x' and the reconstructed 'x' are computed.
+
+    """
+    for repeat in range(3):
+        print('Trial: %d' % repeat)
+        # Create the large symmetric matrix 'A'.
+        N = 10000
+        A = np.random.random((N,N))
+        A = (A @ A.T).astype(np.float64)
+        x_ans = np.random.random((N)).astype(np.float64)
+        b = np.dot(A, x_ans)
+
+        print('Running CPU...')
+        start = time.time()
+        x_cpu = fit_cpu(A, b, tol, max_iter)
+        print(np.linalg.norm(x_cpu - x_ans))
+        end = time.time()
+        print('%s:  %f sec' % ("CPU", end - start))
+
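+        # Note: 'gpu_id' is parsed but not used here; the arrays are
+        # allocated on the default SYCL device.
+        a_dpt = dpnp.asarray(A, dtype=dpnp.float64)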
+        b_dpt = dpnp.asarray(b, dtype=dpnp.float64)
+
+        print('Running GPU...')
+        start = time.time()
+        x_gpu = fit(a_dpt, b_dpt, tol, max_iter)
+
+        print(np.linalg.norm(dpnp.asnumpy(x_gpu) - x_ans))
+        end = time.time()
+        print('%s:  %f sec' % ("GPU", end - start))
+
+        print()
+
+
+if __name__ == '__main__':
+    parser = argparse.ArgumentParser()
+    parser.add_argument('--gpu-id', '-g', default=0, type=int,
+                        help='ID of GPU.')
+    parser.add_argument('--tol', '-t', default=0.1, type=float,
+                        help='tolerance to stop iteration')
+    parser.add_argument('--max-iter', '-m', default=5000, type=int,
+                        help='maximum number of iterations')
+    args = parser.parse_args()
+    run(args.gpu_id, args.tol, args.max_iter)
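Review note on cg.py: `fit_cpu` and `fit` both follow the standard conjugate gradient iteration for a symmetric positive-definite matrix. The sketch below restates that iteration in plain NumPy on a small test problem so that the roles of the step length and the direction weight are easier to see. The function name `conjugate_gradient`, the 50x50 size, the diagonal shift, and the tolerance are assumptions chosen for illustration; they do not come from the sample.

```python
import numpy as np


def conjugate_gradient(A, b, tol=1e-8, max_iter=1000):
    """Solve A x = b for a symmetric positive-definite matrix A."""
    x = np.zeros_like(b, dtype=np.float64)
    r = b - A @ x              # initial residual
    p = r.copy()               # initial search direction
    rs_old = r @ r
    for _ in range(max_iter):
        Ap = A @ p             # computed once per iteration
        alpha = rs_old / (p @ Ap)   # step length along p
        x += alpha * p
        r -= alpha * Ap
        rs_new = r @ r
        if np.sqrt(rs_new) < tol:   # stop when the residual norm is small
            break
        beta = rs_new / rs_old      # weight of the old direction
        p = r + beta * p
        rs_old = rs_new
    return x


rng = np.random.default_rng(0)
M = rng.random((50, 50))
A = M @ M.T + 50 * np.eye(50)   # diagonal shift keeps A well conditioned
x_true = rng.random(50)
b = A @ x_true

x = conjugate_gradient(A, b)
print(np.linalg.norm(x - x_true))  # distance to the known solution
```

As in `run`, the right-hand side is built from a known solution so that the error of the computed `x` can be measured directly.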
diff --git a/DirectProgramming/Python/DPNP_GettingStarted/sample.json b/DirectProgramming/Python/DPNP_GettingStarted/sample.json
@@ -0,0 +1,25 @@
+{
+  "guid": "B8E1196A-83B1-4972-8D87-E15F14CAFC82",
+  "name": "Intel® Python Data Parallel Extension for NumPy",
+  "categories": ["Toolkit/oneAPI DirectProgramming/Python/DPNP_GettingStarted"],
+  "description": "This sample code shows how to solve a system of linear equations with the conjugate gradient method using the Intel Python API powered by dpnp",
+  "builder": ["cli"],
+  "languages": [{"python":{}}],
+  "dependencies": ["intelpython"],
+  "os":["linux"],
+  "targetDevice": ["GPU","CPU"],
+  "ciTests": {
+    "linux": [
+      {
+        "env": [
+          "source /intel/oneapi/intelpython/env/activate"
+        ],
+        "id": "idp_intel_dpnp_cg_py",
+        "steps": [
+          "python cg.py"
+        ]
+      }
+    ]
+  },
+  "expertise": "Code Optimization"
+}
diff --git a/DirectProgramming/Python/DPNP_GettingStarted/third-party-programs.txt b/DirectProgramming/Python/DPNP_GettingStarted/third-party-programs.txt