<p>The tuple describes the non-partitioned device where the array has been
allocated, or the non-partitioned parent device of the allocation
device.</p>
-<p>See <code class="docutils literal notranslate"><span class="pre">DLDeviceType</span></code> for a list of devices supported by the DLPack
-protocol.</p>
+<p>See <a class="reference internal" href="../../tensor.constants.html#dpctl.tensor.DLDeviceType" title="dpctl.tensor.DLDeviceType"><code class="xref py py-class docutils literal notranslate"><span class="pre">dpctl.tensor.DLDeviceType</span></code></a> for a list of devices supported
<dd class="field-odd"><p><strong>DLPackCreationError</strong> – when the <code class="docutils literal notranslate"><span class="pre">device_id</span></code> could not be determined.</p>
<h2>Installation via Intel(R) Distribution for Python<a class="headerlink" href="#installation-via-intel-r-distribution-for-python" title="Permalink to this heading">¶</a></h2>
<p><a class="reference external" href="https://www.intel.com/content/www/us/en/developer/tools/oneapi/distribution-for-python.html">Intel(R) Distribution for Python*</a> is distributed as a conda-based installer
-and includes <a class="reference internal" href="../api_reference/dpctl/index.html#module-dpctl" title="dpctl"><code class="xref py py-mod docutils literal notranslate"><span class="pre">dpctl</span></code></a> along with its dependencies and sister projects <a class="reference external" href="https://intelpython.github.io/dpnp/overview.html#module-dpnp" title="(in Data Parallel Extension for NumPy v0.19.0dev1+15.g876e9403a7e)"><code class="xref py py-mod docutils literal notranslate"><span class="pre">dpnp</span></code></a>
+and includes <a class="reference internal" href="../api_reference/dpctl/index.html#module-dpctl" title="dpctl"><code class="xref py py-mod docutils literal notranslate"><span class="pre">dpctl</span></code></a> along with its dependencies and sister projects <a class="reference external" href="https://intelpython.github.io/dpnp/overview.html#module-dpnp" title="(in Data Parallel Extension for NumPy v0.19.0dev2+1.g30918e48741)"><code class="xref py py-mod docutils literal notranslate"><span class="pre">dpnp</span></code></a>
<p>Once the installed environment is activated, <code class="docutils literal notranslate"><span class="pre">dpctl</span></code> should be ready to use.</p>
<a class="reference external" href="https://intel.github.io/llvm/UsersManual.html">DPC++ Compiler User Manual</a>.</p>
<section id="cuda-build">
<h4>CUDA build<a class="headerlink" href="#cuda-build" title="Permalink to this heading">¶</a></h4>
-<p><code class="docutils literal notranslate"><span class="pre">dpctl</span></code> can be built for CUDA devices using the <code class="docutils literal notranslate"><span class="pre">DPCTL_TARGET_CUDA</span></code> CMake option,
-which accepts a specific compute architecture string:</p>
+<p><code class="docutils literal notranslate"><span class="pre">dpctl</span></code> can be built for CUDA devices using the <code class="docutils literal notranslate"><span class="pre">--target-cuda</span></code> argument.</p>
+<p>To target a specific architecture (e.g., <code class="docutils literal notranslate"><span class="pre">sm_80</span></code>):</p>
<p>To use the default architecture (<code class="docutils literal notranslate"><span class="pre">sm_50</span></code>),
+<p>To use the default architecture (<code class="docutils literal notranslate"><span class="pre">sm_50</span></code>) with CMake options,
set <code class="docutils literal notranslate"><span class="pre">DPCTL_TARGET_CUDA</span></code> to a value such as <code class="docutils literal notranslate"><span class="pre">ON</span></code>, <code class="docutils literal notranslate"><span class="pre">TRUE</span></code>, <code class="docutils literal notranslate"><span class="pre">YES</span></code>, <code class="docutils literal notranslate"><span class="pre">Y</span></code>, or <code class="docutils literal notranslate"><span class="pre">1</span></code>:</p>
@@ -957,26 +965,28 @@ <h4>CUDA build<a class="headerlink" href="#cuda-build" title="Permalink to this
</section>
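The CUDA option values described above can be summarized in a short Python sketch. It is illustrative only and is not taken from the dpctl build system: the helper names `resolve_cuda_arch` and `cuda_cmake_define` are hypothetical, and treating the default-selecting values case-insensitively is an assumption.

```python
# Values the text lists as selecting the default architecture.
DEFAULT_ARCH_VALUES = {"ON", "TRUE", "YES", "Y", "1"}
# Default architecture named in the text.
DEFAULT_CUDA_ARCH = "sm_50"


def resolve_cuda_arch(option_value: str) -> str:
    """Return the architecture implied by a DPCTL_TARGET_CUDA value.

    Hypothetical helper: mirrors the documented behavior, not dpctl's code.
    """
    value = option_value.strip()
    if not value:
        raise ValueError("DPCTL_TARGET_CUDA value must not be empty")
    # Assumption: ON/TRUE/YES/Y/1 are matched case-insensitively.
    if value.upper() in DEFAULT_ARCH_VALUES:
        return DEFAULT_CUDA_ARCH
    # Anything else is treated as an explicit architecture string, e.g. sm_80.
    return value


def cuda_cmake_define(option_value: str) -> str:
    """Format the CMake definition for the given option value."""
    return f"-DDPCTL_TARGET_CUDA={option_value.strip()}"


if __name__ == "__main__":
    for raw in ("ON", "sm_80"):
        print(cuda_cmake_define(raw), "selects", resolve_cuda_arch(raw))
```

Under these assumptions, `ON` and `sm_80` both produce a valid definition; only the second one names an architecture explicitly.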
<section id="amd-build">
<h4>AMD build<a class="headerlink" href="#amd-build" title="Permalink to this heading">¶</a></h4>
-<p><code class="docutils literal notranslate"><span class="pre">dpctl</span></code> can be built for AMD devices using the <code class="docutils literal notranslate"><span class="pre">DPCTL_TARGET_HIP</span></code> CMake option,
-which requires specifying a compute architecture string:</p>
<p><code class="docutils literal notranslate"><span class="pre">dpctl</span></code> can be built for AMD devices using the <code class="docutils literal notranslate"><span class="pre">--target-hip</span></code> argument.</p>
<p>Note that the <cite>oneAPI for AMD GPUs</cite> plugin requires the architecture be specified and only
one architecture can be specified at a time.</p>
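The two constraints in this note (an architecture must be given, and only one at a time) can be checked before a build is started. The sketch below is a hypothetical helper, not part of dpctl: the name `hip_target_argument`, the `gfx` name pattern, and the `--target-hip=<arch>` spelling with an equals sign are assumptions made for illustration.

```python
import re

# Assumption: architecture names look like gfx90a or gfx1030.
_GFX_NAME = re.compile(r"gfx[0-9a-f]+")


def hip_target_argument(arch: str) -> str:
    """Validate a single AMD architecture and format it as an argument.

    Hypothetical helper: enforces the constraints stated in the note.
    """
    value = arch.strip()
    if not value:
        raise ValueError("an architecture must be specified for AMD builds")
    # Reject lists such as "gfx90a,gfx1030": only one architecture is allowed.
    if len(re.split(r"[,;\s]+", value)) > 1:
        raise ValueError("only one architecture can be specified at a time")
    if not _GFX_NAME.fullmatch(value):
        raise ValueError(f"unexpected architecture name: {value!r}")
    return f"--target-hip={value}"


if __name__ == "__main__":
    print(hip_target_argument("gfx90a"))
```

Failing early with a clear message is a design choice of this sketch; the build itself may report such errors differently.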
<p>To determine the architecture code (<code class="docutils literal notranslate"><span class="pre">&lt;arch&gt;</span></code>) for your AMD GPU, run:</p>
<p>This will print names like <code class="docutils literal notranslate"><span class="pre">gfx90a</span></code>, <code class="docutils literal notranslate"><span class="pre">gfx1030</span></code>, etc.
-You can then use one of them as the argument to <code class="docutils literal notranslate"><span class="pre">-DDPCTL_TARGET_HIP</span></code>.</p>
+You can then use one of them as the argument to <code class="docutils literal notranslate"><span class="pre">--target-hip</span></code>.</p>
<p>For example:</p>
+<p>Alternatively, you can use the <code class="docutils literal notranslate"><span class="pre">DPCTL_TARGET_HIP</span></code> CMake option:</p>
<h4>Multi-target build<a class="headerlink" href="#multi-target-build" title="Permalink to this heading">¶</a></h4>
<p>The default <code class="docutils literal notranslate"><span class="pre">dpctl</span></code> build from the source enables support of Intel devices only.
Extending the build with a custom SYCL target additionally enables support of CUDA or AMD
device in <code class="docutils literal notranslate"><span class="pre">dpctl</span></code>. Besides, the support can be also extended to enable both CUDA and AMD