<h2>Installation via Intel(R) Distribution for Python<a class="headerlink" href="#installation-via-intel-r-distribution-for-python" title="Permalink to this heading">¶</a></h2>
<p><a class="reference external" href="https://www.intel.com/content/www/us/en/developer/tools/oneapi/distribution-for-python.html">Intel(R) Distribution for Python*</a> is distributed as a conda-based installer
-
and includes <a class="reference internal" href="../api_reference/dpctl/index.html#module-dpctl" title="dpctl"><code class="xref py py-mod docutils literal notranslate"><span class="pre">dpctl</span></code></a> along with its dependencies and sister projects <a class="reference external" href="https://intelpython.github.io/dpnp/overview.html#module-dpnp" title="(in Data Parallel Extension for NumPy v0.19.0dev0+17.g03e5ba0c729)"><code class="xref py py-mod docutils literal notranslate"><span class="pre">dpnp</span></code></a>
+
and includes <a class="reference internal" href="../api_reference/dpctl/index.html#module-dpctl" title="dpctl"><code class="xref py py-mod docutils literal notranslate"><span class="pre">dpctl</span></code></a> along with its dependencies and sister projects <a class="reference external" href="https://intelpython.github.io/dpnp/overview.html#module-dpnp" title="(in Data Parallel Extension for NumPy v0.19.0dev0+21.gca9a3f039a)"><code class="xref py py-mod docutils literal notranslate"><span class="pre">dpnp</span></code></a>
<p>Once the installed environment is activated, <code class="docutils literal notranslate"><span class="pre">dpctl</span></code> should be ready to use.</p>
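One way to confirm that the activated environment really provides dpctl is a short check like the one below. It is a minimal sketch, not part of the documented installation steps: the helper name `dpctl_is_importable` is hypothetical, and the check assumes that `dpctl.__version__` and `dpctl.lsplatform()` are available in the installed version.

```python
import importlib.util


def dpctl_is_importable() -> bool:
    """Return True if the dpctl package can be located in the active environment."""
    return importlib.util.find_spec("dpctl") is not None


def main() -> None:
    if dpctl_is_importable():
        import dpctl

        # Assumption: the installed dpctl exposes __version__ and lsplatform().
        print("dpctl version:", dpctl.__version__)
        dpctl.lsplatform()  # prints a summary of the SYCL platforms dpctl can see
    else:
        print("dpctl was not found; check that the environment is activated")


if __name__ == "__main__":
    main()
```

If the second message is printed, the most likely cause is that the script ran outside the activated environment.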
</section>
@@ -946,6 +946,15 @@ <h4>CUDA build<a class="headerlink" href="#cuda-build" title="Permalink to this
<p>To use the default architecture (<code class="docutils literal notranslate"><span class="pre">sm_50</span></code>) with CMake options,
+
set <code class="docutils literal notranslate"><span class="pre">DPCTL_TARGET_CUDA</span></code> to a value such as <code class="docutils literal notranslate"><span class="pre">ON</span></code>, <code class="docutils literal notranslate"><span class="pre">TRUE</span></code>, <code class="docutils literal notranslate"><span class="pre">YES</span></code>, <code class="docutils literal notranslate"><span class="pre">Y</span></code>, or <code class="docutils literal notranslate"><span class="pre">1</span></code>:</p>
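As an illustration of the accepted values, the sketch below builds the corresponding CMake cache definition in the standard `-D<NAME>=<value>` form. The helper name `cuda_default_arch_define` is hypothetical, and the helper covers only the default-architecture case described here; how the definition is passed to the dpctl build is not shown and is left to the build instructions.

```python
# The values listed above, compared case-insensitively as CMake does for booleans.
CMAKE_TRUE_CONSTANTS = {"1", "ON", "YES", "TRUE", "Y"}


def cuda_default_arch_define(value: str = "ON") -> str:
    """Build the CMake definition that enables the CUDA target with the default arch.

    Hypothetical helper: it only accepts the boolean-style values listed above.
    """
    if value.upper() not in CMAKE_TRUE_CONSTANTS:
        raise ValueError(f"{value!r} is not one of {sorted(CMAKE_TRUE_CONSTANTS)}")
    return f"-DDPCTL_TARGET_CUDA={value}"


print(cuda_default_arch_define())     # -DDPCTL_TARGET_CUDA=ON
print(cuda_default_arch_define("Y"))  # -DDPCTL_TARGET_CUDA=Y
```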
<p>Note that kernels are built for the default architecture (<code class="docutils literal notranslate"><span class="pre">sm_50</span></code>), allowing them to work on a
wider range of architectures, but limiting the usage of more recent CUDA features.</p>
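To make the trade-off concrete, the sketch below decodes an architecture string into a compute capability pair. It assumes the usual NVIDIA naming convention, in which the last digit of `sm_XY` is the minor version and the preceding digits are the major version; the function name `compute_capability` is hypothetical.

```python
import re
from typing import Tuple


def compute_capability(arch: str) -> Tuple[int, int]:
    """Decode a string such as 'sm_50' into a (major, minor) compute capability.

    Assumption: the final digit is the minor version and the rest is the major.
    """
    match = re.fullmatch(r"sm_(\d+)(\d)", arch)
    if match is None:
        raise ValueError(f"unrecognized architecture string: {arch!r}")
    return int(match.group(1)), int(match.group(2))


print(compute_capability("sm_50"))  # (5, 0)
print(compute_capability("sm_80"))  # (8, 0)
```

Kernels built for `sm_50` target capability 5.0, so features introduced with later capabilities are not available to them.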
<p>For reference, compute architecture strings like <code class="docutils literal notranslate"><span class="pre">sm_80</span></code> correspond to specific
@@ -966,6 +975,10 @@ <h4>AMD build<a class="headerlink" href="#amd-build" title="Permalink to this he
<p>This will print names like <code class="docutils literal notranslate"><span class="pre">gfx90a</span></code>, <code class="docutils literal notranslate"><span class="pre">gfx1030</span></code>, etc.
You can then use one of them as the argument to <code class="docutils literal notranslate"><span class="pre">--target-hip</span></code>.</p>
<p>For example:</p>
+
<p>Alternatively, you can use the <code class="docutils literal notranslate"><span class="pre">DPCTL_TARGET_HIP</span></code> CMake option:</p>
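A rough sketch of forming that option is shown below. It assumes that `DPCTL_TARGET_HIP` takes the architecture name (for example `gfx1030`) as its value, which the text here does not spell out; the helper name `hip_target_define` is hypothetical, and the result uses the standard CMake `-D<NAME>=<value>` form.

```python
import re


def hip_target_define(arch: str) -> str:
    """Build a CMake definition for the HIP target from a 'gfx...' architecture name.

    Assumption: DPCTL_TARGET_HIP accepts the architecture name as its value.
    """
    if not re.fullmatch(r"gfx[0-9a-z]+", arch):
        raise ValueError(f"expected a name like 'gfx90a' or 'gfx1030', got {arch!r}")
    return f"-DDPCTL_TARGET_HIP={arch}"


print(hip_target_define("gfx1030"))  # -DDPCTL_TARGET_HIP=gfx1030
print(hip_target_define("gfx90a"))   # -DDPCTL_TARGET_HIP=gfx90a
```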