Skip to content

Commit fa37755

Browse files
author
github-actions[doc-deploy-bot]
committed
Docs for pull request 2096
1 parent 7bcd59f commit fa37755

File tree

5 files changed

+13
-11
lines changed

5 files changed

+13
-11
lines changed

pulls/2096/_modules/dpctl/tensor/_ctors.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -904,8 +904,6 @@ <h1>Source code for dpctl.tensor._ctors</h1><div class="highlight"><pre>
904904
<span class="k">raise</span> <span class="ne">TypeError</span><span class="p">(</span>
905905
<span class="sa">f</span><span class="s2">&quot;Expected dpctl.tensor.usm_ndarray, got </span><span class="si">{</span><span class="nb">type</span><span class="p">(</span><span class="n">usm_ndary</span><span class="p">)</span><span class="si">}</span><span class="s2">&quot;</span>
906906
<span class="p">)</span>
907-
<span class="k">if</span> <span class="n">dtype</span> <span class="ow">is</span> <span class="kc">None</span><span class="p">:</span>
908-
<span class="n">dtype</span> <span class="o">=</span> <span class="n">usm_ndary</span><span class="o">.</span><span class="n">dtype</span>
909907
<span class="k">if</span> <span class="n">usm_type</span> <span class="ow">is</span> <span class="kc">None</span><span class="p">:</span>
910908
<span class="n">usm_type</span> <span class="o">=</span> <span class="n">usm_ndary</span><span class="o">.</span><span class="n">usm_type</span>
911909
<span class="k">if</span> <span class="n">sycl_queue</span> <span class="ow">is</span> <span class="ow">not</span> <span class="kc">None</span><span class="p">:</span>
@@ -915,6 +913,8 @@ <h1>Source code for dpctl.tensor._ctors</h1><div class="highlight"><pre>
915913
<span class="n">copy_q</span> <span class="o">=</span> <span class="n">normalize_queue_device</span><span class="p">(</span><span class="n">sycl_queue</span><span class="o">=</span><span class="n">sycl_queue</span><span class="p">,</span> <span class="n">device</span><span class="o">=</span><span class="n">exec_q</span><span class="p">)</span>
916914
<span class="k">else</span><span class="p">:</span>
917915
<span class="n">copy_q</span> <span class="o">=</span> <span class="n">usm_ndary</span><span class="o">.</span><span class="n">sycl_queue</span>
916+
<span class="k">if</span> <span class="n">dtype</span> <span class="ow">is</span> <span class="kc">None</span><span class="p">:</span>
917+
<span class="n">dtype</span> <span class="o">=</span> <span class="n">_map_to_device_dtype</span><span class="p">(</span><span class="n">usm_ndary</span><span class="o">.</span><span class="n">dtype</span><span class="p">,</span> <span class="n">copy_q</span><span class="p">)</span>
918918
<span class="c1"># Conditions for zero copy:</span>
919919
<span class="n">can_zero_copy</span> <span class="o">=</span> <span class="n">copy</span> <span class="ow">is</span> <span class="ow">not</span> <span class="kc">True</span>
920920
<span class="c1"># dtype is unchanged</span>

pulls/2096/_sources/beginners_guides/installation.rst.txt

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -176,10 +176,11 @@ set ``DPCTL_TARGET_CUDA`` to a value such as ``ON``, ``TRUE``, ``YES``, ``Y``, o
176176
Note that kernels are built for ``sm_50`` by default, allowing them to work on a wider
177177
range of architectures, but limiting the usage of more recent CUDA features.
178178

179-
For reference, compute architecture strings like ``sm_80`` are based on
180-
CUDA Compute Capability. A complete mapping between NVIDIA GPU models and their
181-
respective ``sm_XX`` values can be found in the official
182-
`CUDA GPU Compute Capability <https://developer.nvidia.com/cuda-gpus>`_.
179+
For reference, compute architecture strings like ``sm_80`` correspond to specific
180+
CUDA Compute Capabilities (e.g., Compute Capability 8.0 corresponds to ``sm_80``).
181+
A complete mapping between NVIDIA GPU models and their respective
182+
Compute Capabilities can be found in the official
183+
`CUDA GPU Compute Capability <https://developer.nvidia.com/cuda-gpus>`_ documentation.
183184

184185
A full list of available SYCL alias targets is available in the
185186
`DPC++ Compiler User Manual <https://intel.github.io/llvm/UsersManual.html>`_.

pulls/2096/beginners_guides/installation.html

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -944,10 +944,11 @@ <h3>Building for custom SYCL targets<a class="headerlink" href="#building-for-cu
944944
</div>
945945
<p>Note that kernels are built for <code class="docutils literal notranslate"><span class="pre">sm_50</span></code> by default, allowing them to work on a wider
946946
range of architectures, but limiting the usage of more recent CUDA features.</p>
947-
<p>For reference, compute architecture strings like <code class="docutils literal notranslate"><span class="pre">sm_80</span></code> are based on
948-
CUDA Compute Capability. A complete mapping between NVIDIA GPU models and their
949-
respective <code class="docutils literal notranslate"><span class="pre">sm_XX</span></code> values can be found in the official
950-
<a class="reference external" href="https://developer.nvidia.com/cuda-gpus">CUDA GPU Compute Capability</a>.</p>
947+
<p>For reference, compute architecture strings like <code class="docutils literal notranslate"><span class="pre">sm_80</span></code> correspond to specific
948+
CUDA Compute Capabilities (e.g., Compute Capability 8.0 corresponds to <code class="docutils literal notranslate"><span class="pre">sm_80</span></code>).
949+
A complete mapping between NVIDIA GPU models and their respective
950+
Compute Capabilities can be found in the official
951+
<a class="reference external" href="https://developer.nvidia.com/cuda-gpus">CUDA GPU Compute Capability</a> documentation.</p>
951952
<p>A full list of available SYCL alias targets is available in the
952953
<a class="reference external" href="https://intel.github.io/llvm/UsersManual.html">DPC++ Compiler User Manual</a>.</p>
953954
<p>To build for AMD devices, use:</p>

pulls/2096/objects.inv

0 Bytes
Binary file not shown.

pulls/2096/searchindex.js

Lines changed: 1 addition & 1 deletion
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

0 commit comments

Comments
 (0)