forked from pytorch/pytorch
-
Notifications
You must be signed in to change notification settings - Fork 75
[ROCm][Windows] Fixing undefined symbol linker error after exposing MIOpen symbols #2415
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 83 commits
Commits
Show all changes
84 commits
Select commit
Hold shift + click to select a range
8734655
[release/2.8] Enable wheels
jithunnair-amd dc95b0c
Updates to build for Noble (Ubuntu 24.04) and py3.12
jithunnair-amd b741af3
[release/2.8] Make triton build ROCm version agnostic
ethanwee1 b4c293a
[release/2.8] Replace upstream install_rocm_magma.sh with rocm fork v…
jithunnair-amd 9ed3d2e
[release/2.8] Upgrade numpy versions; Use different package versions …
jithunnair-amd 12508fd
[release/2.8] Removing --user flag from all pip install commands
ethanwee1 90d7d4b
[ROCm] Remove use of warpsize on host-side compilation (pytorch#156979)
jithunnair-amd 186180d
[release/2.8] Improve C10_WARP_SIZE compatibility
xinyazhang 8e7b99f
Fix sha256 for aotriton ROCm7.0 tarball
jithunnair-amd d7c64fc
Update third_party/composable_kernel submodule commit as per https://…
jithunnair-amd b81d4d1
Use ROCm/triton and update triton.txt
jithunnair-amd 98e9537
Add related_commits file (#2396)
pragupta 12a145a
Add QA automation scripts for running PyTorch unit tests
jithunnair-amd 3c7ddbf
[release/2.6] enable NHWC batchnorm with MIOpen (#2023)
dnikolaev-amd fb20451
test_decompose_mem_bound_mm.py tolerance increase for navi3x
iupaikov-amd 32449c9
[release/2.7] enable NHWC batchnorm by default on ROCm7.0+ (#2180)
dnikolaev-amd 23f0b5f
[release/2.7] import 'Dict' to fix common_utils.py (#2181)
dnikolaev-amd 48630d8
[AUTOGENERATED] [release/2.7] [rocm6.4_internal_testing] Replaced ROC…
okakarpa ae17c3a
[release/2.7] [SWDEV-535259] enable miopen channels last 3d for conv …
okakarpa e4d62b1
[AUTOGENERATED] [release/2.7] Add 3D batchnorm tests (#2243)
okakarpa d40f3c8
[AUTOGENERATED] [release/2.5] [ROCm][layer_norm] Use __builtin_amdgcn…
rocm-mici dbb9f2a
[release/2.6] remove xfail from 'batch_norm_with_update' (#2070)
dnikolaev-amd e62e394
[release/2.7] Enable mx fp8 support on ROCm (#2199)
jagadish-amd e0160f1
Extend CK gemm/sdpa support to gfx950 (#45)
alugorey 08390c7
[release/2.6] [SWDEV-529824] Fix Float16 CooperativeReduction Test Fa…
pmaybank 01857c6
[ROCm] Set thread_work_size to 16 for vectorized elementwise kernels …
jerrymannil e60c0c4
[release/2.7] Fix SDPA skip logic (#2281)
AmdSampsa 01eaee8
[release/2.7] Update test_binary_ufuncs.py after numpy upgrade (#2289)
ethanwee1 80e8974
[AUTOGENERATED] [release/2.7] fix jit_utils.cpp (#2320)
okakarpa bb44c0c
Clean up CUDA state between tests (#2335)
rraminen 1f312c4
cublaslt/hipblaslt persistent workspace (#156495)
jeffdaily 3b7f377
[AUTOGENERATED] [release/2.7] [release/2.6] Fix dtype before comparin…
okakarpa 8b23614
[ROCm][Windows] Fixing undefined symbol linker error after exposing M…
tvukovic-amd 5446c03
[MPS] Fix `index_kernel` for large tensors (#158239)
pytorchbot 71c68bc
Add flag to fx.passes.split_module to normalize input names (#157793)
pytorchbot 4c1d666
Add warning about removed sm50 and sm60 arches (#158478)
pytorchbot 352edf2
[cherry-pick][inductor][triton] Update HAS_WARP_SPEC to check triton.…
atalman 66b89d1
[CUDA] Use runtime driver API for cuStreamWriteValue32 (#158585)
pytorchbot 10eb3f2
Add stride check for attn_mask on non-cpu device (#158618)
CaoE 117d9d4
[cherry-pick] temporarily disabling generation of weblinks for torch …
Sidharth123-cpu 88d04c8
[Reland] Add warning about removed sm50 and sm60 arches (#158744)
atalman 3006279
[cherry-pick][release 2.8] Update OpenBLAS commit (#151547) (#158243)
Camyll 45ef46b
[cherry-pick][Docker builds] Move from Miniconda to Miniforge (#15837…
atalman e5e8a38
[async-TP] Turn asserts back into silent skips (#158736)
pytorchbot a3dea79
[cherry-pick] Fix AArch64 segfaults by disabling strict-aliasing in G…
robert-hardwick d3960e5
Pull latest Sphinx theme (#158595) (#158673)
svekars 2f85ac2
[Dynamo] Use proper sources for constructing dataclass defaults (#158…
pytorchbot 9298444
[cherry-pick] Unify torch.tensor and torch.ops.aten.scalar_tensor beh…
atalman 29973ff
Cherry pick PR 158746 (#158801)
svekars d007588
[MPS] Reimplement `tri[ul]` as Metal shaders (#158867)
pytorchbot 9176b69
[MPS] Switch Cholesky decomp to column wise (#158237)
pytorchbot 2d0385b
Revert "[Dynamo] Allow inlining into AO quantization modules (#152934…
atalman c1f2017
Move out super large one off foreach_copy test (#158880)
pytorchbot 947a201
[Release Only] Remove nvshmem from list of preload libraries (#158925)
atalman 360aa17
Use ROCm/triton and update triton.txt
jithunnair-amd f34b83a
[release/2.8] [Bugfix][Inductor] Fix dependency list merged incorrect…
pragupta bbb1d6e
[release/2.8] enable py3.13 (#2366)
ethanwee1 af2ce88
[SWDEV-539076] Initial naive foreach autotune support (#2377)
jataylo 41956f1
[release/2.7][ROCm][tunableop] UT tolerance increase for matmul_small…
naromero77amd 0826c75
[release/2.7] [SWDEV-543214] Reland #2416 Fix warps runtime (#2421)
jataylo af7b538
[AUTOGENERATED] [release/2.8] [ROCm] Use opportunistic fastatomics ba…
okakarpa b10cd6b
Update triton pin for gfx950 improvements (#2443)
jataylo 5413133
[AUTOGENERATED] [release/2.8] [release/2.7] [SWDEV-543214] Reland #24…
okakarpa d6a6383
[AUTOGENERATED] [release/2.8] [ROCm] Limit number of values per threa…
okakarpa 3995f1a
[release/2.8] Define datatypes when ROCM_VERSION >= 70000 (#2470)
rraminen 4fe2355
[release/2.8] Add mx fp4 support (#2472)
jagadish-amd 016bbef
[AUTOGENERATED] [release/2.8] [rocm7.0_internal_testing] skip test_tr…
okakarpa 8e96f16
Update version as 2.8.0
jithunnair-amd 29b4c24
[release/2.8] pin requirements.txt (#2481)
ethanwee1 16cac0c
[AUTOGENERATED] [release/2.8] [SWDEV-539215] - Autotune support for p…
okakarpa 0856917
[release/2.8] fp8: skip rowwise tests (#2477)
jagadish-amd 0da7d02
[release/2.8] update related_commit (#2490)
amd-sriram f7921f4
[SWDEV-539119] [release/2.8] Add fast_tanh support (#2484)
jataylo 2b29216
[AUTOGENERATED] [release/2.8] remove extra transposes in NHWC convolu…
okakarpa 4634272
[release/2.8] [triton] Triton bump to fix ROCm 7.0 issues (#2498)
iupaikov-amd 0e1a3e9
[AUTOGENERATED] [release/2.8] [ROCm] Improve reduction sum performanc…
dhonnappa-amd fe840fa
[release/2.8] Using c10d.barrier() in test_extra_cuda_context test in…
dhonnappa-amd d9d5b96
[AUTOGENERATED] [release/2.8] Change triton package name depending on…
dhonnappa-amd 608069b
[AUTOGENERATED] [release/2.8] NAVI32 specific fixes (#2467)
okakarpa d6007d3
[release/2.8] Define uint32 t when ROCM_VERSION >= 70000 (#2513)
rraminen 5d3dec1
[AUTOGENERATED] [release/2.8] Remove tb-nightly (#2538)
dhonnappa-amd 8ade7b5
Use ROCm/triton and update triton.txt
jithunnair-amd e4df565
Merge branch 'release/2.8' into fix_torch_macros_for_miopen
tvukovic-amd 52684a6
revert triton change
tvukovic-amd File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1 +1 @@ | ||
| f9e5bf54a2fe1a6262a41b27b38180cdb6fae6a2 | ||
| f9e5bf54a2fe1a6262a41b27b38180cdb6fae6a2 | ||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nope, don't do this. Remove the triton change.