Skip to content

Conversation

@mgorny
Copy link
Contributor

@mgorny mgorny commented Nov 8, 2024

Checklist

  • Used a personal fork of the feedstock to propose changes
  • Bumped the build number (if the version is unchanged)
  • Reset the build number to 0 (if the version changed)
  • Re-rendered with the latest conda-smithy (Use the phrase @conda-forge-admin, please rerender in a comment in this PR for automated rerendering)
  • Ensured the license file is being packaged.

Patch pytorch to use the system mkldnn library from the onednn package rather than building one locally from within ideep submodules. Given that ideep itself is a header-only library, I presume this is what was meant in #108 (comment), and indeed unvendoring onednn seems to improve build time significantly.

That said, our onednn package does not support GPU runtime (conda-forge/onednn-feedstock#44) but at least according to my testing, that part of the library was not enabled by our PyTorch builds before (due to missing SYCL).

The patch is a bit hacky, and probably needs some polishing before being submitted upstream (and testing on other platforms).

Part of issue #108

@conda-forge-admin
Copy link
Contributor

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe/meta.yaml) and found it was in an excellent condition.

Copy link
Member

@beckermr beckermr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am confused by this PR. We need any libraries linked by pytorch to be provided by conda-forge.

Patch pytorch to use the system mkldnn library from the onednn package
rather than building one locally from within ideep submodules.  Given
that ideep itself is a header-only library, I presume this is what
was meant in conda-forge#108 (comment),
and indeed unvendoring onednn seems to improve build time significantly.

That said, our onednn package does not support GPU runtime
(conda-forge/onednn-feedstock#44) but at least
according to my testing, that part of the library was not enabled by our
PyTorch builds before (due to missing SYCL).

The patch is a bit hacky, and probably needs some polishing before being
submitted upstream (and testing on other platforms).

Part of issue conda-forge#108
@mgorny
Copy link
Contributor Author

mgorny commented Nov 8, 2024

I am confused by this PR. We need any libraries linked by pytorch to be provided by conda-forge.

onednn is provided by conda-forge. For some (historical?) reason it's named mkldnn in PyTorch CMake files.

@beckermr
Copy link
Member

beckermr commented Nov 8, 2024

Ahh I see. LGTM!

@beckermr beckermr dismissed their stale review November 8, 2024 16:55

I was confused!

@mgorny mgorny marked this pull request as draft November 8, 2024 16:57
@mgorny
Copy link
Contributor Author

mgorny commented Nov 8, 2024

Hmm, I also should make the onednn dependency conditional. Need to test it a little more too.

@mgorny
Copy link
Contributor Author

mgorny commented Nov 8, 2024

This will also require a patch to onednn-feedstock, to enable experimental ukernel. I will submit it once I finish testing (probably next week).

mgorny added a commit to mgorny/onednn-feedstock that referenced this pull request Nov 9, 2024
In order to unvendor onednn from PyTorch feedstock
(conda-forge/pytorch-cpu-feedstock#289), onednn needs to be built with
DNNL_EXPERIMENTAL_UKERNEL enabled.

Please also note that the current version of PyTorch requires onednn
3.5.3, so we are also going to request creating a branch with
the equivalent patch added for that version.
@hmaarrfk
Copy link
Contributor

hmaarrfk commented Nov 9, 2024

You should keep the feature set consistent between windows/unix if you can.

  • Modify the .bat script as well

You might need to disable things for non-x86_64 arch. You can test target_platform in the sh file.

@conda-forge-admin
Copy link
Contributor

Hi! This is the friendly automated conda-forge-linting service.

I wanted to let you know that I linted all conda-recipes in your PR (recipe/meta.yaml) and found some lint.

Here's what I've got...

For recipe/meta.yaml:

  • ❌ Selectors are suggested to take a <two spaces>#<one space>[<expression>] form. See lines [141, 265]

@hmaarrfk
Copy link
Contributor

@conda-forge-admin please rerender

@hmaarrfk
Copy link
Contributor

Lets see how the improvement translates to azure.

Copy link
Member

@h-vetinari h-vetinari left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The direction here looks very good (custom channel aside of course). 👍

Comment on lines +1 to +4
diff --git a/CMakeLists.txt b/CMakeLists.txt
index 98593c2d..94a9d63d 100644
--- a/CMakeLists.txt
+++ b/CMakeLists.txt
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For quickly applying the patches as they exist on the feedstock (i.e. if it's not you who did the last change to them), it's nicer to generate them with git format-patch <tag> --no-signature.

That has several advantages:

  • commit message can be used to provide context (this patch is pretty self-explanatory, but in general...)
  • authorship is easily visible (who to ask in case there are questions about intent, e.g. when rebasing to new versions)
  • easily apply them to a source-checkout in one go using e.g.
    find ../path/to/feedstock/recipe/patches/ | xargs git am
  • having a local branch of patches makes it much easier to rebase the changes to a new version
  • etc.

So while raw diff's are technically possible, the output of git format-patch is preferred.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It was just my dirty proof-of-concept. Ideally, I'd prefer to make something suitable for upstreaming.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants