Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Adding details pertinent to EKS Auto Mode
Last week I reached out to AWS support. The NodePool provisioned nodes and the containers running on them did not see the neuron devices. The AWS Support wasn't aware of the QUICKSTART and it is only Provisioners that are described in AWS Documentation (the auto-generated response, before I got to speak to an engineer was: this is known issue, see screenshot)
I reached to internal slack channel where I got help. The critical piece was specifying the request so that the neuron-plugin exposes the device. And other critical piece was: EKS Auto Mode should work.
Here is a screenshot of AWS support Q:
Testing done:
After the changes, deployed via flyte. And tested the devices were doing inference. Here is my pod definition:
Terms of contribution:
By submitting this pull request, I agree that this contribution is dual-licensed under the terms of both the Apache License, version 2.0, and the MIT license.