Skip to content

Enabling of the Performance model for NNPA / z17#3383

Open
AlexandreEichenberger wants to merge 20 commits intoonnx:mainfrom
AlexandreEichenberger:missing-model-z17
Open

Enabling of the Performance model for NNPA / z17#3383
AlexandreEichenberger wants to merge 20 commits intoonnx:mainfrom
AlexandreEichenberger:missing-model-z17

Conversation

@AlexandreEichenberger
Copy link
Collaborator

In Granite embedding, I noticed that we are sending a scalar SQRT to NNPA because the current policy is to send any legal operations to the NNPA.

After adding support to missing new operations in z17 (sqrt & leaky relu), I noticed that some patterns transforming ONNX ops to ZHigh ops were firing without considerations for placement (aka the pass that determines where ops should run, NNPA vs CPU). So I added some additional conditions to rules for the ONNX -> ZHigh transition.

I also modified a bit the handling of dynamic dimensions expected values when plugged into the performance models for CPU and NNPA.

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
…ed as faster than CPU)

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant