You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Computing RoPE for Gemma Like Models in NPU has some accuracy issues.
This patch adds a matcher pass in avoid.cpp of npuw pipeline to create
an NPU avoid pattern for RoPE computation in Gemma like models.
This is added as an avoid pattern and to enable this the below flag :
"NPUW_ONLINE_AVOID": "P:GemmaRoPE/NPU" must be added to config. To
recognize this pattern check is added to "snapshot.cpp".
Signed-off-by: Ghosh, Tamoghna <[email protected]>
---------
Signed-off-by: Ghosh, Tamoghna <[email protected]>
Signed-off-by: tamoghna <[email protected]>
Co-authored-by: Alexey Smirnov <[email protected]>
0 commit comments