Skip to content

Conversation

@p-lanza
Copy link

@p-lanza p-lanza commented Nov 3, 2025

Tested on phi 3.5 model and GQA conversion is working as expected

TODO:

  • Convert com.microsoft.RotaryEmbedding

@p-lanza p-lanza closed this Nov 4, 2025
@p-lanza p-lanza reopened this Nov 4, 2025
@p-lanza p-lanza requested review from roberteg16 and ttjost November 4, 2025 14:39
@p-lanza p-lanza force-pushed the planzase.convert_onnx_to_ms_gqa_and_rope branch 2 times, most recently from 19ae705 to 6685d83 Compare November 10, 2025 09:46
@p-lanza p-lanza force-pushed the planzase.convert_onnx_to_ms_gqa_and_rope branch from 6685d83 to b8f0faa Compare November 10, 2025 09:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants