Skip to content

Does magi support zigzag dispatch pattern? #196

@jordane95

Description

@jordane95

Hi, I'm trying to use magi attention as backend in mcore to replace nvte. But it seems that a lot of adaptations is required: cp split, rope, undispatch. The complexity comes from the custom comm pattern adopted by magi designed for specific attn mask. I just want to use regular varlen + casual mask + swa, so simple zigzag would be efficient enough. I'm wondering if there is support for using zigzag dispatch mode so I won't need to change data dispatch and rope part.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions