xiajunshi

Changes: #785
Key Updates
Updated setup.py to work around compilation issues.
Updated triton_attention.py to avoid running out of shared memory.
Updated custom_gguf.py to avoid the numpy non-writable problem.
Marlin op is currently disabled.
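
For readers unfamiliar with the numpy non-writable problem mentioned above, here is a minimal sketch of how it typically arises and one common way around it. This is not the actual change made to custom_gguf.py; the helper name `as_writable` and the sample buffer are hypothetical and exist only for illustration. Arrays created with `np.frombuffer` over immutable bytes (and, similarly, read-only memory maps) are flagged read-only, so in-place writes fail and downstream libraries may warn about them.

```python
import numpy as np


def as_writable(arr: np.ndarray) -> np.ndarray:
    """Hypothetical helper: return arr itself if writable, else a writable copy."""
    if arr.flags.writeable:
        return arr
    # copy() allocates new memory owned by the array, which is writable.
    return arr.copy()


# Stand-in for raw tensor bytes read from a file (illustrative data only).
raw = bytes(range(8))

# frombuffer over an immutable bytes object yields a read-only view.
view = np.frombuffer(raw, dtype=np.uint8)

# Take a writable copy before modifying the data or handing it onward.
safe = as_writable(view)
safe[0] = 255
```

The copy costs extra memory proportional to the array size, so a loader may prefer to copy only the slices it actually needs to modify.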

remove "All Rights Reserved "
remove "All Rights Reserved."
remove "All Rights Reserved."