UPSTREAM PR #17077: HIP: RDNA4 tensor core support for MMF #118
Access the complete analysis in the LOCI Dashboard

Performance Analysis Summary
Overview
Analysis of project_id
Key Findings
Performance Metrics:
Core Function Impact:
Power Consumption Analysis:
Flame Graph and CFG Analysis:
GitHub Code Review:
Conclusion:
Access the complete analysis in the LOCI Dashboard

Performance Analysis Summary
Overview
Analysis of llama.cpp project comparing versions
Key Findings
Performance Metrics:
Core Function Impact:
Power Consumption Analysis:
Flame Graph and CFG Analysis:
GitHub Code Review Insights:
Conclusion:
Mirrored from ggml-org/llama.cpp#17077
Add RDNA4 tensor core support for MMF. Honestly, the performance is lower than expected. The model is at https://huggingface.co/Mungert/DeepSeek-R1-0528-Qwen3-8B-GGUF