You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CONTRIBUTING.md
+3Lines changed: 3 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,6 +16,9 @@
16
16
- Use the following format for the squashed commit title: `<module> : <commit title> (#<issue_number>)`. For example: `utils : fix typo in utils.py (#1234)`
17
17
- Optionally pick a `<module>` from here: https://github.com/ggml-org/llama.cpp/wiki/Modules
18
18
- Consider adding yourself to [CODEOWNERS](CODEOWNERS)
19
+
- Let authors, who are also collaborators, merge their own PRs
20
+
- When merging a PR by a contributor, make sure you have a good understanding of the changes
21
+
- Be mindful of maintenance: most of the work going into a feature happens after the PR is merged. If the PR author is not committed to contribute long-term, someone else needs to take responsibility (you)
Copy file name to clipboardExpand all lines: docs/backend/CANN.md
+4Lines changed: 4 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -314,3 +314,7 @@ Converting the matmul weight format from ND to NZ to improve performance. Enable
314
314
### GGML_CANN_ACL_GRAPH
315
315
316
316
Operators are executed using ACL graph execution, rather than in op-by-op (eager) mode. Enabled by default.
317
+
318
+
### GGML_CANN_GRAPH_CACHE_CAPACITY
319
+
320
+
Maximum number of compiled CANN graphs kept in the LRU cache, default is 12. When the number of cached graphs exceeds this capacity, the least recently used graph will be evicted.
0 commit comments