Commit 0c58ba3
rpc : reuse compute graph buffers (#21299)
Reuse the buffer for the ggml context which is used for creating the
compute graph on the server side. This partially addresses a memory leak
created by the CUDA backend due to using buffer addresses as cache
keys.
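
The interaction described above can be sketched in isolation. This is a minimal illustration, not the code from this commit: `address_keyed_cache`, `run_with_fresh_buffers`, and `run_with_reused_buffer` are hypothetical names, and the cache is only a stand-in for a backend that uses buffer addresses as keys. It assumes the server previously allocated a new context buffer per request.

```cpp
#include <cstddef>
#include <cstdint>
#include <memory>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical stand-in for a backend cache that uses buffer addresses as keys.
struct address_keyed_cache {
    std::unordered_map<const void *, int> entries;
    void touch(const void * key) { entries[key] += 1; }
};

// Allocates a new buffer for every request. Each previously unseen address
// adds an entry to the cache, and nothing ever evicts the old ones.
size_t run_with_fresh_buffers(int n_requests, size_t buf_size) {
    address_keyed_cache cache;
    std::unique_ptr<uint8_t[]> prev;
    for (int i = 0; i < n_requests; i++) {
        // The new buffer is allocated while the previous one is still alive,
        // so two consecutive requests never share an address.
        std::unique_ptr<uint8_t[]> cur(new uint8_t[buf_size]);
        cache.touch(cur.get());
        prev = std::move(cur);
    }
    return cache.entries.size();
}

// Keeps a single buffer across requests and grows it only when needed,
// so the address seen by the cache stays the same while the size is stable.
size_t run_with_reused_buffer(int n_requests, size_t buf_size) {
    address_keyed_cache cache;
    std::vector<uint8_t> buf;
    for (int i = 0; i < n_requests; i++) {
        if (buf.size() < buf_size) {
            buf.resize(buf_size);
        }
        cache.touch(buf.data());
    }
    return cache.entries.size();
}
```

With a fixed request size, `run_with_reused_buffer` leaves exactly one cache entry, while `run_with_fresh_buffers` leaves at least two. Growing the reused buffer can still move it to a new address, which is one reason such a change would only partially address the leak.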
ref: #21265
ref: #20315
1 parent 57ace0d commit 0c58ba3
1 file changed, +6 -5 lines changed

| Original lines | New lines | Removed (original line numbers) | Added (new line numbers) |
|---|---|---|---|
| 1009-1016 | 1009-1016 | 1012-1013 | 1012-1013 |
| 1518-1527 | 1518-1529 | 1521, 1524 | 1521-1523, 1526 |
| 1551-1557 | 1553-1558 | 1554 | none |