Commit 08fe6cb
Use dependency injection for runner (#53)
Summary:
X-link: pytorch/executorch#10326
Pass in runner components, move most of the instantiation logic from `load()` to a new static API `create()`.
This adds testability to runner components.
Next step would be moving most of the logic out into `extension/llm/runner/` so that it can be used on non-llama models.
Currently the logic for getting tokenizer instance should not assume llama, which I can modify in next diff.
Differential Revision: D731655461 parent def3a96 commit 08fe6cb
1 file changed
+4
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
56 | 56 | | |
57 | 57 | | |
58 | 58 | | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
59 | 63 | | |
60 | 64 | | |
61 | 65 | | |
| |||
0 commit comments