Last Update: 2026-03-09
Not official. These documents are not official project documentation. They were generated by reverse-engineering the source code and may be incomplete, inaccurate, or out of date. Use them for reference only.
This repository contains architectural views and related artifacts for the LLM‑D project, including:
- LLM-D_Inference_Scheduler.md
- A concise, developer-focused reference that provides behavioral and structural models describing the
Inference Scheduler's runtime behavior and the structural relationships between key classifiers including those that contain configurable attributes (e.g. configurable via YAML). Use it to locate where to implement new scheduling logic, and to infer which attributes affect which plugins.
- A concise, developer-focused reference that provides behavioral and structural models describing the
- LLM-D_Inference_Scheduler (Short).md
- A compact, slide-style overview of the above suitable for quick presentations.