We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent a00e6b5 commit b656b2dCopy full SHA for b656b2d
python/mlc_llm/serve/engine.py
@@ -1856,7 +1856,7 @@ def _generate( # pylint: disable=too-many-locals
1856
generation_config: GenerationConfig,
1857
request_id: str,
1858
) -> Iterator[List[engine_base.CallbackStreamOutput]]:
1859
- """Internal synchronous text generation interface of AsyncMLCEngine.
+ """Internal synchronous text generation interface of MLCEngine.
1860
The method is a coroutine that streams a list of CallbackStreamOutput
1861
at a time via yield. The returned list length is the number of
1862
parallel generations specified by `generation_config.n`
0 commit comments