vllm.entrypoints.serve.sleep.api_router ¶
attach_router ¶
engine_client ¶
engine_client(request: Request) -> EngineClient
vllm.entrypoints.serve.sleep.api_router ¶ attach_router ¶ engine_client ¶engine_client(request: Request) -> EngineClient
is_sleeping async ¶vllm/entrypoints/serve/sleep/api_router.py sleep async ¶vllm/entrypoints/serve/sleep/api_router.py wake_up async ¶vllm/entrypoints/serve/sleep/api_router.py