vllm-ascend

Here are 2 public repositories matching this topic...

Deploy GLM-4.6V-Flash (9B dense VLM) on Huawei Ascend 910B NPU with vLLM - multimodal, OpenAI API, single/dual-card serving, reproducible benchmarks.

glm cann multimodal npu ascend vision-language-model vllm llm-inference huawei-ascend vllm-ascend

Mooncake、vLLM、vLLM Ascend 相关的中文教程。

mooncake vllm vllm-ascend

Add a description, image, and links to the vllm-ascend topic page so that developers can more easily learn about it.

To associate your repository with the vllm-ascend topic, visit your repo's landing page and select "manage topics."