Sync from v0.13
This commit is contained in:
5
docs/deployment/integrations/llmaz.md
Normal file
5
docs/deployment/integrations/llmaz.md
Normal file
@@ -0,0 +1,5 @@
|
||||
# llmaz
|
||||
|
||||
[llmaz](https://github.com/InftyAI/llmaz) is an easy-to-use and advanced inference platform for large language models on Kubernetes, aimed for production use. It uses vLLM as the default model serving backend.
|
||||
|
||||
Please refer to the [Quick Start](https://github.com/InftyAI/llmaz?tab=readme-ov-file#quick-start) for more details.
|
||||
Reference in New Issue
Block a user