Files
enginex-mlu370-vllm/vllm-v0.6.2/docs/source/serving/metrics.rst
2026-02-04 17:22:39 +08:00

14 lines
418 B
ReStructuredText

Production Metrics
==================
vLLM exposes a number of metrics that can be used to monitor the health of the
system. These metrics are exposed via the `/metrics` endpoint on the vLLM
OpenAI compatible API server.
The following metrics are exposed:
.. literalinclude:: ../../../vllm/engine/metrics.py
:language: python
:start-after: begin-metrics-definitions
:end-before: end-metrics-definitions