Triton Metrics API
A Prometheus-compatible metrics API for monitoring server and model performance, including inference request counts, latencies, GPU utilization, and memory usage.
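As a rough illustration of how such metrics might be consumed without a full Prometheus server, the sketch below fetches the plain-text metrics page and parses it into tuples. It assumes Triton's default metrics address (`http://localhost:8002/metrics`); the helper names `parse_metrics` and `fetch_metrics` are hypothetical and not part of Triton. The parser is simplified: it skips comment lines and does not unescape label values.

```python
import re
import urllib.request

# Assumption: default Triton metrics address; adjust host and port for your deployment.
METRICS_URL = "http://localhost:8002/metrics"

# One sample line: metric name, optional {label="value",...} block, numeric value.
_SAMPLE_RE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)')
_LABEL_RE = re.compile(r'([a-zA-Z_][a-zA-Z0-9_]*)="((?:[^"\\]|\\.)*)"')


def parse_metrics(text):
    """Parse Prometheus text exposition format into (name, labels, value) tuples."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        match = _SAMPLE_RE.match(line)
        if match is None:
            continue
        name, raw_labels, value = match.groups()
        labels = dict(_LABEL_RE.findall(raw_labels or ""))
        samples.append((name, labels, float(value)))
    return samples


def fetch_metrics(url=METRICS_URL, timeout=5.0):
    """Fetch the metrics page over HTTP and return the parsed samples."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_metrics(resp.read().decode("utf-8"))
```

Calling `fetch_metrics()` against a running server would return every sample on the page; filtering on a name such as `nv_inference_request_success` or `nv_gpu_utilization` then gives per-model or per-GPU values. Which metrics appear depends on the server configuration and hardware.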
Documentation
https://github.com/triton-inference-server/server/blob/main/docs/user_guide/metrics.md
Specifications
Other Resources
OpenAPI
#Metrics
#Monitoring
#Observability
#Prometheus