API Evangelist API Evangelist
API Learnings
Toolbox
API Evangelist LLC

KServe Open Inference Protocol API

KServe implements the Open Inference Protocol (OIP), also known as the KServe V2 Inference Protocol, which provides a standardized REST and gRPC interface for model inference across frameworks. KServe is a standardized distributed generative and predictive AI inference platform for scalable, multi-framework deployment on Kubernetes. CNCF incubating project since November 2025. Supports TensorFlow, PyTorch, scikit-learn, XGBoost, ONNX, vLLM, and HuggingFace.

Documentation

Specifications

Other Resources

OpenAPI

kserve-open-inference-protocol-openapi.yml Raw ↑