KServe Open Inference Protocol API
KServe is a standardized distributed generative and predictive AI inference platform for scalable, multi-framework deployment on Kubernetes. It implements the Open Inference Protocol (OIP), also known as the KServe V2 Inference Protocol, which provides a standardized REST and gRPC interface for model inference across frameworks. KServe has been a CNCF incubating project since November 2025 and supports TensorFlow, PyTorch, scikit-learn, XGBoost, ONNX, vLLM, and Hugging Face.
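To make the REST side of the protocol concrete, here is a minimal sketch of how a client might build an OIP inference request using only the Python standard library. The request is built but not sent. The host `http://localhost:8080`, the model name `iris`, the input tensor name `input-0`, and the helper `build_infer_request` are all assumptions made for illustration; a real deployment defines its own endpoint, model name, and tensor names.

```python
import json
from urllib import request


def build_infer_request(base_url, model_name, input_name, rows):
    """Build (without sending) an Open Inference Protocol REST inference request.

    `rows` is a rectangular list of numeric rows; it is flattened in
    row-major order and described by an explicit `shape`.
    """
    n_cols = len(rows[0]) if rows else 0
    if any(len(row) != n_cols for row in rows):
        raise ValueError("all rows must have the same length")

    payload = {
        "inputs": [
            {
                "name": input_name,            # hypothetical tensor name
                "shape": [len(rows), n_cols],  # batch size x feature count
                "datatype": "FP32",
                "data": [float(v) for row in rows for v in row],
            }
        ]
    }
    # OIP REST inference path: /v2/models/{model_name}/infer
    url = f"{base_url.rstrip('/')}/v2/models/{model_name}/infer"
    return request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical endpoint and model, for illustration only.
req = build_infer_request(
    "http://localhost:8080",
    "iris",
    "input-0",
    [[6.8, 2.8, 4.8, 1.4], [6.0, 3.4, 4.5, 1.6]],
)
```

Passing `req` to `urllib.request.urlopen` would send it to a running server; the JSON response is expected to carry an `outputs` list whose entries use the same `name`, `shape`, `datatype`, and `data` fields.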
Documentation
https://kserve.github.io/website/docs/intro
Getting Started
https://kserve.github.io/website/docs/get_started/
Specifications
Other Resources
GitHub
https://github.com/kserve/kserve
Changelog
https://github.com/kserve/kserve/releases
SwaggerUI
https://kserve.github.io/website/latest/reference/swagger-ui/
OpenAPI
#CNCF
#Inference
#Kubernetes
#Model Serving
#Open Inference Protocol
#Open Source