KServe Inference API
KServe (formerly KFServing) provides a serverless model inference API on Kubernetes, supporting standardized prediction protocols, autoscaling, and multi-framework model serving.
Documentation
Specifications
SDKs
OpenAPI
#Inference
#Model Serving
#Predictions
#Serverless
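
As a rough illustration of the "standardized prediction protocols" mentioned in the description, the sketch below builds request URLs and JSON bodies for KServe's V1 protocol (`/v1/models/<name>:predict` with an `instances` list) and its V2 / Open Inference Protocol (`/v2/models/<name>/infer` with an `inputs` tensor list). The host `http://sklearn-iris.example.com`, the model name `sklearn-iris`, the tensor name `input-0`, and both helper functions are hypothetical and chosen only for this example. No network call is made; the output would be passed to whatever HTTP client you use.

```python
import json


def build_v1_predict_request(base_url, model_name, instances):
    """Return (url, body) for a V1 protocol predict call.

    Hypothetical helper: it only assembles the path and JSON payload.
    """
    url = f"{base_url.rstrip('/')}/v1/models/{model_name}:predict"
    body = json.dumps({"instances": instances})
    return url, body


def build_v2_infer_request(base_url, model_name, tensor_name, shape, datatype, data):
    """Return (url, body) for a V2 (Open Inference Protocol) infer call.

    Hypothetical helper: a single input tensor is assumed for brevity.
    """
    url = f"{base_url.rstrip('/')}/v2/models/{model_name}/infer"
    payload = {
        "inputs": [
            {
                "name": tensor_name,
                "shape": shape,
                "datatype": datatype,
                "data": data,
            }
        ]
    }
    return url, json.dumps(payload)


if __name__ == "__main__":
    # Assumed values; replace with your own InferenceService host and model.
    host = "http://sklearn-iris.example.com"
    row = [6.8, 2.8, 4.8, 1.4]

    v1_url, v1_body = build_v1_predict_request(host, "sklearn-iris", [row])
    print(v1_url)
    print(v1_body)

    v2_url, v2_body = build_v2_infer_request(
        host, "sklearn-iris", "input-0", [1, 4], "FP32", row
    )
    print(v2_url)
    print(v2_body)
```

Which protocol a deployed model accepts depends on how its InferenceService is configured, so check the deployment before picking one of the two request shapes.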