NVIDIA NIM · Arazzo Workflow
NVIDIA NIM Health Gated Completion
Version 1.0.0
Check a self-hosted NIM container's readiness, and only run a text completion once the engine reports ready.
View Spec
View on GitHub
AIArtificial IntelligenceInferenceMicroservicesLLMFoundation ModelsGPUKubernetesNVIDIAOpenAI CompatibleArazzoWorkflows
Provider
Workflows
health-gated-completion
Gate a text completion behind the container readiness probe.
Checks the readiness probe, and only when the engine is ready lists models and issues a text completion; otherwise ends without inference.
1
checkReadiness
getReadiness
Probe the container readiness endpoint. A 200 means the model engine has loaded and the container can accept traffic.
2
listAvailableModels
listModels
List the models the ready container serves to confirm the requested model is available before inference.
3
runCompletion
createCompletion
Run a legacy text completion against the requested model now that the container is confirmed ready.
Source API Descriptions
openapi
openapi