NVIDIA NIM · Arazzo Workflow
NVIDIA NIM Voice Assistant Loop
Version 1.0.0
Transcribe an audio clip with Riva ASR, answer the transcript with an LLM, then synthesize the reply with Riva TTS.
View Spec
View on GitHub
AIArtificial IntelligenceInferenceMicroservicesLLMFoundation ModelsGPUKubernetesNVIDIAOpenAI CompatibleArazzoWorkflows
Provider
Workflows
voice-assistant-loop
Speech-to-text, chat answer, then text-to-speech in a single loop.
Transcribes an audio clip, generates a chat answer to the transcript, and synthesizes the answer back into audio.
1
transcribeAudio
createTranscription
Transcribe the uploaded audio clip into text using a Riva ASR NIM via a multipart/form-data upload.
2
answerTranscript
createChatCompletion
Send the transcript to a chat model to generate a spoken-style reply.
3
synthesizeReply
createSpeech
Synthesize the chat reply back into audio bytes using a Riva TTS NIM.
Source API Descriptions
openapi