NVIDIA NIM · Arazzo Workflow
NVIDIA NIM RAG Rerank And Answer
Version 1.0.0
Embed a query, rerank candidate passages against it, then answer the question grounded in the top passage.
View Spec
View on GitHub
AIArtificial IntelligenceInferenceMicroservicesLLMFoundation ModelsGPUKubernetesNVIDIAOpenAI CompatibleArazzoWorkflows
Provider
Workflows
rag-rerank-answer
Embed a query, rerank candidate passages, and answer grounded in the best one.
Generates a query embedding, scores candidate passages with a reranker, selects the most relevant passage, and asks the chat model to answer using that passage as context.
1
embedQuery
createEmbedding
Generate a dense embedding vector for the query using a NeMo Retriever embedding model with the asymmetric query input type.
2
rerankPassages
rankPassages
Score every candidate passage against the query with a cross-encoder reranker so the most relevant passage can be selected.
3
answerWithContext
createChatCompletion
Compose a grounded answer by giving the chat model the highest-ranked passage as system context alongside the original query.
Source API Descriptions
openapi
openapi