Hyperbolic · Arazzo Workflow
Hyperbolic Generate And Describe Image
Version 1.0.0
Render an image with diffusion, then describe it with a vision LLM and narrate the caption.
View Spec
View on GitHub
AIArtificial IntelligenceComputeDecentralizedDePINGPUImage GenerationInferenceLLMMarketplaceOpen SourceArazzoWorkflows
Provider
Workflows
generate-and-describe-image
Generate an image, caption it with a vision model, and narrate the caption.
Renders an image from a prompt, feeds the returned base64 image as a data URI to a vision chat model for captioning, then converts the caption to speech audio.
1
render
generateImage
Generate a single image from the supplied prompt and capture its base64 bytes.
2
caption
createChatCompletion
Pass the generated image to a vision model as a data URI and ask for a one sentence caption.
3
narrate
generateAudio
Convert the generated caption to speech audio and capture the base64 audio and its duration.
Source API Descriptions
openapi
openapi