Adobe · Arazzo Workflow
Adobe OCR a Scanned PDF
Version 1.0.0
Upload a scanned PDF, run OCR to make it searchable, poll the job, and fetch the result.
View Spec
View on GitHub
AnalyticsCreative CloudDigital Asset ManagementDocument ServicesE-CommerceE-SignaturesExperience CloudGenerative AIMarketingPDFWork ManagementArazzoWorkflows
Provider
Workflows
ocr-pdf
Make a scanned PDF searchable with optical character recognition.
Requests an upload slot for the source PDF, submits an ocrPDF job, polls job status until OCR finishes, and retrieves the download URI for the searchable PDF.
1
requestUpload
uploadAsset
Request a pre-signed upload URI and asset ID for the source PDF, which is then PUT to the returned uploadUri out of band.
2
submitOcr
ocrPDF
Submit an asynchronous ocrPDF job that makes the uploaded PDF searchable. Returns 201 with an in-progress job status.
3
pollStatus
getJobStatus
Poll the ocrPDF job until it is no longer in progress, looping back while the status remains "in progress".
4
getOutput
getAsset
Resolve a pre-signed download URI for the searchable output PDF.