Mindee · Arazzo Workflow
Mindee OCR Then Extract
Version 1.0.0
Run OCR over a document to capture its raw text, then extract structured fields from the same file, reading both outputs.
View Spec
View on GitHub
Document ParsingOCRIDPAIMachine LearningInvoicesReceiptsIDsComputer VisionArazzoWorkflows
Provider
Workflows
ocr-then-extract
OCR a document for raw text, then extract structured fields from it.
Runs OCR to capture per-page text, then enqueues the same file for extraction, polling each job to completion and reading the pages and the extracted fields.
1
enqueueOcr
Enqueue_OCR_Product_Inference_v2_products_ocr_enqueue_post
Send the document to the asynchronous OCR queue to capture its full text.
2
pollOcr
Get_Job_Status_v2_jobs__job_id__get
Poll the shared jobs endpoint until the OCR job reports Processed or Failed.
3
getOcr
Get_OCR_Product_Result_v2_products_ocr_results__inference_id__get
Read the recognized per-page text from the completed OCR inference.
4
enqueueExtraction
Enqueue_Extraction_Product_Inference_v2_products_extraction_enqueue_post
Send the same file to the extraction queue against the chosen extraction model to read its structured fields.
5
pollExtraction
Get_Job_Status_v2_jobs__job_id__get
Poll the shared jobs endpoint until the extraction job reports Processed or Failed.
6
getExtraction
Get_Extraction_Product_Result_v2_products_extraction_results__inference_id__get
Retrieve the completed extraction inference and read the structured fields parsed from the document.
Source API Descriptions
openapi
openapi