Hyperbrowser · Arazzo Workflow
Hyperbrowser Crawl Site and Retrieve
Version 1.0.0
Start a crawl from a seed URL, poll status, then page through the results.
View Spec
View on GitHub
Headless BrowserBrowser InfrastructureWeb ScrapingWeb CrawlingData ExtractionAI AgentsBrowser AutomationComputer UseStealthProxiesCAPTCHA SolvingMCPHyperAgentX402ArazzoWorkflows
Provider
Workflows
crawl-site-and-retrieve
Crawl a site from a seed URL and return the crawled page content.
Submits a crawl job, waits for completion by polling status, and pulls the first batch of crawled pages once the job finishes.
1
startCrawl
post-api-crawl
Submit a crawl job for the seed URL with the supplied page budget and capture the returned jobId.
2
pollStatus
get-api-crawl-id-status
Poll the crawl job status. The status moves through pending and running before reaching completed, failed, or stopped; loop back while still in progress and branch out on a terminal state.
3
getResults
get-api-crawl-id
Fetch the first batch of crawled pages from the completed crawl job along with the total crawled page count and batch counters.