Azure Databricks · Arazzo Workflow
Azure Databricks Provision a Cluster and Run a Job on It
Version 1.0.0
Create a cluster, wait until RUNNING, create a job on it, then run it.
View Spec
View on GitHub
AnalyticsApache SparkBig DataData EngineeringMachine LearningArazzoWorkflows
Provider
Workflows
provision-cluster-and-run-job
Create a cluster, wait for RUNNING, then create and run a notebook job.
Chains createCluster, a getCluster poll to RUNNING, createJob bound to the new cluster, runJobNow, and a getJobRun poll to TERMINATED.
1
createCluster
createCluster
Create the cluster that will host the job. The cluster starts in PENDING and the cluster_id is returned.
2
pollCluster
getCluster
Retrieve the cluster state. Repeat until the cluster is RUNNING, then create the job; branch to failure if it terminates.
3
createJob
createJob
Create a notebook job bound to the newly provisioned cluster and capture the job_id.
4
runJobNow
runJobNow
Trigger an immediate run of the job and capture the run_id for polling.
5
pollRun
getJobRun
Retrieve the run state. Repeat until the run life_cycle_state is TERMINATED, then end with the final result_state.