Amazon EMR · Arazzo Workflow
Amazon EMR Run a Spark ETL Job
Version 1.0.0
Launch a Spark cluster and queue an ETL processing step in one call.
View Spec
View on GitHub
Amazon Web ServicesAnalyticsApache SparkBig DataData ProcessingHadoopArazzoWorkflows
Provider
Workflows
run-spark-etl-job
Run a Spark cluster with ETL processing steps queued.
Creates and starts a new EMR cluster with Spark installed and queues the supplied ETL processing steps to run once the cluster is provisioned, returning the identifier of the newly created cluster.
1
runSparkEtl
RunJobFlow
Create and start a new EMR cluster with Spark installed and queue the supplied ETL processing steps to run once the cluster is provisioned.