Apache Spark is a fast and general-purpose cluster computing system. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs. It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, MLlib for machine learning, GraphX for graph processing, and Spark Streaming.
TIBCO ActiveMatrix BusinessWorks Plug-in for Apache Spark – Community Edition plugs into TIBCO ActiveMatrix BusinessWorks. You can use this plug-in to configure a connection to Spark server, and then use activities to run Spark SQL, execute spark Scala code and submit spark job.
The plug-in provides the following main features:
Spark Connection Shared Resource
You can use the Spark connection shared resource to connect Spark Livy server. The shared resource is used by the Spark activities.
Spark SQL Activity
You can use this activity to run spark sql.
Spark Execution Activity
You can use this activity to submit spark code and jobs.
Wait for Completion Activity
You can use this activity to wait for spark execution completion.