IBM InfoSphere Streams Version 4.1.0

Toolkit com.ibm.streamsx.sparkmllib 1.0.0

SPL standard and specialized toolkits > com.ibm.streamsx.sparkmllib 1.0.0

General Information

Apache Spark is a fast general purpose clustering system that is well suited for machine learning algorithms. MLlib is a machine learning library provided with Spark with support for common machine learning algorithms including classification, regression, collaborative filtering and others. This toolkit allows Spark's MLlib library to be used for real-time scoring of data in InfoSphere® Streams.

Developing and running applications that use the SparkMLLib Toolkit
To create applications that use the SparkMLLib Toolkit, you must configure either Streams Studio or the SPL compiler to be aware of the location of the toolkit.
Version
1.0.0
Required Product Version
4.0.1.0