MLlib.

spark_mlib

Spark provides a machine learning library known as MLlib. Its goal is to make practical machine learning scalable and easy.

Spark MLlib provides various machine learning algorithms such as classification, regression, clustering, and collaborative filtering. It also provides tools such as featurization, pipelines, persistence, and utilities for handling linear algebra operations, statistics and data handling.

At a high level, it provides tools such as:

  • ML Algorithms: Common learning algorithms such as classification, regression, clustering, and collaborative filtering.
  • Featurization: Feature extraction, transformation, dimensionality reduction, and selection.
  • Pipelines: Tools for constructing, evaluating, and tuning ML Pipelines.
  • Persistence: Saving and load algorithms, models, and Pipelines.
  • Utilities: Linear algebra, statistics, data handling, etc.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.