Jubatus: Distributed Online Machine Learning Framework

Jubatus is a distributed processing framework and streaming machine learning library. It provides the following:

  • Online Machine Learning Library: Classification, Regression, Recommendation (Nearest Neighbor Search), Graph Mining, Anomaly Detection, Clustering
  • Feature Vector Converter (fv_converter): Data Preprocessing and Feature Extraction
  • Framework for Distributed Online Machine Learning with Fault Tolerance
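
To make "online" learning and feature conversion concrete, here is a minimal single-process sketch in plain Python. It does not use the Jubatus API: `to_features`, `OnlinePerceptron`, and the sample records are hypothetical names invented for illustration, and the multiclass perceptron is only one simple choice of online algorithm. The point is that the model is updated one example at a time as data arrives, without storing the stream.

```python
def to_features(record):
    """Turn a flat dict into a sparse feature vector (hypothetical helper).

    String values become one-hot "key$value" features; numbers pass through.
    """
    features = {}
    for key, value in record.items():
        if isinstance(value, str):
            features[f"{key}${value}"] = 1.0
        else:
            features[key] = float(value)
    return features


class OnlinePerceptron:
    """Multiclass perceptron that learns from one example at a time."""

    def __init__(self):
        # label -> {feature name -> weight}
        self.weights = {}

    def score(self, label, features):
        w = self.weights[label]
        return sum(w.get(name, 0.0) * value for name, value in features.items())

    def classify(self, features):
        if not self.weights:
            return None  # nothing learned yet
        return max(self.weights, key=lambda label: self.score(label, features))

    def train(self, label, features):
        # Register unseen labels so they can be predicted later.
        true_w = self.weights.setdefault(label, {})
        predicted = self.classify(features)
        if predicted == label:
            return  # correct prediction: the perceptron leaves weights alone
        wrong_w = self.weights[predicted]
        for name, value in features.items():
            true_w[name] = true_w.get(name, 0.0) + value
            wrong_w[name] = wrong_w.get(name, 0.0) - value


# Illustrative stream of (label, record) pairs; the data is made up.
stream = [
    ("ham", {"subject": "meeting", "links": 0}),
    ("spam", {"subject": "sale", "links": 5}),
    ("ham", {"subject": "meeting", "links": 1}),
    ("spam", {"subject": "sale", "links": 4}),
]

model = OnlinePerceptron()
for label, record in stream:
    model.train(label, to_features(record))  # update immediately, keep no history

print(model.classify(to_features({"subject": "meeting", "links": 0})))
print(model.classify(to_features({"subject": "sale", "links": 3})))
```

Because each `train` call touches only the current example, memory use depends on the number of features and labels rather than on the length of the stream, which is the property a streaming library relies on.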

Jubatus can be integrated with Spark (JubaQL) and Python / scikit-learn (Jubakit).

Japanese documentation is also available.

Get Jubatus

The latest version of Jubatus is 1.1.1 (ChangeLog). Jubatus is distributed under the LGPL v2.1.

Other Resources