Mahout 0.12.0 Features by Engine

Single Machine MapReduce Spark H2O Flink
Mahout Math-Scala Core Library and Scala DSL
Mahout Distributed BLAS. Distributed Row Matrix API with R and Matlab like operators. Distributed ALS, SPCA, SSVD, thin-QR. Similarity Analysis. x x x
Mahout Interactive Shell
Interactive REPL shell for Spark optimized Mahout DSL x
Collaborative Filtering with CLI drivers
User-Based Collaborative Filtering deprecated deprecated x
Item-Based Collaborative Filtering x x x
Matrix Factorization with ALS x x
Matrix Factorization with ALS on Implicit Feedback x x
Weighted Matrix Factorization, SVD++ x
Classification with CLI drivers
Logistic Regression - trained via SGD deprecated
Naive Bayes / Complementary Naive Bayes deprecated x
Hidden Markov Models deprecated
Clustering with CLI drivers
Canopy Clustering deprecated deprecated
k-Means Clustering deprecated deprecated
Fuzzy k-Means deprecated deprecated
Streaming k-Means deprecated deprecated
Spectral Clustering deprecated
Dimensionality Reduction note: most scala-based dimensionality reduction algorithms are available through the Mahout Math-Scala Core Library for all engines
Singular Value Decomposition deprecated deprecated x x x
Lanczos Algorithm deprecated deprecated
Stochastic SVD deprecated deprecated x x x
PCA (via Stochastic SVD) deprecated deprecated x x x
QR Decomposition deprecated deprecated x x x
Topic Models
Latent Dirichlet Allocation deprecated deprecated
Miscellaneous
RowSimilarityJob deprecated x
Collocations deprecated
Sparse TF-IDF Vectors from Text deprecated
XML Parsing deprecated
Email Archive Parsing deprecated
Evolutionary Processes x