Mahout 0.12.0 Features by Engine

| Feature | Single Machine | MapReduce | Spark | H2O | Flink |
|---|---|---|---|---|---|
| Mahout Math-Scala Core Library and Scala DSL | | | | | |
| Mahout Distributed BLAS. Distributed Row Matrix API with R- and MATLAB-like operators. Distributed ALS, SPCA, SSVD, thin-QR. Similarity Analysis. | | | x | x | x |
| Mahout Interactive Shell | | | | | |
| Interactive REPL shell for Spark-optimized Mahout DSL | | | x | | |
| Collaborative Filtering with CLI drivers | | | | | |
| User-Based Collaborative Filtering | deprecated | deprecated | x | | |
| Item-Based Collaborative Filtering | x | x | x | | |
| Matrix Factorization with ALS | x | x | | | |
| Matrix Factorization with ALS on Implicit Feedback | x | x | | | |
| Weighted Matrix Factorization, SVD++ | x | | | | |
| Classification with CLI drivers | | | | | |
| Logistic Regression - trained via SGD | deprecated | | | | |
| Naive Bayes / Complementary Naive Bayes | | deprecated | x | | |
| Hidden Markov Models | deprecated | | | | |
| Clustering with CLI drivers | | | | | |
| Canopy Clustering | deprecated | deprecated | | | |
| k-Means Clustering | deprecated | deprecated | | | |
| Fuzzy k-Means | deprecated | deprecated | | | |
| Streaming k-Means | deprecated | deprecated | | | |
| Spectral Clustering | | deprecated | | | |
| Dimensionality Reduction (note: most Scala-based dimensionality reduction algorithms are available through the Mahout Math-Scala Core Library for all engines) | | | | | |
| Singular Value Decomposition | deprecated | deprecated | x | x | x |
| Lanczos Algorithm | deprecated | deprecated | | | |
| Stochastic SVD | deprecated | deprecated | x | x | x |
| PCA (via Stochastic SVD) | deprecated | deprecated | x | x | x |
| QR Decomposition | deprecated | deprecated | x | x | x |
| Topic Models | | | | | |
| Latent Dirichlet Allocation | deprecated | deprecated | | | |
| Miscellaneous | | | | | |
| RowSimilarityJob | | deprecated | x | | |
| Collocations | | deprecated | | | |
| Sparse TF-IDF Vectors from Text | | deprecated | | | |
| XML Parsing | | deprecated | | | |
| Email Archive Parsing | | deprecated | | | |
| Evolutionary Processes | x | | | | |