org.apache.mahout.sparkbindings.drm
underlying rdd to wrap over.
number of rows; if unspecified, we will compute with an inexpensive traversal.
number of columns; if unspecified, we will try to guess with an inexpensive traversal.
cache level to use. (Implementors usually want to override the default!)
unique partitioning tag. Used to detect identically partitioned operands.
true if the matrix is int-keyed, and if it also may have missing rows (will require a lazy fix for some physical operations.
cache level to use.
cache level to use. (Implementors usually want to override the default!)
Action operator -- does not necessary means Spark action; but does mean running BLAS optimizer and writing down Spark graph lineage since last checkpointed DRM.
Action operator -- does not necessary means Spark action; but does mean running BLAS optimizer and writing down Spark graph lineage since last checkpointed DRM.
Collecting DRM to fron-end in-core Matrix.
Collecting DRM to fron-end in-core Matrix.
If key in DRM is Int, then matrix is collected using key as row index. Otherwise, order of rows in result is undefined but key.toString is applied as rowLabelBindings of the in-core matrix .
Note that this pre-allocates target matrix and then assigns collected RDD to it thus this likely would require about 2 times the RDD memory
Dump matrix as computed Mahout's DRM into specified (HD)FS path
Dump matrix as computed Mahout's DRM into specified (HD)FS path
output path to dump Matrix to
Explicit extraction of key class Tag
Explicit extraction of key class Tag
Changes the number of rows in the DRM without actually touching the underlying data.
Changes the number of rows in the DRM without actually touching the underlying data. Used to redimension a DRM after it has been created, which implies some blank, non-existent rows.
new row dimension
unique partitioning tag.
unique partitioning tag. Used to detect identically partitioned operands.
if matrix was previously persisted into cache, delete cached representation
if matrix was previously persisted into cache, delete cached representation
Spark-specific optimizer-checkpointed DRM.
matrix key type (e.g. the keys of sequence files once persisted)