org.apache.mahout.drivers

ItemSimilarityDriver

object ItemSimilarityDriver extends MahoutSparkDriver

Command line interface for org.apache.mahout.math.cf.SimilarityAnalysis#cooccurrencesIDSs. Reads text lines that contain (row id, column id, ...). The IDs are user specified strings which will be preserved in the output. The individual elements will be accumulated into a matrix like org.apache.mahout.math.indexeddataset.IndexedDataset and org.apache.mahout.math.cf.SimilarityAnalysis#cooccurrencesIDSs will be used to calculate row-wise self-similarity, or when using filters or two inputs, will generate two matrices and calculate both the self-similarity of the primary matrix and the row-wise similarity of the primary to the secondary. Returns one or two directories of text files formatted as specified in the options. The options allow flexible control of the input schema, file discovery, output schema, and control of algorithm parameters. To get help run

mahout spark-itemsimilarity

for a full explanation of options. To process simple elements of text delimited values (userID,itemID) with or without a strengths and with a separator of tab, comma, or space, you can specify only the input and output file and directory--all else will default to the correct values. Each output line will contain the Item ID and similar items sorted by LLR strength descending. mahout spark-itemsimilarity }}} values (userID,itemID) with or without a strengths and with a separator of tab, comma, or space, you can specify only the input and output file and directory--all else will default to the correct values. Each output line will contain the Item ID and similar items sorted by LLR strength descending.

Note

To use with a Spark cluster see the --master option, if you run out of heap space check the --sparkExecutorMemory option. Other org.apache.spark.SparkConf key value pairs can be with the -D:k=v option.

Linear Supertypes
MahoutSparkDriver, MahoutDriver, AnyRef, Any
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. ItemSimilarityDriver
  2. MahoutSparkDriver
  3. MahoutDriver
  4. AnyRef
  5. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. var _useExistingContext: Boolean

    Definition Classes
    MahoutDriver
  7. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  8. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  9. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  10. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  11. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  12. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  13. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  14. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  15. def main(args: Array[String]): Unit

    Entry point, not using Scala App trait

    Entry point, not using Scala App trait

    args

    Command line args, if empty a help message is printed.

    Definition Classes
    ItemSimilarityDriver → MahoutDriver
  16. implicit var mc: DistributedContext

    Attributes
    protected
    Definition Classes
    MahoutDriver
  17. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  18. final def notify(): Unit

    Definition Classes
    AnyRef
  19. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  20. implicit var parser: MahoutOptionParser

    Attributes
    protected
    Definition Classes
    MahoutDriver
  21. def process(): Unit

    Definition Classes
    ItemSimilarityDriver → MahoutDriver
  22. implicit var sparkConf: SparkConf

    Definition Classes
    MahoutSparkDriver
  23. def start(): Unit

    Creates a Spark context to run the job inside.

    Creates a Spark context to run the job inside. Override to set the SparkConf values specific to the job, these must be set before the context is created.

    Attributes
    protected
    Definition Classes
    ItemSimilarityDriverMahoutSparkDriver → MahoutDriver
  24. def stop(): Unit

    Attributes
    protected
    Definition Classes
    MahoutDriver
  25. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  26. def toString(): String

    Definition Classes
    AnyRef → Any
  27. def useContext(context: DistributedContext): Unit

    Call this before start to use an existing context as when running multiple drivers from a scalatest suite.

    Call this before start to use an existing context as when running multiple drivers from a scalatest suite.

    context

    An already set up context to run against

    Definition Classes
    MahoutSparkDriver
  28. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  29. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  30. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from MahoutSparkDriver

Inherited from MahoutDriver

Inherited from AnyRef

Inherited from Any

Ungrouped