
A

AbstractBooleanPrefJDBCDataModel - Class in org.apache.mahout.cf.taste.impl.model.jdbc
 
AbstractBooleanPrefJDBCDataModel(DataSource, String, String, String, String, String, String, String, String, String, String, String, String, String, String, String, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractBooleanPrefJDBCDataModel
 
AbstractClusterWriter - Class in org.apache.mahout.utils.clustering
Base class for implementing ClusterWriter
AbstractClusterWriter(Writer, Map<Integer, List<WeightedPropertyVectorWritable>>, DistanceMeasure) - Constructor for class org.apache.mahout.utils.clustering.AbstractClusterWriter
 
AbstractJDBCDataModel - Class in org.apache.mahout.cf.taste.impl.model.jdbc
An abstract superclass for JDBCDataModel implementations, providing most of the common functionality that any such implementation would need.
AbstractJDBCDataModel(DataSource, String, String, String, String, String, String, String, String, String, String, String, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
AbstractJDBCDataModel(DataSource, String, String, String, String, String, String, String, String, String, String, String, String, String, String, String, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
AbstractJDBCItemSimilarity - Class in org.apache.mahout.cf.taste.impl.similarity.jdbc
An ItemSimilarity which draws pre-computed item-item similarities from a database table via JDBC.
AbstractJDBCItemSimilarity(DataSource, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.AbstractJDBCItemSimilarity
 
AbstractJDBCItemSimilarity(DataSource, String, String, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.AbstractJDBCItemSimilarity
 
AbstractLuceneIterator - Class in org.apache.mahout.utils.vectors.lucene
Iterate over a Lucene index, extracting term vectors.
AbstractLuceneIterator(TermInfo, double, IndexReader, Weight, double, String) - Constructor for class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 
accept(Path) - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
addDateFormat(Integer, DateFormat) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
addDateFormat(Integer, DateFormat) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
addLabel(String, Integer) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
addLabel(String, Integer) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
addNominal(String, String, int) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
addNominal(String, String, int) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
addOptions() - Method in class org.apache.mahout.text.SequenceFilesFromDirectory
Override this method to add options to the command line of the SequenceFilesFromDirectory job.
addType(Integer, ARFFType) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
addType(Integer, ARFFType) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
allSimilarItemIDs(long) - Method in class org.apache.mahout.cf.taste.impl.similarity.jdbc.AbstractJDBCItemSimilarity
 
ANALYZER_NAME - Static variable in class org.apache.mahout.utils.regex.RegexMapper
 
AnalyzerTransformer - Class in org.apache.mahout.utils.regex
 
AnalyzerTransformer() - Constructor for class org.apache.mahout.utils.regex.AnalyzerTransformer
 
AnalyzerTransformer(Analyzer) - Constructor for class org.apache.mahout.utils.regex.AnalyzerTransformer
 
AnalyzerTransformer(Analyzer, String) - Constructor for class org.apache.mahout.utils.regex.AnalyzerTransformer
 
ARFF_COMMENT - Static variable in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
ARFF_SPARSE - Static variable in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
ARFF_SPARSE_END - Static variable in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
ARFFModel - Interface in org.apache.mahout.utils.vectors.arff
An interface for representing an ARFFModel.
ARFFType - Enum in org.apache.mahout.utils.vectors.arff
 
ARFFVectorIterable - Class in org.apache.mahout.utils.vectors.arff
Read in ARFF (http://www.cs.waikato.ac.nz/~ml/weka/arff.html) and create Vectors

Attribute type handling: Numeric -> as is; Nominal -> ordinal(value), i.e.

ARFFVectorIterable(File, ARFFModel) - Constructor for class org.apache.mahout.utils.vectors.arff.ARFFVectorIterable
 
ARFFVectorIterable(File, Charset, ARFFModel) - Constructor for class org.apache.mahout.utils.vectors.arff.ARFFVectorIterable
 
ARFFVectorIterable(String, ARFFModel) - Constructor for class org.apache.mahout.utils.vectors.arff.ARFFVectorIterable
 
ARFFVectorIterable(Reader, ARFFModel) - Constructor for class org.apache.mahout.utils.vectors.arff.ARFFVectorIterable
 
ATTRIBUTE - Static variable in interface org.apache.mahout.utils.vectors.arff.ARFFModel
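The ARFFVectorIterable entry above says numeric attributes are kept as is and nominal attributes become ordinal(value). The sketch below illustrates one plausible reading of that rule using only the JDK. The class `NominalOrdinalSketch`, its method names, and the zero-based numbering in declaration order are assumptions made for illustration; they are not taken from the Mahout source.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Mahout: encodes ARFF-style attribute values.
final class NominalOrdinalSketch {

  // Assigns each declared nominal value its zero-based position (assumed convention).
  static Map<String, Integer> ordinals(List<String> declaredValues) {
    Map<String, Integer> result = new LinkedHashMap<>();
    for (String value : declaredValues) {
      // putIfAbsent keeps the first ordinal if a label is declared twice.
      result.putIfAbsent(value, result.size());
    }
    return result;
  }

  // Nominal values become their ordinal; anything else is parsed as a number.
  static double encode(String raw, Map<String, Integer> nominalOrdinals) {
    Integer ordinal = nominalOrdinals.get(raw);
    if (ordinal != null) {
      return ordinal;
    }
    return Double.parseDouble(raw);
  }

  public static void main(String[] args) {
    Map<String, Integer> ordinals = ordinals(Arrays.asList("hard", "soft"));
    System.out.println(encode("soft", ordinals)); // nominal -> its ordinal
    System.out.println(encode("3.5", ordinals));  // numeric -> as is
  }
}
```

In this sketch a value that is neither a declared nominal nor a parsable number raises a NumberFormatException; how the real iterator handles such input is not stated in this index.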
 

B

BASE_INPUT_PATH - Static variable in class org.apache.mahout.text.SequenceFilesFromDirectory
 
BASE_INPUT_PATH - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
benchmark(BenchmarkRunner.BenchmarkFn) - Method in class org.apache.mahout.benchmark.BenchmarkRunner
 
benchmark() - Method in class org.apache.mahout.benchmark.CloneBenchmark
 
benchmark(DistanceMeasure) - Method in class org.apache.mahout.benchmark.ClosestCentroidBenchmark
 
benchmark(DistanceMeasure) - Method in class org.apache.mahout.benchmark.DistanceBenchmark
 
benchmark() - Method in class org.apache.mahout.benchmark.DotBenchmark
 
benchmark() - Method in class org.apache.mahout.benchmark.MinusBenchmark
 
benchmark() - Method in class org.apache.mahout.benchmark.PlusBenchmark
 
benchmark() - Method in class org.apache.mahout.benchmark.SerializationBenchmark
 
benchmark() - Method in class org.apache.mahout.benchmark.TimesBenchmark
 
benchmarkD(BenchmarkRunner.BenchmarkFnD) - Method in class org.apache.mahout.benchmark.BenchmarkRunner
 
BenchmarkFn() - Constructor for class org.apache.mahout.benchmark.BenchmarkRunner.BenchmarkFn
 
BenchmarkFnD() - Constructor for class org.apache.mahout.benchmark.BenchmarkRunner.BenchmarkFnD
 
BenchmarkRunner - Class in org.apache.mahout.benchmark
 
BenchmarkRunner(long, long) - Constructor for class org.apache.mahout.benchmark.BenchmarkRunner
 
BenchmarkRunner.BenchmarkFn - Class in org.apache.mahout.benchmark
 
BenchmarkRunner.BenchmarkFnD - Class in org.apache.mahout.benchmark
 
BloomTokenFilter - Class in org.apache.mahout.utils.nlp.collocations.llr
Emits tokens based on bloom filter membership.
BloomTokenFilter(Filter, boolean, TokenStream) - Constructor for class org.apache.mahout.utils.nlp.collocations.llr.BloomTokenFilter
 
BODY_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
BODY_SEPARATOR_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
buildJson(Iterable<Pair<String, Double>>) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
 
buildJson(Iterable<Pair<String, Double>>, StringBuilder) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
 
buildPreference(ResultSet) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractBooleanPrefJDBCDataModel
 
buildPreference(ResultSet) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
buildRecommender() - Method in class org.apache.mahout.cf.taste.web.RecommenderWrapper
 
bump - Variable in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 
Bump125 - Class in org.apache.mahout.utils
Helps with making nice intervals at arbitrary scale.
Bump125() - Constructor for class org.apache.mahout.utils.Bump125
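Bump125 is described above as helping to make nice intervals at arbitrary scale. The class name suggests the familiar 1-2-5 series (1, 2, 5, 10, 20, 50, ...); assuming that reading, the JDK-only sketch below generates such a series. `NiceStepsSketch` and `niceStep` are hypothetical names, and this is not Bump125's actual code or API.

```java
// Hypothetical illustration of a 1-2-5 step series; not Mahout's Bump125.
final class NiceStepsSketch {
  private static final long[] MANTISSAS = {1, 2, 5};

  // Returns the n-th value (n >= 0) of the series 1, 2, 5, 10, 20, 50, ...
  static long niceStep(int n) {
    long scale = 1;
    for (int i = 0; i < n / 3; i++) {
      scale *= 10; // every three steps move up one decade
    }
    return MANTISSAS[n % 3] * scale;
  }

  public static void main(String[] args) {
    StringBuilder series = new StringBuilder();
    for (int n = 0; n < 7; n++) {
      series.append(niceStep(n)).append(' ');
    }
    System.out.println(series.toString().trim()); // prints "1 2 5 10 20 50 100"
  }
}
```

Steps of this kind are convenient for progress logging or axis ticks, because the spacing grows with the magnitude while the values stay round.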
 

C

CachedTermInfo - Class in org.apache.mahout.utils.vectors.lucene
Caches TermEntries from a single field.
CachedTermInfo(IndexReader, String, int, int) - Constructor for class org.apache.mahout.utils.vectors.lucene.CachedTermInfo
 
CassandraDataModel - Class in org.apache.mahout.cf.taste.impl.model.cassandra
A DataModel based on a Cassandra keyspace.
CassandraDataModel() - Constructor for class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
Uses the standard Cassandra host and port (localhost:9160), and keyspace name ("recommender").
CassandraDataModel(String, int, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
CDbwEvaluator - Class in org.apache.mahout.clustering.cdbw
This class calculates the CDbw metric as defined in http://www.db-net.aueb.gr/index.php/corporate/content/download/227/833/file/HV_poster2002.pdf
CDbwEvaluator(Map<Integer, List<VectorWritable>>, List<Cluster>, DistanceMeasure) - Constructor for class org.apache.mahout.clustering.cdbw.CDbwEvaluator
For testing only
CDbwEvaluator(Configuration, Path) - Constructor for class org.apache.mahout.clustering.cdbw.CDbwEvaluator
Initialize a new instance from job information
ChainTransformer - Class in org.apache.mahout.utils.regex
Chains together several RegexTransformers and applies them to the match in succession.
ChainTransformer() - Constructor for class org.apache.mahout.utils.regex.ChainTransformer
 
ChainTransformer(List<RegexTransformer>) - Constructor for class org.apache.mahout.utils.regex.ChainTransformer
 
CHARSET_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
CHUNK_SIZE_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
ChunkedWrapper - Class in org.apache.mahout.utils.io
ChunkedWriter based implementation of the WrappedWriter interface.
ChunkedWrapper(ChunkedWriter) - Constructor for class org.apache.mahout.utils.io.ChunkedWrapper
 
ChunkedWriter - Class in org.apache.mahout.utils.io
Writes data split across multiple Hadoop sequence files of approximately equal size.
ChunkedWriter(Configuration, int, Path) - Constructor for class org.apache.mahout.utils.io.ChunkedWriter
 
cIndex(int) - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
cleanup(Mapper<IntWritable, WeightedVectorWritable, IntWritable, WeightedVectorWritable>.Context) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
cleanup(Reducer<IntWritable, WeightedVectorWritable, IntWritable, VectorWritable>.Context) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsReducer
 
cleanup(Reducer<WritableComparable<?>, Writable, WritableComparable<?>, Writable>.Context) - Method in class org.apache.mahout.utils.SplitInputJob.SplitInputReducer
 
cleanupMappingCollection() - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
Cleanup mapping collection.
CLONE - Static variable in class org.apache.mahout.benchmark.CloneBenchmark
 
CloneBenchmark - Class in org.apache.mahout.benchmark
 
CloneBenchmark(VectorBenchmarks) - Constructor for class org.apache.mahout.benchmark.CloneBenchmark
 
close() - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
close() - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
close() - Method in class org.apache.mahout.text.WholeFileRecordReader
 
close() - Method in class org.apache.mahout.text.wikipedia.XmlInputFormat.XmlRecordReader
 
close() - Method in class org.apache.mahout.utils.clustering.AbstractClusterWriter
 
close() - Method in class org.apache.mahout.utils.clustering.GraphMLClusterWriter
 
close() - Method in class org.apache.mahout.utils.io.ChunkedWrapper
 
close() - Method in class org.apache.mahout.utils.io.ChunkedWriter
 
close() - Method in class org.apache.mahout.utils.io.IOWriterWrapper
 
close() - Method in class org.apache.mahout.utils.vectors.io.DelimitedTermInfoWriter
Does NOT close the underlying writer
close() - Method in class org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter
 
close() - Method in class org.apache.mahout.utils.vectors.io.TextualVectorWriter
 
ClosestCentroidBenchmark - Class in org.apache.mahout.benchmark
 
ClosestCentroidBenchmark(VectorBenchmarks) - Constructor for class org.apache.mahout.benchmark.ClosestCentroidBenchmark
 
ClusterDumper - Class in org.apache.mahout.utils.clustering
 
ClusterDumper(Path, Path) - Constructor for class org.apache.mahout.utils.clustering.ClusterDumper
 
ClusterDumper() - Constructor for class org.apache.mahout.utils.clustering.ClusterDumper
 
ClusterDumper.OUTPUT_FORMAT - Enum in org.apache.mahout.utils.clustering
 
ClusterDumperWriter - Class in org.apache.mahout.utils.clustering
Implements a ClusterWriter that outputs in the format used by ClusterDumper in Mahout 0.5
ClusterDumperWriter(Writer, Map<Integer, List<WeightedPropertyVectorWritable>>, DistanceMeasure, int, String[], int) - Constructor for class org.apache.mahout.utils.clustering.ClusterDumperWriter
 
ClusterEvaluator - Class in org.apache.mahout.clustering.evaluation
 
ClusterEvaluator(Map<Integer, List<VectorWritable>>, List<Cluster>, DistanceMeasure) - Constructor for class org.apache.mahout.clustering.evaluation.ClusterEvaluator
For testing only
ClusterEvaluator(Configuration, Path) - Constructor for class org.apache.mahout.clustering.evaluation.ClusterEvaluator
Initialize a new instance from job information
clusterIdToPoints - Variable in class org.apache.mahout.utils.clustering.AbstractClusterWriter
 
ClusterLabels - Class in org.apache.mahout.utils.vectors.lucene
Get labels for the cluster using Log Likelihood Ratio (LLR).
ClusterLabels(Path, Path, String, String, int, int) - Constructor for class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
CLUSTERS - Static variable in class org.apache.mahout.benchmark.VectorBenchmarks
 
ClusterWriter - Interface in org.apache.mahout.utils.clustering
Writes out clusters
compare(byte[], int, int, byte[], int, int) - Method in class org.apache.mahout.utils.SplitInputJob.SplitInputComparator
 
computeNext() - Method in class org.apache.mahout.utils.vectors.csv.CSVVectorIterator
 
computeNext() - Method in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 
configure(Map<Integer, List<VectorWritable>>, DistanceMeasure) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
configure(Map<Integer, List<VectorWritable>>) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsReducer
 
ConfusionMatrixDumper - Class in org.apache.mahout.classifier
Export a ConfusionMatrix in various text formats: ToString version, grayscale HTML table, summary HTML table, and table of counts, all with optional HTML wrappers. Input format: Hadoop SequenceFile with Text key and MatrixWritable value, 1 pair. Intended to consume the ConfusionMatrix SequenceFile output by the Bayes TestClassifier class.
ConnectionPoolDataSource - Class in org.apache.mahout.cf.taste.impl.model.jdbc
A wrapper DataSource which pools connections.
ConnectionPoolDataSource(DataSource) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.ConnectionPoolDataSource
 
countLines(FileSystem, Path, Charset) - Static method in class org.apache.mahout.utils.SplitInput
Count the lines in the file specified as returned by BufferedReader.readLine()
CREATE_COPY - Static variable in class org.apache.mahout.benchmark.VectorBenchmarks
 
CREATE_INCREMENTALLY - Static variable in class org.apache.mahout.benchmark.VectorBenchmarks
 
createBenchmark() - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
createComponents(String) - Method in class org.apache.mahout.text.MailArchivesClusteringAnalyzer
 
createComponents(String) - Method in class org.apache.mahout.text.wikipedia.WikipediaAnalyzer
 
createData() - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
createRecordReader(InputSplit, TaskAttemptContext) - Method in class org.apache.mahout.text.MultipleTextFileInputFormat
 
createRecordReader(InputSplit, TaskAttemptContext) - Method in class org.apache.mahout.text.wikipedia.XmlInputFormat
 
createSequenceFiles(MailOptions) - Method in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
CSVClusterWriter - Class in org.apache.mahout.utils.clustering
Format is adjacency style as put forth at http://gephi.org/users/supported-graph-formats/csv-format/. The centroid is the first element, and the rest of the row holds the points in that cluster.
CSVClusterWriter(Writer, Map<Integer, List<WeightedPropertyVectorWritable>>, DistanceMeasure) - Constructor for class org.apache.mahout.utils.clustering.CSVClusterWriter
 
CSVVectorIterator - Class in org.apache.mahout.utils.vectors.csv
Iterates a CSV file and produces Vector.
CSVVectorIterator(Reader) - Constructor for class org.apache.mahout.utils.vectors.csv.CSVVectorIterator
 
CSVVectorIterator(Reader, CSVStrategy) - Constructor for class org.apache.mahout.utils.vectors.csv.CSVVectorIterator
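The countLines entry in this section defines a line as whatever BufferedReader.readLine() returns. The JDK-only sketch below shows what that rule implies for trailing newlines, blank lines, and mixed line terminators. `LineCountSketch` is a hypothetical class that reads from any Reader; it does not reproduce the FileSystem, Path, and Charset handling of SplitInput.countLines.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

// Hypothetical helper, not Mahout code: counts lines the way readLine() sees them.
final class LineCountSketch {

  static int countLines(Reader source) {
    int count = 0;
    try (BufferedReader reader = new BufferedReader(source)) {
      // readLine() returns null only at end of input.
      while (reader.readLine() != null) {
        count++;
      }
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
    return count;
  }

  public static void main(String[] args) {
    // "\n" and "\r\n" each end one line; a final terminator adds no extra line.
    System.out.println(countLines(new StringReader("a\nb\r\nc\n"))); // prints 3
    // An empty line between two terminators is still counted.
    System.out.println(countLines(new StringReader("a\n\nb")));      // prints 3
    System.out.println(countLines(new StringReader("")));            // prints 0
  }
}
```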
 

D

DATA - Static variable in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
DATA_SOURCE_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
DEFAULT_ITEM_ID_COLUMN - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
DEFAULT_MAX_LABELS - Static variable in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
DEFAULT_MIN_IDS - Static variable in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
DEFAULT_MONGO_MAP_COLLECTION - Static variable in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
DEFAULT_PREFERENCE_COLUMN - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
DEFAULT_PREFERENCE_TABLE - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
DEFAULT_PREFERENCE_TIME_COLUMN - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
DEFAULT_QUOTED_TEXT - Static variable in class org.apache.mahout.utils.email.MailOptions
 
DEFAULT_USER_ID_COLUMN - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
DelimitedTermInfoWriter - Class in org.apache.mahout.utils.vectors.io
Write TermInfo to a Writer in a textual, delimited format with header.
DelimitedTermInfoWriter(Writer, String, String) - Constructor for class org.apache.mahout.utils.vectors.io.DelimitedTermInfoWriter
 
DENSE_FN_RAND - Static variable in class org.apache.mahout.benchmark.VectorBenchmarks
 
DENSE_FN_SEQ - Static variable in class org.apache.mahout.benchmark.VectorBenchmarks
 
DENSE_VECTOR - Static variable in class org.apache.mahout.benchmark.VectorBenchmarks
 
depends(Vector) - Method in class org.apache.mahout.benchmark.BenchmarkRunner.BenchmarkFn
Adds a random data dependency so that JVM does not remove dead code.
depends(Vector) - Method in class org.apache.mahout.benchmark.BenchmarkRunner.BenchmarkFnD
Adds a random data dependency so that JVM does not remove dead code.
DESERIALIZE - Static variable in class org.apache.mahout.benchmark.SerializationBenchmark
 
deserializeBenchmark() - Method in class org.apache.mahout.benchmark.SerializationBenchmark
 
DICTIONARY_OPTION - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 
DICTIONARY_TYPE_OPTION - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 
DISTANCE_MEASURE_KEY - Static variable in class org.apache.mahout.clustering.evaluation.RepresentativePointsDriver
 
DistanceBenchmark - Class in org.apache.mahout.benchmark
 
DistanceBenchmark(VectorBenchmarks) - Constructor for class org.apache.mahout.benchmark.DistanceBenchmark
 
doGet(HttpServletRequest, HttpServletResponse) - Method in class org.apache.mahout.cf.taste.web.RecommenderServlet
 
doGetPreferencesForItem(long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
doPost(HttpServletRequest, HttpServletResponse) - Method in class org.apache.mahout.cf.taste.web.RecommenderServlet
 
DotBenchmark - Class in org.apache.mahout.benchmark
 
DotBenchmark(VectorBenchmarks) - Constructor for class org.apache.mahout.benchmark.DotBenchmark
 
Driver - Class in org.apache.mahout.utils.vectors.arff
 
Driver - Class in org.apache.mahout.utils.vectors.lucene
 
Driver() - Constructor for class org.apache.mahout.utils.vectors.lucene.Driver
 
dumpVectors() - Method in class org.apache.mahout.utils.vectors.lucene.Driver
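The depends(Vector) entries in this section mention adding a random data dependency so that the JVM does not remove dead code. The idea is that a benchmark result which is never read can be optimized away, so the benchmark reads a randomly chosen element of each result and folds it into a running total. The sketch below shows that pattern with plain double arrays; `DeadCodeGuardSketch` and its methods are hypothetical and are not the BenchmarkRunner implementation.

```java
import java.util.Random;

// Hypothetical illustration of a benchmark "data dependency"; not Mahout code.
final class DeadCodeGuardSketch {
  private static final Random RANDOM = new Random(42);
  private double accumulator;

  // Reads one randomly chosen element so the result is observably used.
  void depends(double[] result) {
    accumulator += result[RANDOM.nextInt(result.length)];
  }

  double accumulated() {
    return accumulator;
  }

  // The operation being "benchmarked": element-wise addition.
  static double[] plus(double[] a, double[] b) {
    double[] sum = new double[a.length];
    for (int i = 0; i < a.length; i++) {
      sum[i] = a[i] + b[i];
    }
    return sum;
  }

  public static void main(String[] args) {
    DeadCodeGuardSketch guard = new DeadCodeGuardSketch();
    double[] a = {1, 1, 1};
    double[] b = {2, 2, 2};
    for (int i = 0; i < 1000; i++) {
      guard.depends(plus(a, b));
    }
    // Every element of each sum is 3.0, so the total is 3000.0 regardless of the index chosen.
    System.out.println(guard.accumulated());
  }
}
```

Printing or otherwise consuming the accumulated value at the end is what makes the dependency effective; an accumulator that is never read could itself be eliminated.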
 

E

END_TAG_KEY - Static variable in class org.apache.mahout.text.wikipedia.XmlInputFormat
 
estimatePreference(long, long) - Method in class org.apache.mahout.cf.taste.web.RecommenderWrapper
 
EVALUATE_CLUSTERS - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 
exportWithIDsOnly() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
exportWithPrefs() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
extract(CharSequence, Pattern, Collection<Integer>, String, RegexTransformer) - Static method in class org.apache.mahout.utils.regex.RegexUtils
 
extract(CharSequence, StringBuilder, Pattern, Collection<Integer>, String, RegexTransformer) - Static method in class org.apache.mahout.utils.regex.RegexUtils
 

F

field - Variable in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 
FILE_FILTER_CLASS_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromDirectory
 
firstEntries(Vector, int) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
 
format(String) - Method in class org.apache.mahout.utils.regex.FPGFormatter
 
format(String) - Method in class org.apache.mahout.utils.regex.IdentityFormatter
 
format(String) - Method in interface org.apache.mahout.utils.regex.RegexFormatter
 
FORMATTER_CLASS - Static variable in class org.apache.mahout.utils.regex.RegexMapper
 
FPGFormatter - Class in org.apache.mahout.utils.regex
Collapses/converts all whitespace to a single tab
FPGFormatter() - Constructor for class org.apache.mahout.utils.regex.FPGFormatter
 
FROM - Static variable in class org.apache.mahout.utils.email.MailOptions
 
FROM_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
FROM_PREFIX - Static variable in class org.apache.mahout.utils.email.MailProcessor
 
fromIdToLong(String, boolean) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
Translates the MongoDB identifier to Mahout/MongoDBDataModel's internal identifier, if required.
fromLongToId(long) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
Translates the Mahout/MongoDBDataModel's internal identifier to MongoDB identifier, if required.
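FPGFormatter is described in this section as collapsing or converting all whitespace to a single tab. A minimal JDK-only sketch of that behavior follows, assuming it means that each run of whitespace becomes one tab character. `WhitespaceToTabSketch` is a hypothetical class, and whether the real formatter also trims leading or trailing whitespace is not stated in this index.

```java
// Hypothetical illustration, not Mahout's FPGFormatter.
final class WhitespaceToTabSketch {

  // Replaces every run of whitespace (spaces, tabs, newlines) with one tab.
  static String format(String input) {
    return input.replaceAll("\\s+", "\t");
  }

  public static void main(String[] args) {
    String formatted = format("milk  bread \n eggs");
    System.out.println(formatted.equals("milk\tbread\teggs")); // prints true
  }
}
```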

G

generateKey(String, String, String) - Static method in class org.apache.mahout.text.SequenceFilesFromMailArchivesMapper
 
generateKey(File, String, String) - Static method in class org.apache.mahout.utils.email.MailProcessor
 
GenericJDBCDataModel - Class in org.apache.mahout.cf.taste.impl.model.jdbc
A generic DataModel designed for use with other JDBC data sources; all necessary SQL queries are passed to the constructor.
GenericJDBCDataModel(Properties) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
Specifies all SQL queries in a Properties object.
GenericJDBCDataModel(File) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
GenericJDBCDataModel(String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
GET_ALL_USERS_SQL_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
GET_ITEMS_SQL_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
GET_MAX_PREFERENCE_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
GET_MIN_PREFERENCE_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
GET_NUM_ITEMS_SQL_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
GET_NUM_PREFERENCE_FOR_ITEM_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
GET_NUM_PREFERENCE_FOR_ITEMS_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
GET_NUM_USERS_SQL_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
GET_PREFERENCE_SQL_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
GET_PREFERENCE_TIME_SQL_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
GET_PREFS_FOR_ITEM_SQL_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
GET_USER_SQL_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
GET_USERS_SQL_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
getAllEntries() - Method in class org.apache.mahout.utils.vectors.lucene.CachedTermInfo
 
getAllEntries() - Method in interface org.apache.mahout.utils.vectors.TermInfo
 
getAnalyzer() - Method in class org.apache.mahout.utils.regex.AnalyzerTransformer
 
getARFFType(Integer) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getARFFType(Integer) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
getBodySeparator() - Method in class org.apache.mahout.utils.email.MailOptions
 
getCallback() - Method in class org.apache.mahout.utils.SplitInput
 
getCDbw() - Method in class org.apache.mahout.clustering.cdbw.CDbwEvaluator
Compute the CDbw validity metric (eqn 8).
getChain() - Method in class org.apache.mahout.utils.regex.ChainTransformer
 
getCharset() - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
getCharset() - Method in class org.apache.mahout.utils.email.MailOptions
 
getCharset() - Method in class org.apache.mahout.utils.SplitInput
 
getChunkSize() - Method in class org.apache.mahout.utils.email.MailOptions
 
getClusterIdToPoints() - Method in class org.apache.mahout.utils.clustering.AbstractClusterWriter
 
getClusterIdToPoints() - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
getClusterLabels(Integer, Collection<WeightedPropertyVectorWritable>) - Method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
Get the list of labels, sorted by best score.
getConf() - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
getConnection() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ConnectionPoolDataSource
 
getConnection(String, String) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ConnectionPoolDataSource
 
getCurrentKey() - Method in class org.apache.mahout.text.WholeFileRecordReader
 
getCurrentKey() - Method in class org.apache.mahout.text.wikipedia.XmlInputFormat.XmlRecordReader
 
getCurrentValue() - Method in class org.apache.mahout.text.WholeFileRecordReader
 
getCurrentValue() - Method in class org.apache.mahout.text.wikipedia.XmlInputFormat.XmlRecordReader
 
getDataModel() - Method in class org.apache.mahout.cf.taste.web.RecommenderWrapper
 
getDataSource() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getDateFormat(Integer) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getDateFormat(Integer) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
getDateMap() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
Map of Date formatters used
getDelegate() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getDelegateInMemory() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getDocFreq() - Method in class org.apache.mahout.utils.vectors.TermEntry
 
getFetchSize() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.MySQLBooleanPrefJDBCDataModel
 
getFetchSize() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.MySQLJDBCDataModel
 
getFetchSize() - Method in class org.apache.mahout.cf.taste.impl.similarity.jdbc.MySQLJDBCInMemoryItemSimilarity
 
getFetchSize() - Method in class org.apache.mahout.cf.taste.impl.similarity.jdbc.MySQLJDBCItemSimilarity
 
getFs() - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
getIdField() - Method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
getIndicator() - Method in enum org.apache.mahout.utils.vectors.arff.ARFFType
 
getInput() - Method in class org.apache.mahout.utils.email.MailOptions
 
getInputDirectory() - Method in class org.apache.mahout.utils.SplitInput
 
getInstance() - Static method in class org.apache.mahout.cf.taste.web.RecommenderSingleton
 
getItemAIDColumn() - Method in class org.apache.mahout.cf.taste.impl.similarity.jdbc.AbstractJDBCItemSimilarity
 
getItemBIDColumn() - Method in class org.apache.mahout.cf.taste.impl.similarity.jdbc.AbstractJDBCItemSimilarity
 
getItemIDColumn() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getItemIDs() - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getItemIDs() - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getItemIDs() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getItemIDs() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getItemIDs() - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getItemIDsFromUser(long) - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getItemIDsFromUser(long) - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getItemIDsFromUser(long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getItemIDsFromUser(long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getItemIDsFromUser(long) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getLabel(String) - Method in enum org.apache.mahout.utils.vectors.arff.ARFFType
 
getLabelBindings() - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
The vector attributes (labels in Mahout speak)
getLabelBindings() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
The vector attributes (labels in Mahout speak), unmodifiable
getLabelIndex(String) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getLabelIndex(String) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
getLabels() - Method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
getLabelSize() - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getLabelSize() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
getLoginTimeout() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ConnectionPoolDataSource
 
getLogWriter() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ConnectionPoolDataSource
 
getLongColumn(ResultSet, int) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
Subclasses may wish to override this if ID values in the file are not numeric.
getMaxPointsPerCluster() - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
getMaxPreference() - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getMaxPreference() - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getMaxPreference() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractBooleanPrefJDBCDataModel
 
getMaxPreference() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getMaxPreference() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getMaxPreference() - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getMinPreference() - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getMinPreference() - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getMinPreference() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractBooleanPrefJDBCDataModel
 
getMinPreference() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getMinPreference() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getMinPreference() - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getModel() - Method in class org.apache.mahout.utils.vectors.arff.ARFFVectorIterable
Returns info about the ARFF content that was parsed.
getNominalMap() - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getNominalMap() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
Map nominals to ids.
getNominalValue(String, String) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getNominalValue(String, String) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
getNumItems() - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getNumItems() - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getNumItems() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getNumItems() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getNumItems() - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getNumTopFeatures() - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
getNumUsers() - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getNumUsers() - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getNumUsers() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getNumUsers() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getNumUsers() - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getNumUsersWithPreferenceFor(long) - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getNumUsersWithPreferenceFor(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getNumUsersWithPreferenceFor(long) - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getNumUsersWithPreferenceFor(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getNumUsersWithPreferenceFor(long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getNumUsersWithPreferenceFor(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getNumUsersWithPreferenceFor(long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getNumUsersWithPreferenceFor(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getNumUsersWithPreferenceFor(long) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getNumUsersWithPreferenceFor(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getOptions() - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
getOptions() - Method in class org.apache.mahout.utils.email.MailProcessor
 
getOutput() - Method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
getOutputDir() - Method in class org.apache.mahout.utils.email.MailOptions
 
getParentLogger() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ConnectionPoolDataSource
 
getPatternOrder() - Method in class org.apache.mahout.utils.email.MailOptions
 
getPatternsToMatch() - Method in class org.apache.mahout.utils.email.MailOptions
 
getPoints(Cluster, String[]) - Method in class org.apache.mahout.utils.clustering.JsonClusterWriter
Create a List of HashMaps containing Vector point information
getPreferenceColumn() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getPreferencesForItem(long) - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getPreferencesForItem(long) - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getPreferencesForItem(long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getPreferencesForItem(long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getPreferencesForItem(long) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getPreferencesFromUser(long) - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getPreferencesFromUser(long) - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getPreferencesFromUser(long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getPreferencesFromUser(long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getPreferencesFromUser(long) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getPreferenceTable() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getPreferenceTime(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getPreferenceTime(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getPreferenceTime(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getPreferenceTime(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getPreferenceTime(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getPreferenceValue(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getPreferenceValue(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getPreferenceValue(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getPreferenceValue(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getPreferenceValue(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getPrefix() - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
getPrefix() - Method in class org.apache.mahout.utils.email.MailOptions
 
getPrefix() - Method in class org.apache.mahout.utils.email.MailProcessor
 
getProgress() - Method in class org.apache.mahout.text.WholeFileRecordReader
 
getProgress() - Method in class org.apache.mahout.text.wikipedia.XmlInputFormat.XmlRecordReader
 
getQuotedTextPattern() - Method in class org.apache.mahout.utils.email.MailOptions
 
getRecommender() - Method in class org.apache.mahout.cf.taste.web.RecommenderSingleton
 
getRelation() - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getRelation() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
getRepresentativePoints(Configuration) - Static method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
getRepresentativePoints(Configuration, Path) - Static method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
getRunner() - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
getSeparator() - Method in class org.apache.mahout.utils.email.MailOptions
 
getSimilarityColumn() - Method in class org.apache.mahout.cf.taste.impl.similarity.jdbc.AbstractJDBCItemSimilarity
 
getSimilarityTable() - Method in class org.apache.mahout.cf.taste.impl.similarity.jdbc.AbstractJDBCItemSimilarity
 
getSplitLocation() - Method in class org.apache.mahout.utils.SplitInput
 
getSubString() - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
getTableName() - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getTerm() - Method in class org.apache.mahout.utils.vectors.TermEntry
 
getTermDictionary() - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
getTermEntry(String, String) - Method in class org.apache.mahout.utils.vectors.lucene.CachedTermInfo
 
getTermEntry(String, String) - Method in interface org.apache.mahout.utils.vectors.TermInfo
 
getTermIdx() - Method in class org.apache.mahout.utils.vectors.TermEntry
 
getTestOutputDirectory() - Method in class org.apache.mahout.utils.SplitInput
 
getTestRandomSelectionPct() - Method in class org.apache.mahout.utils.SplitInput
 
getTestRandomSelectionSize() - Method in class org.apache.mahout.utils.SplitInput
 
getTestSplitPct() - Method in class org.apache.mahout.utils.SplitInput
 
getTestSplitSize() - Method in class org.apache.mahout.utils.SplitInput
 
getTopFeatures(Vector, String[], int) - Static method in class org.apache.mahout.utils.clustering.AbstractClusterWriter
 
getTopFeaturesList(Vector, String[], int) - Method in class org.apache.mahout.utils.clustering.JsonClusterWriter
Create a List of HashMaps containing top terms information
getTopTerms(Vector, String[], int) - Static method in class org.apache.mahout.utils.clustering.AbstractClusterWriter
 
getTrainingOutputDirectory() - Method in class org.apache.mahout.utils.SplitInput
 
getTypeMap() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
The map of types encountered
getUpdatePreferenceSQL() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92JDBCDataModel
 
getUserIDColumn() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getUserIDs() - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
getUserIDs() - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
getUserIDs() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
getUserIDs() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
getUserIDs() - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
getValue(String, int) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getValue(String, int) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
Convert a piece of String data at a specific spot into a value
getVector() - Method in class org.apache.mahout.utils.vectors.lucene.TFDFMapper
 
getVectorName(int) - Method in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
Given the document name, derive a name for the vector.
getVectorName(int) - Method in class org.apache.mahout.utils.vectors.lucene.LuceneIterator
 
getVerifyPreferenceSQL() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92BooleanPrefJDBCDataModel
 
getVerifyPreferenceSQL() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92JDBCDataModel
 
getWordCount() - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
The count of the number of words seen
getWordCount() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
The count of the number of words seen
getWords() - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getWords() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
Immutable map of words to the long id used for those words
getWriter() - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
getWriter() - Method in class org.apache.mahout.utils.clustering.AbstractClusterWriter
 
getWriter() - Method in class org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter
 
getWriter() - Method in class org.apache.mahout.utils.vectors.io.TextualVectorWriter
 
GraphMLClusterWriter - Class in org.apache.mahout.utils.clustering
GraphML -- see http://gephi.org/users/supported-graph-formats/graphml-format/
GraphMLClusterWriter(Writer, Map<Integer, List<WeightedPropertyVectorWritable>>, DistanceMeasure, int, String[], int) - Constructor for class org.apache.mahout.utils.clustering.GraphMLClusterWriter
 
GROUP_MATCHERS - Static variable in class org.apache.mahout.utils.regex.RegexMapper
 

H

hasPreferenceValues() - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
hasPreferenceValues() - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
hasPreferenceValues() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractBooleanPrefJDBCDataModel
 
hasPreferenceValues() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
hasPreferenceValues() - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
hasPreferenceValues() - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
HBaseDataModel - Class in org.apache.mahout.cf.taste.impl.model.hbase
Naive approach of storing one preference as one value in the table.
HBaseDataModel(String) - Constructor for class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
HBaseDataModel(String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
HBaseDataModel(HTablePool, String, Configuration) - Constructor for class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 

I

IDENTITY_FORMATTER - Static variable in class org.apache.mahout.utils.regex.RegexUtils
 
IDENTITY_TRANSFORMER - Static variable in class org.apache.mahout.utils.regex.RegexUtils
 
IdentityFormatter - Class in org.apache.mahout.utils.regex
 
IdentityFormatter() - Constructor for class org.apache.mahout.utils.regex.IdentityFormatter
 
IdentityTransformer - Class in org.apache.mahout.utils.regex
No-op
IdentityTransformer() - Constructor for class org.apache.mahout.utils.regex.IdentityTransformer
 
idField - Variable in class org.apache.mahout.utils.vectors.lucene.LuceneIterator
 
idFieldSelector - Variable in class org.apache.mahout.utils.vectors.lucene.LuceneIterator
 
increment() - Method in class org.apache.mahout.utils.Bump125
 
incrementalCreateBenchmark() - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
incrementToken() - Method in class org.apache.mahout.utils.nlp.collocations.llr.BloomTokenFilter
 
indexReader - Variable in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 
init(ServletConfig) - Method in class org.apache.mahout.cf.taste.web.RecommenderServlet
 
initialize(InputSplit, TaskAttemptContext) - Method in class org.apache.mahout.text.WholeFileRecordReader
 
initialize(InputSplit, TaskAttemptContext) - Method in class org.apache.mahout.text.wikipedia.XmlInputFormat.XmlRecordReader
 
initializeIfNeeded(String) - Static method in class org.apache.mahout.cf.taste.web.RecommenderSingleton
 
InputDriver - Class in org.apache.mahout.clustering.conversion
This class converts text files containing space-delimited floating point numbers into Mahout sequence files of VectorWritable suitable for input to the clustering jobs in particular, and any Mahout job requiring this input in general.
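The input format named in the InputDriver entry above (one line of space-delimited floating point numbers per vector) can be sketched in plain Java. This is a minimal illustration, not the Mahout implementation: the class `SpaceDelimitedLineParser` and its `parseLine` method are hypothetical names, and the sketch returns a `double[]` instead of writing a VectorWritable to a sequence file.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Mahout: parses one line of
// space-delimited floating point numbers into a dense array.
final class SpaceDelimitedLineParser {

    private SpaceDelimitedLineParser() {}

    static double[] parseLine(String line) {
        String trimmed = line.trim();
        if (trimmed.isEmpty()) {
            // A blank line yields an empty vector in this sketch.
            return new double[0];
        }
        // Split on runs of whitespace so repeated spaces or tabs are tolerated.
        String[] tokens = trimmed.split("\\s+");
        double[] values = new double[tokens.length];
        for (int i = 0; i < tokens.length; i++) {
            // Throws NumberFormatException if a token is not a number.
            values[i] = Double.parseDouble(tokens[i]);
        }
        return values;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(parseLine("1.0 2.5  -3e2")));
    }
}
```

Splitting on `\\s+` instead of a single space is a design choice of this sketch; whether InputDriver tolerates repeated delimiters is not stated in this index.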
InputMapper - Class in org.apache.mahout.clustering.conversion
 
InputMapper() - Constructor for class org.apache.mahout.clustering.conversion.InputMapper
 
interClusterDensities() - Method in class org.apache.mahout.clustering.cdbw.CDbwEvaluator
This function evaluates the density of points in the regions between each pair of clusters (eqn 1).
interClusterDensity() - Method in class org.apache.mahout.clustering.cdbw.CDbwEvaluator
This function evaluates the average density of points in the regions between clusters (eqn 1).
interClusterDensity() - Method in class org.apache.mahout.clustering.evaluation.ClusterEvaluator
Computes the inter-cluster density as defined in "Mahout In Action"
interClusterDistances() - Method in class org.apache.mahout.clustering.evaluation.ClusterEvaluator
Computes the inter-cluster distances
intraClusterDensities() - Method in class org.apache.mahout.clustering.cdbw.CDbwEvaluator
The average density within clusters is defined as the percentage of representative points that reside in the neighborhood of the clusters' centers.
intraClusterDensities() - Method in class org.apache.mahout.clustering.evaluation.ClusterEvaluator
Computes the intra-cluster densities for all clusters as the average distance of the representative points from each other
intraClusterDensity() - Method in class org.apache.mahout.clustering.cdbw.CDbwEvaluator
The average density within clusters is defined as the percentage of representative points that reside in the neighborhood of the clusters' centers.
intraClusterDensity() - Method in class org.apache.mahout.clustering.evaluation.ClusterEvaluator
Computes the average intra-cluster density as the average of each cluster's intra-cluster density
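Read literally, the two ClusterEvaluator entries above describe a per-cluster value (the average distance of the representative points from each other) and an overall value (the average of those per-cluster values). The following is a minimal sketch of that wording only. The class and method names are hypothetical, Euclidean distance is an assumption (Mahout takes a DistanceMeasure), and the actual implementation may apply normalization that this sketch does not attempt.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch, not Mahout code: all names here are illustrative.
final class IntraClusterDensitySketch {

    private IntraClusterDensitySketch() {}

    // Euclidean distance between two points of equal dimension (assumed measure).
    static double euclidean(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }

    // Average distance over all unordered pairs of representative points.
    // Returns 0.0 when there are fewer than two points.
    static double averagePairwiseDistance(List<double[]> points) {
        int pairs = 0;
        double total = 0.0;
        for (int i = 0; i < points.size(); i++) {
            for (int j = i + 1; j < points.size(); j++) {
                total += euclidean(points.get(i), points.get(j));
                pairs++;
            }
        }
        return pairs == 0 ? 0.0 : total / pairs;
    }

    // Average of the per-cluster values; returns 0.0 for no clusters.
    static double averageOverClusters(List<List<double[]>> clusters) {
        if (clusters.isEmpty()) {
            return 0.0;
        }
        double total = 0.0;
        for (List<double[]> cluster : clusters) {
            total += averagePairwiseDistance(cluster);
        }
        return total / clusters.size();
    }

    public static void main(String[] args) {
        List<double[]> first = Arrays.asList(new double[] {0, 0}, new double[] {3, 4});
        List<double[]> second = Arrays.asList(new double[] {0, 0}, new double[] {0, 1});
        System.out.println(averageOverClusters(Arrays.asList(first, second)));
    }
}
```

In the `main` example the two clusters have pairwise averages of 5.0 and 1.0, so the overall average is 3.0.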
IOWriterWrapper - Class in org.apache.mahout.utils.io
Implementation of the WrappedWriter interface based on Writer.
IOWriterWrapper(Writer) - Constructor for class org.apache.mahout.utils.io.IOWriterWrapper
 
isIDInModel(String) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
Checks if an ID is currently in the model.
isIncludeBody() - Method in class org.apache.mahout.utils.email.MailOptions
 
isNumeric(String) - Static method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
isStripQuotedText() - Method in class org.apache.mahout.utils.email.MailOptions
 
isWrapperFor(Class<?>) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ConnectionPoolDataSource
 
itemSimilarities(long, long[]) - Method in class org.apache.mahout.cf.taste.impl.similarity.jdbc.AbstractJDBCItemSimilarity
 
itemSimilarity(long, long) - Method in class org.apache.mahout.cf.taste.impl.similarity.jdbc.AbstractJDBCItemSimilarity
 
iterator() - Method in class org.apache.mahout.utils.vectors.arff.ARFFVectorIterable
 
iterator() - Method in class org.apache.mahout.utils.vectors.lucene.LuceneIterable
 

J

JsonClusterWriter - Class in org.apache.mahout.utils.clustering
Dump cluster info to JSON formatted lines.
JsonClusterWriter(Writer, Map<Integer, List<WeightedPropertyVectorWritable>>, DistanceMeasure, int, String[]) - Constructor for class org.apache.mahout.utils.clustering.JsonClusterWriter
 

K

KEY_PREFIX_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromDirectory
 
KEY_PREFIX_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 

L

LDAPrintTopics - Class in org.apache.mahout.clustering.lda
Class to print out the top K words for each topic.
loadTermDictionary(File) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
Read in a dictionary file.
loadTermDictionary(Configuration, String) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
Read a dictionary in SequenceFile generated by DictionaryVectorizer
LuceneIterable - Class in org.apache.mahout.utils.vectors.lucene
Iterable counterpart to LuceneIterator.
LuceneIterable(IndexReader, String, String, TermInfo, Weight) - Constructor for class org.apache.mahout.utils.vectors.lucene.LuceneIterable
 
LuceneIterable(IndexReader, String, String, TermInfo, Weight, double) - Constructor for class org.apache.mahout.utils.vectors.lucene.LuceneIterable
 
LuceneIterable(IndexReader, String, String, TermInfo, Weight, double, double) - Constructor for class org.apache.mahout.utils.vectors.lucene.LuceneIterable
Produce a LuceneIterable that can create the Vector plus normalize it.
LuceneIterator - Class in org.apache.mahout.utils.vectors.lucene
An Iterator over Vectors that uses a Lucene index as the source for creating the Vectors.
LuceneIterator(IndexReader, String, String, TermInfo, Weight, double) - Constructor for class org.apache.mahout.utils.vectors.lucene.LuceneIterator
Produce a LuceneIterator that can create the Vector and normalize it.
LuceneIterator(IndexReader, String, String, TermInfo, Weight, double, double) - Constructor for class org.apache.mahout.utils.vectors.lucene.LuceneIterator
 

M

MailArchivesClusteringAnalyzer - Class in org.apache.mahout.text
Custom Lucene Analyzer designed for aggressive feature reduction for clustering the ASF Mail Archives using an extended set of stop words, excluding non-alphanumeric tokens, and Porter stemming.
MailArchivesClusteringAnalyzer() - Constructor for class org.apache.mahout.text.MailArchivesClusteringAnalyzer
 
MailArchivesClusteringAnalyzer(CharArraySet) - Constructor for class org.apache.mahout.text.MailArchivesClusteringAnalyzer
 
MailOptions - Class in org.apache.mahout.utils.email
Configuration options to be used by MailProcessor.
MailOptions() - Constructor for class org.apache.mahout.utils.email.MailOptions
 
MailProcessor - Class in org.apache.mahout.utils.email
Converts an mbox mail archive into a group of Hadoop Sequence Files with equal size.
MailProcessor(MailOptions, String, Writer) - Constructor for class org.apache.mahout.utils.email.MailProcessor
Creates a MailProcessor that does not write to sequence files, but to a single text file.
MailProcessor(MailOptions, String, ChunkedWriter) - Constructor for class org.apache.mahout.utils.email.MailProcessor
This is the main constructor of MailProcessor.
main(String[]) - Static method in class org.apache.mahout.benchmark.DotBenchmark
 
main(String[]) - Static method in class org.apache.mahout.benchmark.VectorBenchmarks
 
main(String[]) - Static method in class org.apache.mahout.classifier.ConfusionMatrixDumper
 
main(String[]) - Static method in class org.apache.mahout.clustering.conversion.InputDriver
 
main(String[]) - Static method in class org.apache.mahout.clustering.evaluation.RepresentativePointsDriver
 
main(String[]) - Static method in class org.apache.mahout.clustering.lda.LDAPrintTopics
 
main(String[]) - Static method in class org.apache.mahout.text.SequenceFilesFromDirectory
 
main(String[]) - Static method in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
main(String[]) - Static method in class org.apache.mahout.text.TextParagraphSplittingJob
 
main(String[]) - Static method in class org.apache.mahout.text.wikipedia.WikipediaDatasetCreatorDriver
Takes in two arguments: the input Path where the input documents live, and the output Path where to write the classifier as a SequenceFile.
main(String[]) - Static method in class org.apache.mahout.text.wikipedia.WikipediaXmlSplitter
 
main(String[]) - Static method in class org.apache.mahout.text.WikipediaToSequenceFile
Takes in two arguments: the input Path where the input documents live, and the output Path where to write the classifier as a SequenceFile.
main(String[]) - Static method in class org.apache.mahout.utils.clustering.ClusterDumper
 
main(String[]) - Static method in class org.apache.mahout.utils.MatrixDumper
 
main(String[]) - Static method in class org.apache.mahout.utils.regex.RegexConverterDriver
 
main(String[]) - Static method in class org.apache.mahout.utils.SequenceFileDumper
 
main(String[]) - Static method in class org.apache.mahout.utils.SplitInput
 
main(String[]) - Static method in class org.apache.mahout.utils.vectors.arff.Driver
 
main(String[]) - Static method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
main(String[]) - Static method in class org.apache.mahout.utils.vectors.lucene.Driver
 
main(String[]) - Static method in class org.apache.mahout.utils.vectors.RowIdJob
 
main(String[]) - Static method in class org.apache.mahout.utils.vectors.VectorDumper
 
map(LongWritable, Text, Mapper<LongWritable, Text, Text, VectorWritable>.Context) - Method in class org.apache.mahout.clustering.conversion.InputMapper
 
map(IntWritable, WeightedVectorWritable, Mapper<IntWritable, WeightedVectorWritable, IntWritable, WeightedVectorWritable>.Context) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
map(IntWritable, BytesWritable, Mapper<IntWritable, BytesWritable, Text, Text>.Context) - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryMapper
 
map(IntWritable, BytesWritable, Mapper<IntWritable, BytesWritable, Text, Text>.Context) - Method in class org.apache.mahout.text.SequenceFilesFromMailArchivesMapper
 
map(Text, Text, Mapper<Text, Text, Text, Text>.Context) - Method in class org.apache.mahout.text.TextParagraphSplittingJob.SplitMap
 
map(LongWritable, Text, Mapper<LongWritable, Text, Text, Text>.Context) - Method in class org.apache.mahout.text.wikipedia.WikipediaDatasetCreatorMapper
 
map(LongWritable, Text, Mapper<LongWritable, Text, Text, Text>.Context) - Method in class org.apache.mahout.text.wikipedia.WikipediaMapper
 
map(LongWritable, Text, Mapper<LongWritable, Text, LongWritable, Text>.Context) - Method in class org.apache.mahout.utils.regex.RegexMapper
 
map(BytesRef, int) - Method in class org.apache.mahout.utils.vectors.lucene.TFDFMapper
 
MapBackedARFFModel - Class in org.apache.mahout.utils.vectors.arff
Holds ARFF information in Map.
MapBackedARFFModel() - Constructor for class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
MapBackedARFFModel(Map<String, Long>, long, Map<String, Map<String, Integer>>) - Constructor for class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
mapPoint(IntWritable, WeightedVectorWritable, DistanceMeasure, Map<Integer, List<VectorWritable>>, Map<Integer, WeightedVectorWritable>) - Static method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
MatrixDumper - Class in org.apache.mahout.utils
Export a Matrix in various text formats: CSV file. Input format: Hadoop SequenceFile with Text key and MatrixWritable value, 1 pair. TODO: Needs class for key value; should not hard-code to Text.
maxErrorDocs - Variable in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 
measure - Variable in class org.apache.mahout.utils.clustering.AbstractClusterWriter
 
MinusBenchmark - Class in org.apache.mahout.benchmark
 
MinusBenchmark(VectorBenchmarks) - Constructor for class org.apache.mahout.benchmark.MinusBenchmark
 
MongoDBDataModel - Class in org.apache.mahout.cf.taste.impl.model.mongodb
A DataModel backed by a MongoDB database.
MongoDBDataModel() - Constructor for class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
Creates a new MongoDBDataModel
MongoDBDataModel(String, int, String, String, boolean, boolean, DateFormat) - Constructor for class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
Creates a new MongoDBDataModel with MongoDB basic configuration (without authentication)
MongoDBDataModel(String, int, String, String, boolean, boolean, DateFormat, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
Creates a new MongoDBDataModel with MongoDB advanced configuration (without authentication)
MongoDBDataModel(String, int, String, String, boolean, boolean, DateFormat, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
Creates a new MongoDBDataModel with MongoDB basic configuration (with authentication)
MongoDBDataModel(String, int, String, String, boolean, boolean, DateFormat, String, String, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
Creates a new MongoDBDataModel with MongoDB advanced configuration (with authentication)
mongoUpdateDate() - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
Date of the latest update of the model.
MultipleTextFileInputFormat - Class in org.apache.mahout.text
Used in combining a large number of text files into one text input reader along with the WholeFileRecordReader class.
MultipleTextFileInputFormat() - Constructor for class org.apache.mahout.text.MultipleTextFileInputFormat
 
MySQLBooleanPrefJDBCDataModel - Class in org.apache.mahout.cf.taste.impl.model.jdbc
See also MySQLJDBCDataModel -- same except deals with a table without preference info.
MySQLBooleanPrefJDBCDataModel() - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.MySQLBooleanPrefJDBCDataModel
Creates a MySQLBooleanPrefJDBCDataModel using the default DataSource (named AbstractJDBCComponent.DEFAULT_DATASOURCE_NAME) and default table/column names.
MySQLBooleanPrefJDBCDataModel(String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.MySQLBooleanPrefJDBCDataModel
Creates a MySQLBooleanPrefJDBCDataModel using the DataSource found under the given name, and using default table/column names.
MySQLBooleanPrefJDBCDataModel(DataSource) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.MySQLBooleanPrefJDBCDataModel
Creates a MySQLBooleanPrefJDBCDataModel using the given DataSource and default table/column names.
MySQLBooleanPrefJDBCDataModel(DataSource, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.MySQLBooleanPrefJDBCDataModel
Creates a MySQLBooleanPrefJDBCDataModel using the given DataSource and the given table/column names.
MySQLJDBCDataModel - Class in org.apache.mahout.cf.taste.impl.model.jdbc
A JDBCDataModel backed by a MySQL database and accessed via JDBC.
MySQLJDBCDataModel() - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.MySQLJDBCDataModel
Creates a MySQLJDBCDataModel using the default DataSource (named AbstractJDBCComponent.DEFAULT_DATASOURCE_NAME) and default table/column names.
MySQLJDBCDataModel(String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.MySQLJDBCDataModel
Creates a MySQLJDBCDataModel using the DataSource found under the given name, and using default table/column names.
MySQLJDBCDataModel(DataSource) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.MySQLJDBCDataModel
Creates a MySQLJDBCDataModel using the given DataSource and default table/column names.
MySQLJDBCDataModel(DataSource, String, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.MySQLJDBCDataModel
Creates a MySQLJDBCDataModel using the given DataSource and the given table/column names.
MySQLJDBCInMemoryItemSimilarity - Class in org.apache.mahout.cf.taste.impl.similarity.jdbc
 
MySQLJDBCInMemoryItemSimilarity() - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.MySQLJDBCInMemoryItemSimilarity
 
MySQLJDBCInMemoryItemSimilarity(String) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.MySQLJDBCInMemoryItemSimilarity
 
MySQLJDBCInMemoryItemSimilarity(DataSource) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.MySQLJDBCInMemoryItemSimilarity
 
MySQLJDBCInMemoryItemSimilarity(DataSource, String) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.MySQLJDBCInMemoryItemSimilarity
 
MySQLJDBCItemSimilarity - Class in org.apache.mahout.cf.taste.impl.similarity.jdbc
An ItemSimilarity backed by a MySQL database and accessed via JDBC.
MySQLJDBCItemSimilarity() - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.MySQLJDBCItemSimilarity
 
MySQLJDBCItemSimilarity(String) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.MySQLJDBCItemSimilarity
 
MySQLJDBCItemSimilarity(DataSource) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.MySQLJDBCItemSimilarity
 
MySQLJDBCItemSimilarity(DataSource, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.MySQLJDBCItemSimilarity
 

N

nextDocId - Variable in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 
nextKeyValue() - Method in class org.apache.mahout.text.WholeFileRecordReader
 
nextKeyValue() - Method in class org.apache.mahout.text.wikipedia.XmlInputFormat.XmlRecordReader
 
nextLogRecord - Variable in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 
NO_NORMALIZING - Static variable in class org.apache.mahout.utils.vectors.lucene.LuceneIterable
 
normPower - Variable in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 
NUM_WORDS_OPTION - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 
numErrorDocs - Variable in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 

O

org.apache.mahout.benchmark - package org.apache.mahout.benchmark
 
org.apache.mahout.cf.taste.impl.model.cassandra - package org.apache.mahout.cf.taste.impl.model.cassandra
 
org.apache.mahout.cf.taste.impl.model.hbase - package org.apache.mahout.cf.taste.impl.model.hbase
 
org.apache.mahout.cf.taste.impl.model.jdbc - package org.apache.mahout.cf.taste.impl.model.jdbc
 
org.apache.mahout.cf.taste.impl.model.mongodb - package org.apache.mahout.cf.taste.impl.model.mongodb
 
org.apache.mahout.cf.taste.impl.similarity.jdbc - package org.apache.mahout.cf.taste.impl.similarity.jdbc
 
org.apache.mahout.cf.taste.web - package org.apache.mahout.cf.taste.web
 
org.apache.mahout.classifier - package org.apache.mahout.classifier
 
org.apache.mahout.clustering.cdbw - package org.apache.mahout.clustering.cdbw
 
org.apache.mahout.clustering.conversion - package org.apache.mahout.clustering.conversion
 
org.apache.mahout.clustering.evaluation - package org.apache.mahout.clustering.evaluation
 
org.apache.mahout.clustering.lda - package org.apache.mahout.clustering.lda
 
org.apache.mahout.text - package org.apache.mahout.text
 
org.apache.mahout.text.wikipedia - package org.apache.mahout.text.wikipedia
 
org.apache.mahout.utils - package org.apache.mahout.utils
 
org.apache.mahout.utils.clustering - package org.apache.mahout.utils.clustering
 
org.apache.mahout.utils.email - package org.apache.mahout.utils.email
 
org.apache.mahout.utils.io - package org.apache.mahout.utils.io
 
org.apache.mahout.utils.nlp.collocations.llr - package org.apache.mahout.utils.nlp.collocations.llr
 
org.apache.mahout.utils.regex - package org.apache.mahout.utils.regex
 
org.apache.mahout.utils.vectors - package org.apache.mahout.utils.vectors
 
org.apache.mahout.utils.vectors.arff - package org.apache.mahout.utils.vectors.arff
 
org.apache.mahout.utils.vectors.csv - package org.apache.mahout.utils.vectors.csv
 
org.apache.mahout.utils.vectors.io - package org.apache.mahout.utils.vectors.io
 
org.apache.mahout.utils.vectors.lucene - package org.apache.mahout.utils.vectors.lucene
 
OUTPUT_FORMAT_OPT - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 

P

parseMailboxLineByLine(String, InputStream, Mapper<IntWritable, BytesWritable, Text, Text>.Context) - Method in class org.apache.mahout.text.SequenceFilesFromMailArchivesMapper
 
parseMboxLineByLine(File) - Method in class org.apache.mahout.utils.email.MailProcessor
Parses one complete mail archive, writing output to the writer constructor parameter.
parseOptions() - Method in class org.apache.mahout.text.SequenceFilesFromDirectory
Override this method in order to parse your additional options from the command line.
PlusBenchmark - Class in org.apache.mahout.benchmark
 
PlusBenchmark(VectorBenchmarks) - Constructor for class org.apache.mahout.benchmark.PlusBenchmark
 
POINTS_DIR_OPTION - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 
PostgreSQLBooleanPrefJDBCDataModel - Class in org.apache.mahout.cf.taste.impl.model.jdbc
See also PostgreSQLJDBCDataModel -- same except deals with a table without preference info.
PostgreSQLBooleanPrefJDBCDataModel() - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.PostgreSQLBooleanPrefJDBCDataModel
Creates a PostgreSQLBooleanPrefJDBCDataModel using the default DataSource (named AbstractJDBCComponent.DEFAULT_DATASOURCE_NAME) and default table/column names.
PostgreSQLBooleanPrefJDBCDataModel(String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.PostgreSQLBooleanPrefJDBCDataModel
Creates a PostgreSQLBooleanPrefJDBCDataModel using the DataSource found under the given name, and using default table/column names.
PostgreSQLBooleanPrefJDBCDataModel(DataSource) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.PostgreSQLBooleanPrefJDBCDataModel
Creates a PostgreSQLBooleanPrefJDBCDataModel using the given DataSource and default table/column names.
PostgreSQLBooleanPrefJDBCDataModel(DataSource, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.PostgreSQLBooleanPrefJDBCDataModel
Creates a PostgreSQLBooleanPrefJDBCDataModel using the given DataSource and the given table/column names.
PostgreSQLJDBCDataModel - Class in org.apache.mahout.cf.taste.impl.model.jdbc
A JDBCDataModel backed by a PostgreSQL database and accessed via JDBC.
PostgreSQLJDBCDataModel() - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.PostgreSQLJDBCDataModel
Creates a PostgreSQLJDBCDataModel using the default DataSource (named AbstractJDBCComponent.DEFAULT_DATASOURCE_NAME) and default table/column names.
PostgreSQLJDBCDataModel(String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.PostgreSQLJDBCDataModel
Creates a PostgreSQLJDBCDataModel using the DataSource found under the given name, and using default table/column names.
PostgreSQLJDBCDataModel(DataSource) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.PostgreSQLJDBCDataModel
Creates a PostgreSQLJDBCDataModel using the given DataSource and default table/column names.
PostgreSQLJDBCDataModel(DataSource, String, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.PostgreSQLJDBCDataModel
Creates a PostgreSQLJDBCDataModel using the given DataSource and the given table/column names.
PrefixAdditionFilter - Class in org.apache.mahout.text
Default parser for parsing text into sequence files.
PrefixAdditionFilter(Configuration, String, Map<String, String>, ChunkedWriter, Charset, FileSystem) - Constructor for class org.apache.mahout.text.PrefixAdditionFilter
 
printClusters(String[]) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
printCountsTable(ConfusionMatrix, PrintStream) - Static method in class org.apache.mahout.classifier.ConfusionMatrixDumper
 
printFooter(PrintStream) - Static method in class org.apache.mahout.classifier.ConfusionMatrixDumper
 
printGrayTable(ConfusionMatrix, PrintStream) - Static method in class org.apache.mahout.classifier.ConfusionMatrixDumper
 
printHeader(PrintStream, CharSequence) - Static method in class org.apache.mahout.classifier.ConfusionMatrixDumper
 
printRepresentativePoints(Path, int) - Static method in class org.apache.mahout.clustering.evaluation.RepresentativePointsDriver
Utility to print out representative points
printSummaryTable(ConfusionMatrix, PrintStream) - Static method in class org.apache.mahout.classifier.ConfusionMatrixDumper
 
process(FileStatus, Path) - Method in class org.apache.mahout.text.PrefixAdditionFilter
 
process(FileStatus, Path) - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
processDate(String, int) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
processNominal(String, String) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
processNumeric(String) - Static method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
processString(String) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 

Q

QUOTED_REGEX_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 

R

RAND_FN_DENSE - Static variable in class org.apache.mahout.benchmark.VectorBenchmarks
 
RAND_FN_SEQ - Static variable in class org.apache.mahout.benchmark.VectorBenchmarks
 
RAND_SPARSE_VECTOR - Static variable in class org.apache.mahout.benchmark.VectorBenchmarks
 
randBool() - Method in class org.apache.mahout.benchmark.BenchmarkRunner.BenchmarkFn
 
randBool() - Method in class org.apache.mahout.benchmark.BenchmarkRunner.BenchmarkFnD
 
randIndex() - Method in class org.apache.mahout.benchmark.BenchmarkRunner.BenchmarkFn
 
randIndex() - Method in class org.apache.mahout.benchmark.BenchmarkRunner.BenchmarkFnD
 
readPoints(Path, long, Configuration) - Static method in class org.apache.mahout.utils.clustering.ClusterDumper
 
readResourceToTempFile(String) - Static method in class org.apache.mahout.cf.taste.web.RecommenderWrapper
Reads the given resource into a temporary file.
recommend(long, int) - Method in class org.apache.mahout.cf.taste.web.RecommenderWrapper
 
recommend(long, int, IDRescorer) - Method in class org.apache.mahout.cf.taste.web.RecommenderWrapper
 
RecommenderServlet - Class in org.apache.mahout.cf.taste.web
A servlet which returns recommendations, as its name implies.
RecommenderServlet() - Constructor for class org.apache.mahout.cf.taste.web.RecommenderServlet
 
RecommenderSingleton - Class in org.apache.mahout.cf.taste.web
A singleton which holds an instance of a Recommender.
RecommenderWrapper - Class in org.apache.mahout.cf.taste.web
Users of the packaging and deployment mechanism in this module need to produce a Recommender implementation with a no-arg constructor, which will internally build the desired Recommender and delegate to it.
RecommenderWrapper() - Constructor for class org.apache.mahout.cf.taste.web.RecommenderWrapper
 
reduce(IntWritable, Iterable<WeightedVectorWritable>, Reducer<IntWritable, WeightedVectorWritable, IntWritable, VectorWritable>.Context) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsReducer
 
reduce(Text, Iterable<Text>, Reducer<Text, Text, Text, Text>.Context) - Method in class org.apache.mahout.text.wikipedia.WikipediaDatasetCreatorReducer
 
reduce(WritableComparable<?>, Iterable<Writable>, Reducer<WritableComparable<?>, Writable, WritableComparable<?>, Writable>.Context) - Method in class org.apache.mahout.utils.SplitInputJob.SplitInputReducer
Randomly allocate key value pairs between test and training sets.
REFERENCES_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
refresh(Collection<Refreshable>) - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
refresh(Collection<Refreshable>) - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
refresh(Collection<Refreshable>) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
refresh(Collection<Refreshable>) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
refresh(Collection<Refreshable>) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
Triggers "refresh" -- whatever that means -- of the implementation.
refresh(Collection<Refreshable>) - Method in class org.apache.mahout.cf.taste.impl.similarity.jdbc.AbstractJDBCItemSimilarity
 
refresh(Collection<Refreshable>) - Method in class org.apache.mahout.cf.taste.web.RecommenderWrapper
 
refreshData(String, Iterable<List<String>>, boolean) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
Adds/removes (user, item) pairs to/from the model.
REFS - Static variable in class org.apache.mahout.utils.email.MailOptions
 
REFS_PREFIX - Static variable in class org.apache.mahout.utils.email.MailProcessor
 
REGEX - Static variable in class org.apache.mahout.utils.regex.RegexMapper
 
RegexConverterDriver - Class in org.apache.mahout.utils.regex
Experimental
RegexConverterDriver() - Constructor for class org.apache.mahout.utils.regex.RegexConverterDriver
 
RegexFormatter - Interface in org.apache.mahout.utils.regex
 
RegexMapper - Class in org.apache.mahout.utils.regex
 
RegexMapper() - Constructor for class org.apache.mahout.utils.regex.RegexMapper
 
RegexTransformer - Interface in org.apache.mahout.utils.regex
Transforms the match of a regular expression.
RegexUtils - Class in org.apache.mahout.utils.regex
 
RELATION - Static variable in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
ReloadFromJDBCDataModel - Class in org.apache.mahout.cf.taste.impl.model.jdbc
A DataModel which loads, and can re-load, data from a JDBC-backed JDBCDataModel into memory, as a GenericDataModel or GenericBooleanPrefDataModel.
ReloadFromJDBCDataModel(JDBCDataModel) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
REMOVE_PREFERENCE_SQL_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
removePreference(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
removePreference(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
removePreference(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
removePreference(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
removePreference(long, long) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
removePreference(long, long) - Method in class org.apache.mahout.cf.taste.web.RecommenderWrapper
 
removeQuotes(String) - Static method in enum org.apache.mahout.utils.vectors.arff.ARFFType
Remove quotes and leading/trailing whitespace from a single or double quoted string
RepresentativePointsDriver - Class in org.apache.mahout.clustering.evaluation
 
RepresentativePointsMapper - Class in org.apache.mahout.clustering.evaluation
 
RepresentativePointsMapper() - Constructor for class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
RepresentativePointsReducer - Class in org.apache.mahout.clustering.evaluation
 
RepresentativePointsReducer() - Constructor for class org.apache.mahout.clustering.evaluation.RepresentativePointsReducer
 
RowIdJob - Class in org.apache.mahout.utils.vectors
Converts a vector representation of documents into a document x terms matrix.
RowIdJob() - Constructor for class org.apache.mahout.utils.vectors.RowIdJob
 
run(String[]) - Method in class org.apache.mahout.classifier.ConfusionMatrixDumper
 
run(String[]) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsDriver
 
run(Configuration, Path, Path, Path, DistanceMeasure, int, boolean) - Static method in class org.apache.mahout.clustering.evaluation.RepresentativePointsDriver
 
run(String[]) - Method in class org.apache.mahout.text.SequenceFilesFromDirectory
 
run(String[]) - Method in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
run(String[]) - Method in class org.apache.mahout.text.TextParagraphSplittingJob
 
run(String[]) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
run(String[]) - Method in class org.apache.mahout.utils.MatrixDumper
 
run(String[]) - Method in class org.apache.mahout.utils.regex.RegexConverterDriver
 
run(String[]) - Method in class org.apache.mahout.utils.SequenceFileDumper
 
run(String[]) - Method in class org.apache.mahout.utils.SplitInput
 
run(Configuration, Path, Path, int, float) - Static method in class org.apache.mahout.utils.SplitInputJob
Run job to downsample, randomly permute and split data into test and training sets.
run(Mapper<WritableComparable<?>, Writable, WritableComparable<?>, Writable>.Context) - Method in class org.apache.mahout.utils.SplitInputJob.SplitInputMapper
Only run map() for one out of every downsampleFactor inputs
run(String[]) - Method in class org.apache.mahout.utils.vectors.RowIdJob
 
run(String[]) - Method in class org.apache.mahout.utils.vectors.VectorDumper
 
runJob(Path, Path, String) - Static method in class org.apache.mahout.clustering.conversion.InputDriver
 
runJob(String, String, String, boolean, Class<? extends Analyzer>) - Static method in class org.apache.mahout.text.wikipedia.WikipediaDatasetCreatorDriver
Run the job
runJob(String, String, String, boolean, boolean, boolean) - Static method in class org.apache.mahout.text.WikipediaToSequenceFile
Run the job

S

SAMPLE_POINTS - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 
separation() - Method in class org.apache.mahout.clustering.cdbw.CDbwEvaluator
Calculate the separation of clusters (eqn 4) taking into account both the distances between the clusters' closest points and the Inter-cluster density.
SEPARATOR_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
SEQ_FN_DENSE - Static variable in class org.apache.mahout.benchmark.VectorBenchmarks
 
SEQ_FN_RAND - Static variable in class org.apache.mahout.benchmark.VectorBenchmarks
 
SEQ_SPARSE_VECTOR - Static variable in class org.apache.mahout.benchmark.VectorBenchmarks
 
SequenceFileDumper - Class in org.apache.mahout.utils
 
SequenceFileDumper() - Constructor for class org.apache.mahout.utils.SequenceFileDumper
 
SequenceFilesFromDirectory - Class in org.apache.mahout.text
Converts a directory of text documents into SequenceFiles of specified chunkSize.
SequenceFilesFromDirectory() - Constructor for class org.apache.mahout.text.SequenceFilesFromDirectory
 
SequenceFilesFromDirectoryFilter - Class in org.apache.mahout.text
Extend this class to add your own parsing logic to SequenceFilesFromDirectory.
SequenceFilesFromDirectoryFilter(Configuration, String, Map<String, String>, ChunkedWriter, Charset, FileSystem) - Constructor for class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
SequenceFilesFromDirectoryMapper - Class in org.apache.mahout.text
Map class for SequenceFilesFromDirectory MR job
SequenceFilesFromDirectoryMapper() - Constructor for class org.apache.mahout.text.SequenceFilesFromDirectoryMapper
 
SequenceFilesFromMailArchives - Class in org.apache.mahout.text
Converts a directory of gzipped mail archives into SequenceFiles of specified chunkSize.
SequenceFilesFromMailArchives() - Constructor for class org.apache.mahout.text.SequenceFilesFromMailArchives
 
SequenceFilesFromMailArchivesMapper - Class in org.apache.mahout.text
Map Class for the SequenceFilesFromMailArchives job
SequenceFilesFromMailArchivesMapper() - Constructor for class org.apache.mahout.text.SequenceFilesFromMailArchivesMapper
 
SequenceFileVectorWriter - Class in org.apache.mahout.utils.vectors.io
Writes out Vectors to a SequenceFile.
SequenceFileVectorWriter(SequenceFile.Writer) - Constructor for class org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter
 
SerializationBenchmark - Class in org.apache.mahout.benchmark
 
SerializationBenchmark(VectorBenchmarks) - Constructor for class org.apache.mahout.benchmark.SerializationBenchmark
 
SERIALIZE - Static variable in class org.apache.mahout.benchmark.SerializationBenchmark
 
serializeBenchmark() - Method in class org.apache.mahout.benchmark.SerializationBenchmark
 
SET_PREFERENCE_SQL_KEY - Static variable in class org.apache.mahout.cf.taste.impl.model.jdbc.GenericJDBCDataModel
 
setAnalyzer(Analyzer) - Method in class org.apache.mahout.utils.regex.AnalyzerTransformer
 
setBodySeparator(String) - Method in class org.apache.mahout.utils.email.MailOptions
Sets the separator to use in the output between lines in the body, the default is "\n".
setCallback(SplitInput.SplitCallback) - Method in class org.apache.mahout.utils.SplitInput
Sets the callback used to inform the caller that an input file has been successfully split
setChain(List<RegexTransformer>) - Method in class org.apache.mahout.utils.regex.ChainTransformer
 
setCharset(Charset) - Method in class org.apache.mahout.utils.email.MailOptions
Sets the encoding of the input
setCharset(Charset) - Method in class org.apache.mahout.utils.SplitInput
Set the charset used to read and write files
setChunkSize(int) - Method in class org.apache.mahout.utils.email.MailOptions
Sets the size of each generated sequence file, in Megabytes.
setDelimiter(String) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
setDictOut(String) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
setExpectations(String, long) - Method in class org.apache.mahout.utils.vectors.lucene.TFDFMapper
 
setField(String) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
setIdField(String) - Method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
setIdField(String) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
setIncludeBody(boolean) - Method in class org.apache.mahout.utils.email.MailOptions
Sets whether mail bodies are included in the output
setInput(File) - Method in class org.apache.mahout.utils.email.MailOptions
 
setInputDirectory(Path) - Method in class org.apache.mahout.utils.SplitInput
Set the directory from which input data will be read when the SplitInput.splitDirectory() method is invoked
setKeepPct(int) - Method in class org.apache.mahout.utils.SplitInput
Sets the percentage of the input data to keep in a map reduce split input job
setLoginTimeout(int) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ConnectionPoolDataSource
 
setLogWriter(PrintWriter) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ConnectionPoolDataSource
 
setLongParameter(PreparedStatement, int, long) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
Subclasses may wish to override this if ID values in the database are not numeric.
setLuceneDir(String) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
setMapRedOutputDirectory(Path) - Method in class org.apache.mahout.utils.SplitInput
 
setMaxDFPercent(int) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
setMaxDocs(long) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
setMaxPercentErrorDocs(double) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
setMaxPointsPerCluster(long) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
setMinDf(int) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
setNorm(double) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
setNumTopFeatures(int) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
setOutFile(String) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
setOutput(String) - Method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
setOutputDir(String) - Method in class org.apache.mahout.utils.email.MailOptions
Sets the output directory where sequence files will be written.
setOutputFormat(ClusterDumper.OUTPUT_FORMAT) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
Convenience function to set the output format during testing.
setPatternOrder(Map<String, Integer>) - Method in class org.apache.mahout.utils.email.MailOptions
 
setPatternsToMatch(Pattern[]) - Method in class org.apache.mahout.utils.email.MailOptions
Sets the list of patterns to be applied in the given order to extract metadata fields (to, from, subject, etc.) from the input
setPreference(long, long, float) - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
setPreference(long, long, float) - Method in class org.apache.mahout.cf.taste.impl.model.hbase.HBaseDataModel
 
setPreference(long, long, float) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractBooleanPrefJDBCDataModel
 
setPreference(long, long, float) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel
 
setPreference(long, long, float) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.PostgreSQLBooleanPrefJDBCDataModel
Override since PostgreSQL doesn't have the same non-standard capability that MySQL has, to optionally ignore an insert that fails since the row exists already.
setPreference(long, long, float) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.PostgreSQLJDBCDataModel
Override since PostgreSQL doesn't have the same non-standard capability that MySQL has, to optionally insert or update in one statement.
setPreference(long, long, float) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ReloadFromJDBCDataModel
 
setPreference(long, long, float) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92BooleanPrefJDBCDataModel
Override since SQL92 doesn't have the same non-standard capability that MySQL has, to optionally ignore an insert that fails since the row exists already.
setPreference(long, long, float) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92JDBCDataModel
Override since SQL92 doesn't have the same non-standard capability that MySQL has, to optionally insert or update in one statement.
setPreference(long, long, float) - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
setPreference(long, long, float) - Method in class org.apache.mahout.cf.taste.web.RecommenderWrapper
 
setPrefix(String) - Method in class org.apache.mahout.utils.email.MailOptions
Sets the prefix that is combined with the archive name and with message ids to create SequenceFile keys.
setQuotedTextPattern(Pattern) - Method in class org.apache.mahout.utils.email.MailOptions
Sets the Pattern to use to identify lines that are quoted text.
setRelation(String) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
setRelation(String) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
setSeparator(String) - Method in class org.apache.mahout.utils.email.MailOptions
Sets the separator to use in the output between metadata items (to, from, etc.).
setSeqDictOut(String) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
setSplitLocation(int) - Method in class org.apache.mahout.utils.SplitInput
Set the location of the start of the test/training data split.
setStripQuotedText(boolean) - Method in class org.apache.mahout.utils.email.MailOptions
Sets whether quoted text such as lines starting with | or > is stripped off.
setSubString(int) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
setTermDictionary(String, String) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
setTestOutputDirectory(Path) - Method in class org.apache.mahout.utils.SplitInput
Set the directory to which test data will be written.
setTestRandomSelectionPct(int) - Method in class org.apache.mahout.utils.SplitInput
Sets number of random input samples that will be saved to the test set as a percentage of the size of the input set.
setTestRandomSelectionSize(int) - Method in class org.apache.mahout.utils.SplitInput
Sets number of random input samples that will be saved to the test set.
setTestSplitPct(int) - Method in class org.apache.mahout.utils.SplitInput
Sets the percentage of the input data to allocate to the test split
setTestSplitSize(int) - Method in class org.apache.mahout.utils.SplitInput
 
setTrainingOutputDirectory(Path) - Method in class org.apache.mahout.utils.SplitInput
Set the directory to which training data will be written.
setup(Mapper<LongWritable, Text, Text, VectorWritable>.Context) - Method in class org.apache.mahout.clustering.conversion.InputMapper
 
setup(Mapper<IntWritable, WeightedVectorWritable, IntWritable, WeightedVectorWritable>.Context) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
setup(Reducer<IntWritable, WeightedVectorWritable, IntWritable, VectorWritable>.Context) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsReducer
 
setup(Mapper<IntWritable, BytesWritable, Text, Text>.Context) - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryMapper
 
setup(Mapper<IntWritable, BytesWritable, Text, Text>.Context) - Method in class org.apache.mahout.text.SequenceFilesFromMailArchivesMapper
 
setup(Mapper<LongWritable, Text, Text, Text>.Context) - Method in class org.apache.mahout.text.wikipedia.WikipediaDatasetCreatorMapper
 
setup(Mapper<LongWritable, Text, Text, Text>.Context) - Method in class org.apache.mahout.text.wikipedia.WikipediaMapper
 
setup(Mapper<LongWritable, Text, LongWritable, Text>.Context) - Method in class org.apache.mahout.utils.regex.RegexMapper
 
setup(Mapper<WritableComparable<?>, Writable, WritableComparable<?>, Writable>.Context) - Method in class org.apache.mahout.utils.SplitInputJob.SplitInputMapper
 
setup(Reducer<WritableComparable<?>, Writable, WritableComparable<?>, Writable>.Context) - Method in class org.apache.mahout.utils.SplitInputJob.SplitInputReducer
 
setUseMapRed(boolean) - Method in class org.apache.mahout.utils.SplitInput
Set to true to use map reduce to split the input
setWeightType(String) - Method in class org.apache.mahout.utils.vectors.lucene.Driver
 
skippedErrorMessages - Variable in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 
splitComplete(Path, int, int, int, int) - Method in interface org.apache.mahout.utils.SplitInput.SplitCallback
 
splitDirectory() - Method in class org.apache.mahout.utils.SplitInput
Perform a split on the directory specified by SplitInput.setInputDirectory(Path) by calling SplitInput.splitFile(Path) on each file found within that directory.
splitDirectory(Path) - Method in class org.apache.mahout.utils.SplitInput
Perform a split on the specified directory by calling SplitInput.splitFile(Path) on each file found within that directory.
splitDirectory(Configuration, Path) - Method in class org.apache.mahout.utils.SplitInput
 
splitFile(Path) - Method in class org.apache.mahout.utils.SplitInput
Perform a split on the specified input file.
SplitInput - Class in org.apache.mahout.utils
A utility for splitting input into training and test sets in order to perform cross-validation. It accepts files in the input format used by the Bayes classifiers, anything else that has one item per line, or SequenceFiles (key/value).
SplitInput() - Constructor for class org.apache.mahout.utils.SplitInput
 
SplitInput.SplitCallback - Interface in org.apache.mahout.utils
Used to pass information back to a caller once a file has been split without the need for a data object
SplitInputComparator() - Constructor for class org.apache.mahout.utils.SplitInputJob.SplitInputComparator
 
SplitInputJob - Class in org.apache.mahout.utils
Class which implements a map reduce version of SplitInput.
SplitInputJob.SplitInputComparator - Class in org.apache.mahout.utils
Randomly permute key value pairs
SplitInputJob.SplitInputMapper - Class in org.apache.mahout.utils
Mapper which downsamples the input by downsamplingFactor
SplitInputJob.SplitInputReducer - Class in org.apache.mahout.utils
Reducer which uses MultipleOutputs to randomly allocate key value pairs between test and training outputs
SplitInputMapper() - Constructor for class org.apache.mahout.utils.SplitInputJob.SplitInputMapper
 
SplitInputReducer() - Constructor for class org.apache.mahout.utils.SplitInputJob.SplitInputReducer
 
SplitMap() - Constructor for class org.apache.mahout.text.TextParagraphSplittingJob.SplitMap
 
SQL92BooleanPrefJDBCDataModel - Class in org.apache.mahout.cf.taste.impl.model.jdbc
See also SQL92JDBCDataModel -- same except deals with a table without preference info.
SQL92BooleanPrefJDBCDataModel() - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92BooleanPrefJDBCDataModel
Creates a SQL92BooleanPrefJDBCDataModel using the default DataSource (named AbstractJDBCComponent.DEFAULT_DATASOURCE_NAME) and default table/column names.
SQL92BooleanPrefJDBCDataModel(String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92BooleanPrefJDBCDataModel
Creates a SQL92BooleanPrefJDBCDataModel using the default DataSource found under the given name, and using default table/column names.
SQL92BooleanPrefJDBCDataModel(DataSource) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92BooleanPrefJDBCDataModel
Creates a SQL92BooleanPrefJDBCDataModel using the given DataSource and default table/column names.
SQL92BooleanPrefJDBCDataModel(DataSource, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92BooleanPrefJDBCDataModel
Creates a SQL92BooleanPrefJDBCDataModel using the given DataSource and the given table/column names.
SQL92JDBCDataModel - Class in org.apache.mahout.cf.taste.impl.model.jdbc
A JDBCDataModel backed by a SQL92 compatible database and accessed via JDBC.
SQL92JDBCDataModel() - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92JDBCDataModel
Creates a SQL92JDBCDataModel using the default DataSource (named AbstractJDBCComponent.DEFAULT_DATASOURCE_NAME) and default table/column names.
SQL92JDBCDataModel(String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92JDBCDataModel
Creates a SQL92JDBCDataModel using the default DataSource found under the given name, and using default table/column names.
SQL92JDBCDataModel(DataSource) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92JDBCDataModel
Creates a SQL92JDBCDataModel using the given DataSource and default table/column names.
SQL92JDBCDataModel(DataSource, String, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.model.jdbc.SQL92JDBCDataModel
Creates a SQL92JDBCDataModel using the given DataSource and the given table/column names.
SQL92JDBCInMemoryItemSimilarity - Class in org.apache.mahout.cf.taste.impl.similarity.jdbc
 
SQL92JDBCInMemoryItemSimilarity() - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.SQL92JDBCInMemoryItemSimilarity
 
SQL92JDBCInMemoryItemSimilarity(String) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.SQL92JDBCInMemoryItemSimilarity
 
SQL92JDBCInMemoryItemSimilarity(DataSource) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.SQL92JDBCInMemoryItemSimilarity
 
SQL92JDBCInMemoryItemSimilarity(DataSource, String) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.SQL92JDBCInMemoryItemSimilarity
 
SQL92JDBCItemSimilarity - Class in org.apache.mahout.cf.taste.impl.similarity.jdbc
 
SQL92JDBCItemSimilarity() - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.SQL92JDBCItemSimilarity
 
SQL92JDBCItemSimilarity(String) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.SQL92JDBCItemSimilarity
 
SQL92JDBCItemSimilarity(DataSource) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.SQL92JDBCItemSimilarity
 
SQL92JDBCItemSimilarity(DataSource, String, String, String, String) - Constructor for class org.apache.mahout.cf.taste.impl.similarity.jdbc.SQL92JDBCItemSimilarity
 
START_TAG_KEY - Static variable in class org.apache.mahout.text.wikipedia.XmlInputFormat
 
STATE_IN_KEY - Static variable in class org.apache.mahout.clustering.evaluation.RepresentativePointsDriver
 
STRIP_QUOTED_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
SUBJECT - Static variable in class org.apache.mahout.utils.email.MailOptions
 
SUBJECT_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
SUBJECT_PREFIX - Static variable in class org.apache.mahout.utils.email.MailProcessor
 
SUBSTRING_OPTION - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 

T

TermEntry - Class in org.apache.mahout.utils.vectors
Each entry in a TermInfo dictionary.
TermEntry(String, int, int) - Constructor for class org.apache.mahout.utils.vectors.TermEntry
 
terminfo - Variable in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 
TermInfo - Interface in org.apache.mahout.utils.vectors
Contains the term dictionary information associated with a vectorized collection of text documents
TermInfoWriter - Interface in org.apache.mahout.utils.vectors.io
 
TextParagraphSplittingJob - Class in org.apache.mahout.text
 
TextParagraphSplittingJob() - Constructor for class org.apache.mahout.text.TextParagraphSplittingJob
 
TextParagraphSplittingJob.SplitMap - Class in org.apache.mahout.text
 
TextualVectorWriter - Class in org.apache.mahout.utils.vectors.io
Write out the vectors to any Writer using Vector.asFormatString(), one per line by default.
TextualVectorWriter(Writer) - Constructor for class org.apache.mahout.utils.vectors.io.TextualVectorWriter
 
TFDFMapper - Class in org.apache.mahout.utils.vectors.lucene
Not thread-safe
TFDFMapper(int, Weight, TermInfo) - Constructor for class org.apache.mahout.utils.vectors.lucene.TFDFMapper
 
TimesBenchmark - Class in org.apache.mahout.benchmark
 
TimesBenchmark(VectorBenchmarks) - Constructor for class org.apache.mahout.benchmark.TimesBenchmark
 
TO - Static variable in class org.apache.mahout.utils.email.MailOptions
 
TO_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
TO_PREFIX - Static variable in class org.apache.mahout.utils.email.MailProcessor
 
topEntries(Vector, int) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
 
toString() - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
toString() - Method in class org.apache.mahout.cf.taste.impl.model.cassandra.CassandraDataModel
 
toString() - Method in class org.apache.mahout.cf.taste.impl.model.mongodb.MongoDBDataModel
 
toString() - Method in class org.apache.mahout.cf.taste.web.RecommenderServlet
 
totalTerms(String) - Method in class org.apache.mahout.utils.vectors.lucene.CachedTermInfo
 
totalTerms(String) - Method in interface org.apache.mahout.utils.vectors.TermInfo
 
toWeightedTerms(Collection<Pair<Integer, Double>>, String[]) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
 
TRANSFORMER_CLASS - Static variable in class org.apache.mahout.utils.regex.RegexMapper
 
transformMatch(String) - Method in class org.apache.mahout.utils.regex.AnalyzerTransformer
 
transformMatch(String) - Method in class org.apache.mahout.utils.regex.ChainTransformer
 
transformMatch(String) - Method in class org.apache.mahout.utils.regex.IdentityTransformer
 
transformMatch(String) - Method in interface org.apache.mahout.utils.regex.RegexTransformer
 
transformMatch(String) - Method in class org.apache.mahout.utils.regex.URLDecodeTransformer
 

U

unwrap(Class<T>) - Method in class org.apache.mahout.cf.taste.impl.model.jdbc.ConnectionPoolDataSource
 
URLDecodeTransformer - Class in org.apache.mahout.utils.regex
 
URLDecodeTransformer() - Constructor for class org.apache.mahout.utils.regex.URLDecodeTransformer
 
URLDecodeTransformer(String) - Constructor for class org.apache.mahout.utils.regex.URLDecodeTransformer
 

V

validate() - Method in class org.apache.mahout.utils.SplitInput
Validates that the current instance is in a consistent state
valueOf(String) - Static method in enum org.apache.mahout.utils.clustering.ClusterDumper.OUTPUT_FORMAT
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.mahout.utils.vectors.arff.ARFFType
Returns the enum constant of this type with the specified name.
values() - Static method in enum org.apache.mahout.utils.clustering.ClusterDumper.OUTPUT_FORMAT
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.mahout.utils.vectors.arff.ARFFType
Returns an array containing the constants of this enum type, in the order they are declared.
VectorBenchmarks - Class in org.apache.mahout.benchmark
 
VectorBenchmarks(int, int, int, int, int) - Constructor for class org.apache.mahout.benchmark.VectorBenchmarks
 
VectorDumper - Class in org.apache.mahout.utils.vectors
Can read in a SequenceFile of Vectors and dump out the results using Vector.asFormatString() to either the console or to a file.
VectorHelper - Class in org.apache.mahout.utils.vectors
Static utility methods related to vectors.
vectorToCSVString(Vector, boolean) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
 
vectorToCSVString(Vector, boolean, Appendable) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
 
vectorToJson(Vector, String[], int, boolean) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
 
VectorWriter - Interface in org.apache.mahout.utils.vectors.io
 
vIndex(int) - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 

W

weight - Variable in class org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator
 
WholeFileRecordReader - Class in org.apache.mahout.text
RecordReader used with the MultipleTextFileInputFormat class to read full files as k/v pairs and groups of files as single input splits.
WholeFileRecordReader(CombineFileSplit, TaskAttemptContext, Integer) - Constructor for class org.apache.mahout.text.WholeFileRecordReader
 
WikipediaAnalyzer - Class in org.apache.mahout.text.wikipedia
 
WikipediaAnalyzer() - Constructor for class org.apache.mahout.text.wikipedia.WikipediaAnalyzer
 
WikipediaAnalyzer(CharArraySet) - Constructor for class org.apache.mahout.text.wikipedia.WikipediaAnalyzer
 
WikipediaDatasetCreatorDriver - Class in org.apache.mahout.text.wikipedia
Create and run the Wikipedia Dataset Creator.
WikipediaDatasetCreatorMapper - Class in org.apache.mahout.text.wikipedia
Maps over the Wikipedia XML format and outputs all documents having a category listed in the input category file
WikipediaDatasetCreatorMapper() - Constructor for class org.apache.mahout.text.wikipedia.WikipediaDatasetCreatorMapper
 
WikipediaDatasetCreatorReducer - Class in org.apache.mahout.text.wikipedia
Can also be used as a local Combiner
WikipediaDatasetCreatorReducer() - Constructor for class org.apache.mahout.text.wikipedia.WikipediaDatasetCreatorReducer
 
WikipediaMapper - Class in org.apache.mahout.text.wikipedia
Maps over the Wikipedia XML format and outputs all documents having a category listed in the input category file
WikipediaMapper() - Constructor for class org.apache.mahout.text.wikipedia.WikipediaMapper
 
WikipediaToSequenceFile - Class in org.apache.mahout.text
Create and run the Wikipedia Dataset Creator.
WikipediaXmlSplitter - Class in org.apache.mahout.text.wikipedia
The Bayes example package provides some helper classes for training the Naive Bayes classifier on the Twenty Newsgroups data.
WrappedWriter - Interface in org.apache.mahout.utils.io
Convenience class for wrapping either a java.io.Writer or a SequenceFile.Writer with some basic functionality
write(Iterable<ClusterWritable>) - Method in class org.apache.mahout.utils.clustering.AbstractClusterWriter
 
write(Iterable<ClusterWritable>, long) - Method in class org.apache.mahout.utils.clustering.AbstractClusterWriter
 
write(ClusterWritable) - Method in class org.apache.mahout.utils.clustering.ClusterDumperWriter
 
write(Iterable<ClusterWritable>) - Method in interface org.apache.mahout.utils.clustering.ClusterWriter
Write all values in the Iterable to the output
write(ClusterWritable) - Method in interface org.apache.mahout.utils.clustering.ClusterWriter
Write out a Cluster
write(Iterable<ClusterWritable>, long) - Method in interface org.apache.mahout.utils.clustering.ClusterWriter
Write the first maxDocs to the output.
write(ClusterWritable) - Method in class org.apache.mahout.utils.clustering.CSVClusterWriter
 
write(ClusterWritable) - Method in class org.apache.mahout.utils.clustering.GraphMLClusterWriter
 
write(ClusterWritable) - Method in class org.apache.mahout.utils.clustering.JsonClusterWriter
Generate HashMap with cluster info and write as a single JSON formatted line
write(String, String) - Method in class org.apache.mahout.utils.io.ChunkedWrapper
 
write(String, String) - Method in class org.apache.mahout.utils.io.ChunkedWriter
Writes a new key-value pair, creating a new sequence file if necessary.
write(String, String) - Method in class org.apache.mahout.utils.io.IOWriterWrapper
Writes a new key and value, separating them with one space.
write(String, String) - Method in interface org.apache.mahout.utils.io.WrappedWriter
Writes a new key-value pair.
write(TermInfo) - Method in class org.apache.mahout.utils.vectors.io.DelimitedTermInfoWriter
 
write(Iterable<Vector>, long) - Method in class org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter
 
write(Vector) - Method in class org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter
 
write(Iterable<Vector>) - Method in class org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter
 
write(TermInfo) - Method in interface org.apache.mahout.utils.vectors.io.TermInfoWriter
 
write(Iterable<Vector>) - Method in class org.apache.mahout.utils.vectors.io.TextualVectorWriter
 
write(Iterable<Vector>, long) - Method in class org.apache.mahout.utils.vectors.io.TextualVectorWriter
 
write(Vector) - Method in class org.apache.mahout.utils.vectors.io.TextualVectorWriter
 
write(Iterable<Vector>) - Method in interface org.apache.mahout.utils.vectors.io.VectorWriter
Write all values in the Iterable to the output
write(Vector) - Method in interface org.apache.mahout.utils.vectors.io.VectorWriter
Write out a vector
write(Iterable<Vector>, long) - Method in interface org.apache.mahout.utils.vectors.io.VectorWriter
Write the first maxDocs to the output.
writeFile(String, File, long, ARFFModel, File, String, boolean) - Static method in class org.apache.mahout.utils.vectors.arff.Driver
 
writeLabelBindings(File, ARFFModel, String, boolean) - Static method in class org.apache.mahout.utils.vectors.arff.Driver
 
writeLabelBindings(Writer, ARFFModel, String) - Static method in class org.apache.mahout.utils.vectors.arff.Driver
 
writeLabelBindingsJSON(Writer, ARFFModel) - Static method in class org.apache.mahout.utils.vectors.arff.Driver
 
writer - Variable in class org.apache.mahout.utils.clustering.AbstractClusterWriter
 

X

XmlInputFormat - Class in org.apache.mahout.text.wikipedia
Reads records that are delimited by a specific begin/end tag.
XmlInputFormat() - Constructor for class org.apache.mahout.text.wikipedia.XmlInputFormat
 
XmlInputFormat.XmlRecordReader - Class in org.apache.mahout.text.wikipedia
XmlRecordReader class that reads through a given XML document and outputs XML blocks as records, as delimited by the start tag and end tag
XmlRecordReader(FileSplit, Configuration) - Constructor for class org.apache.mahout.text.wikipedia.XmlInputFormat.XmlRecordReader
 

Copyright © 2008–2017 The Apache Software Foundation. All rights reserved.