Modifier and Type | Method and Description |
---|---|
static int |
BayesUtils.writeLabelIndex(org.apache.hadoop.conf.Configuration conf,
org.apache.hadoop.fs.Path indexPath,
Iterable<Pair<org.apache.hadoop.io.Text,org.apache.hadoop.io.IntWritable>> labels) |
Modifier and Type | Method and Description |
---|---|
static Pair<Matrix,Vector> |
TopicModel.loadModel(org.apache.hadoop.conf.Configuration conf,
org.apache.hadoop.fs.Path... modelPaths) |
Modifier and Type | Method and Description |
---|---|
Pair<List<? extends WeightedVector>,List<? extends WeightedVector>> |
BallKMeans.splitTrainTest(List<? extends WeightedVector> datapoints) |
Modifier and Type | Method and Description |
---|---|
static <A,B> Pair<A,B> |
Pair.of(A a,
B b) |
Pair<B,A> |
Pair.swap() |
Modifier and Type | Method and Description |
---|---|
int |
Pair.compareTo(Pair<A,B> other)
Defines an ordering on pairs that sorts by first value's natural ordering, ascending,
and then by second value's natural ordering.
|
Modifier and Type | Method and Description |
---|---|
protected Iterator<Pair<List<String>,Long>> |
StringRecordIterator.delegate() |
Modifier and Type | Method and Description |
---|---|
protected Pair<K,V> |
SequenceFileIterator.computeNext() |
Modifier and Type | Method and Description |
---|---|
protected Iterator<Pair<K,V>> |
SequenceFileDirIterator.delegate() |
Iterator<Pair<K,V>> |
SequenceFileDirIterable.iterator() |
Iterator<Pair<K,V>> |
SequenceFileIterable.iterator() |
Modifier and Type | Method and Description |
---|---|
static Iterator<Pair<org.apache.hadoop.io.Writable,Vector>> |
SSVDHelper.drmIterator(org.apache.hadoop.fs.FileSystem fs,
org.apache.hadoop.fs.Path glob,
org.apache.hadoop.conf.Configuration conf,
Deque<Closeable> closeables) |
Modifier and Type | Method and Description |
---|---|
static void |
HighDFWordsPruner.pruneVectors(org.apache.hadoop.fs.Path tfDir,
org.apache.hadoop.fs.Path prunedTFDir,
org.apache.hadoop.fs.Path prunedPartialTFDir,
long maxDF,
long minDF,
org.apache.hadoop.conf.Configuration baseConf,
Pair<Long[],List<org.apache.hadoop.fs.Path>> docFrequenciesFeatures,
float normPower,
boolean logNormalize,
int numReducers) |
Modifier and Type | Method and Description |
---|---|
static Pair<Long[],List<org.apache.hadoop.fs.Path>> |
TFIDFConverter.calculateDF(org.apache.hadoop.fs.Path input,
org.apache.hadoop.fs.Path output,
org.apache.hadoop.conf.Configuration baseConf,
int chunkSizeInMegabytes)
Calculates the document frequencies of all terms from the input set of vectors in
SequenceFile format. |
Modifier and Type | Method and Description |
---|---|
static void |
TFIDFConverter.processTfIdf(org.apache.hadoop.fs.Path input,
org.apache.hadoop.fs.Path output,
org.apache.hadoop.conf.Configuration baseConf,
Pair<Long[],List<org.apache.hadoop.fs.Path>> datasetFeatures,
int minDf,
long maxDF,
float normPower,
boolean logNormalize,
boolean sequentialAccessOutput,
boolean namedVector,
int numReducers)
Create Term Frequency-Inverse Document Frequency (Tf-Idf) Vectors from the input set of vectors in
SequenceFile format. |
Copyright © 2008–2017 The Apache Software Foundation. All rights reserved.