public final class ClusterClassificationDriver extends AbstractJob
argMap, inputFile, inputPath, outputFile, outputPath, tempPath
Modifier and Type | Method and Description |
---|---|
static void |
main(String[] args) |
static void |
run(org.apache.hadoop.conf.Configuration conf,
org.apache.hadoop.fs.Path input,
org.apache.hadoop.fs.Path clusteringOutputPath,
org.apache.hadoop.fs.Path output,
double clusterClassificationThreshold,
boolean emitMostLikely,
boolean runSequential) |
static void |
run(org.apache.hadoop.conf.Configuration conf,
org.apache.hadoop.fs.Path input,
org.apache.hadoop.fs.Path clusteringOutputPath,
org.apache.hadoop.fs.Path output,
Double clusterClassificationThreshold,
boolean emitMostLikely,
boolean runSequential)
Uses
ClusterClassifier to classify input vectors into their
respective clusters. |
int |
run(String[] args)
CLI to run Cluster Classification Driver.
|
addFlag, addInputOption, addOption, addOption, addOption, addOption, addOutputOption, buildOption, buildOption, getAnalyzerClassFromOption, getCLIOption, getConf, getDimensions, getFloat, getFloat, getGroup, getInputFile, getInputPath, getInt, getInt, getOption, getOption, getOption, getOptions, getOutputFile, getOutputPath, getOutputPath, getTempPath, getTempPath, hasOption, keyFor, maybePut, parseArguments, parseArguments, parseDirectories, prepareJob, prepareJob, prepareJob, prepareJob, setConf, setS3SafeCombinedInputPath, shouldRunNextPhase
public int run(String[] args) throws Exception
Exception
public static void run(org.apache.hadoop.conf.Configuration conf, org.apache.hadoop.fs.Path input, org.apache.hadoop.fs.Path clusteringOutputPath, org.apache.hadoop.fs.Path output, Double clusterClassificationThreshold, boolean emitMostLikely, boolean runSequential) throws IOException, InterruptedException, ClassNotFoundException
ClusterClassifier
to classify input vectors into their
respective clusters.input
- the input vectorsclusteringOutputPath
- the output path of clustering ( it reads clusters-*-final file
from here )output
- the location to store the classified vectorsclusterClassificationThreshold
- the threshold value of probability distribution function from 0.0
to 1.0. Any vector with pdf less that this threshold will not be
classified for the cluster.runSequential
- Run the process sequentially or in a mapreduce way.IOException
InterruptedException
ClassNotFoundException
public static void run(org.apache.hadoop.conf.Configuration conf, org.apache.hadoop.fs.Path input, org.apache.hadoop.fs.Path clusteringOutputPath, org.apache.hadoop.fs.Path output, double clusterClassificationThreshold, boolean emitMostLikely, boolean runSequential) throws IOException, InterruptedException, ClassNotFoundException
Copyright © 2008–2017 The Apache Software Foundation. All rights reserved.