public final class Job extends AbstractJob
argMap, inputFile, inputPath, outputFile, outputPath, tempPath
Modifier and Type | Method and Description |
---|---|
static void |
main(String[] args) |
static void |
run(org.apache.hadoop.conf.Configuration conf,
org.apache.hadoop.fs.Path input,
org.apache.hadoop.fs.Path output,
DistanceMeasure measure,
double t1,
double t2,
double convergenceDelta,
int maxIterations)
Run the kmeans clustering job on an input dataset using the given distance measure, t1, t2 and iteration
parameters.
|
static void |
run(org.apache.hadoop.conf.Configuration conf,
org.apache.hadoop.fs.Path input,
org.apache.hadoop.fs.Path output,
DistanceMeasure measure,
int k,
double convergenceDelta,
int maxIterations)
Run the kmeans clustering job on an input dataset using the given the number of clusters k and iteration
parameters.
|
int |
run(String[] args) |
addFlag, addInputOption, addOption, addOption, addOption, addOption, addOutputOption, buildOption, buildOption, getAnalyzerClassFromOption, getCLIOption, getConf, getDimensions, getFloat, getFloat, getGroup, getInputFile, getInputPath, getInt, getInt, getOption, getOption, getOption, getOptions, getOutputFile, getOutputPath, getOutputPath, getTempPath, getTempPath, hasOption, keyFor, maybePut, parseArguments, parseArguments, parseDirectories, prepareJob, prepareJob, prepareJob, prepareJob, setConf, setS3SafeCombinedInputPath, shouldRunNextPhase
public static void run(org.apache.hadoop.conf.Configuration conf, org.apache.hadoop.fs.Path input, org.apache.hadoop.fs.Path output, DistanceMeasure measure, int k, double convergenceDelta, int maxIterations) throws Exception
conf
- the Configuration to useinput
- the String denoting the input directory pathoutput
- the String denoting the output directory pathmeasure
- the DistanceMeasure to usek
- the number of clusters in KmeansconvergenceDelta
- the double convergence criteria for iterationsmaxIterations
- the int maximum number of iterationsException
public static void run(org.apache.hadoop.conf.Configuration conf, org.apache.hadoop.fs.Path input, org.apache.hadoop.fs.Path output, DistanceMeasure measure, double t1, double t2, double convergenceDelta, int maxIterations) throws Exception
conf
- the Configuration to useinput
- the String denoting the input directory pathoutput
- the String denoting the output directory pathmeasure
- the DistanceMeasure to uset1
- the canopy T1 thresholdt2
- the canopy T2 thresholdconvergenceDelta
- the double convergence criteria for iterationsmaxIterations
- the int maximum number of iterationsException
Copyright © 2008–2017 The Apache Software Foundation. All rights reserved.