@Deprecated public class DistributedLanczosSolver extends LanczosSolver implements org.apache.hadoop.util.Tool
See Also: SSVDSolver
Modifier and Type | Class and Description |
---|---|
class |
DistributedLanczosSolver.DistributedLanczosSolverJob
Deprecated.
Inner subclass of AbstractJob so we get access to AbstractJob's functionality w.r.t.
|
LanczosSolver.TimingSection
Modifier and Type | Field and Description |
---|---|
static String |
RAW_EIGENVECTORS
Deprecated.
|
SAFE_MAX
Constructor and Description |
---|
DistributedLanczosSolver()
Deprecated.
|
Modifier and Type | Method and Description |
---|---|
org.apache.hadoop.conf.Configuration |
getConf()
Deprecated.
|
static Vector |
getInitialVector(VectorIterable corpus)
Deprecated.
For the distributed case, our best guess at a useful initial state for Lanczos is a vector that is
uniform over all input dimensions, L_2 normalized.
|
DistributedLanczosSolver.DistributedLanczosSolverJob |
job()
Deprecated.
|
static void |
main(String[] args)
Deprecated.
|
int |
run(org.apache.hadoop.fs.Path inputPath,
org.apache.hadoop.fs.Path outputPath,
org.apache.hadoop.fs.Path outputTmpPath,
org.apache.hadoop.fs.Path workingDirPath,
int numRows,
int numCols,
boolean isSymmetric,
int desiredRank)
Deprecated.
Run the solver to produce the raw eigenvectors
|
int |
run(org.apache.hadoop.fs.Path inputPath,
org.apache.hadoop.fs.Path outputPath,
org.apache.hadoop.fs.Path outputTmpPath,
org.apache.hadoop.fs.Path workingDirPath,
int numRows,
int numCols,
boolean isSymmetric,
int desiredRank,
double maxError,
double minEigenvalue,
boolean inMemory)
Deprecated.
Run the solver to produce raw eigenvectors, then run the EigenVerificationJob to clean them
|
int |
run(String[] strings)
Deprecated.
|
LanczosState |
runJob(org.apache.hadoop.conf.Configuration originalConfig,
LanczosState state,
int desiredRank,
boolean isSymmetric,
String outputEigenVectorPathString)
Deprecated.
|
LanczosState |
runJob(org.apache.hadoop.conf.Configuration originalConfig,
org.apache.hadoop.fs.Path inputPath,
org.apache.hadoop.fs.Path outputTmpPath,
int numRows,
int numCols,
boolean isSymmetric,
int desiredRank,
String outputEigenVectorPathString)
Deprecated.
Factored-out LanczosSolver for the purpose of invoking it programmatically
|
void |
serializeOutput(LanczosState state,
org.apache.hadoop.fs.Path outputPath)
Deprecated.
|
void |
setConf(org.apache.hadoop.conf.Configuration configuration)
Deprecated.
|
calculateScaleFactor, orthoganalizeAgainstAllButLast, solve, solve
public static final String RAW_EIGENVECTORS
public static Vector getInitialVector(VectorIterable corpus)
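A vector that is uniform over all input dimensions and L_2 normalized has every entry equal to 1/sqrt(n), where n is the number of dimensions. The following is a minimal sketch in plain Java, using a double[] in place of Mahout's Vector; the class UniformInitialVector and its methods are hypothetical names for illustration and are not part of this API.

```java
import java.util.Arrays;

// Illustrative only: a plain-array stand-in for the uniform, L_2-normalized start vector.
final class UniformInitialVector {

    // Returns a vector of length numCols whose entries are all 1/sqrt(numCols),
    // so that its Euclidean (L_2) norm is 1.
    static double[] uniformUnitVector(int numCols) {
        if (numCols <= 0) {
            throw new IllegalArgumentException("numCols must be positive");
        }
        double[] v = new double[numCols];
        Arrays.fill(v, 1.0 / Math.sqrt(numCols));
        return v;
    }

    // Euclidean norm, used here only to check the normalization.
    static double l2Norm(double[] v) {
        double sumOfSquares = 0.0;
        for (double x : v) {
            sumOfSquares += x * x;
        }
        return Math.sqrt(sumOfSquares);
    }

    public static void main(String[] args) {
        double[] v = uniformUnitVector(4);
        System.out.println(Arrays.toString(v)); // prints [0.5, 0.5, 0.5, 0.5]
        System.out.println(l2Norm(v));          // prints 1.0
    }
}
```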
public LanczosState runJob(org.apache.hadoop.conf.Configuration originalConfig, LanczosState state, int desiredRank, boolean isSymmetric, String outputEigenVectorPathString) throws IOException
IOException
public LanczosState runJob(org.apache.hadoop.conf.Configuration originalConfig, org.apache.hadoop.fs.Path inputPath, org.apache.hadoop.fs.Path outputTmpPath, int numRows, int numCols, boolean isSymmetric, int desiredRank, String outputEigenVectorPathString) throws IOException
IOException
public int run(String[] strings) throws Exception
run
in interface org.apache.hadoop.util.Tool
Exception
public int run(org.apache.hadoop.fs.Path inputPath, org.apache.hadoop.fs.Path outputPath, org.apache.hadoop.fs.Path outputTmpPath, org.apache.hadoop.fs.Path workingDirPath, int numRows, int numCols, boolean isSymmetric, int desiredRank, double maxError, double minEigenvalue, boolean inMemory) throws Exception
inputPath - the Path to the input corpus
outputPath - the Path to the output
outputTmpPath - a Path to a temporary working directory
numRows - the int number of rows
numCols - the int number of columns
isSymmetric - true if the input matrix is symmetric
desiredRank - the int desired rank of eigenvectors to produce
maxError - the maximum allowable error
minEigenvalue - the minimum usable eigenvalue
inMemory - true if the verification can be done in memory
Exception
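The roles of maxError and minEigenvalue in the cleaning step can be pictured as a filter over candidate eigenvectors. The sketch below is plain Java and rests on stated assumptions: the error of a candidate v is taken to be 1 - |cos(angle between v and Av)|, and the eigenvalue estimate is the Rayleigh quotient. These definitions, the class EigenFilterSketch, and its method names are illustrative and are not confirmed to match what EigenVerificationJob computes.

```java
// Illustrative only: filters candidate eigenvectors of a small dense matrix.
final class EigenFilterSketch {

    // Dense matrix-vector product A v.
    static double[] times(double[][] a, double[] v) {
        double[] result = new double[a.length];
        for (int i = 0; i < a.length; i++) {
            for (int j = 0; j < v.length; j++) {
                result[i] += a[i][j] * v[j];
            }
        }
        return result;
    }

    static double dot(double[] x, double[] y) {
        double sum = 0.0;
        for (int i = 0; i < x.length; i++) {
            sum += x[i] * y[i];
        }
        return sum;
    }

    // Assumed eigenvalue estimate: Rayleigh quotient (v . Av) / (v . v).
    static double eigenvalueEstimate(double[][] a, double[] v) {
        return dot(v, times(a, v)) / dot(v, v);
    }

    // Assumed error measure: 1 - |cos| of the angle between v and Av.
    // It is 0 when Av is parallel to v, i.e. when v is an exact eigenvector.
    static double error(double[][] a, double[] v) {
        double[] av = times(a, v);
        double denom = Math.sqrt(dot(av, av) * dot(v, v));
        if (denom == 0.0) {
            return 1.0; // treat a zero vector as maximally wrong
        }
        return 1.0 - Math.abs(dot(av, v)) / denom;
    }

    // Keep a candidate only if it is accurate enough and its eigenvalue is large enough.
    static boolean accept(double[][] a, double[] v, double maxError, double minEigenvalue) {
        return error(a, v) < maxError && eigenvalueEstimate(a, v) > minEigenvalue;
    }

    public static void main(String[] args) {
        double[][] a = {{1.0, 0.0}, {0.0, 10.0}};
        System.out.println(accept(a, new double[] {0.0, 1.0}, 0.05, 5.0)); // prints true
        System.out.println(accept(a, new double[] {1.0, 1.0}, 0.05, 5.0)); // prints false (not an eigenvector)
        System.out.println(accept(a, new double[] {1.0, 0.0}, 0.05, 5.0)); // prints false (eigenvalue 1 is below 5)
    }
}
```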
public int run(org.apache.hadoop.fs.Path inputPath, org.apache.hadoop.fs.Path outputPath, org.apache.hadoop.fs.Path outputTmpPath, org.apache.hadoop.fs.Path workingDirPath, int numRows, int numCols, boolean isSymmetric, int desiredRank) throws Exception
inputPath - the Path to the input corpus
outputPath - the Path to the output
outputTmpPath - a Path to a temporary working directory
numRows - the int number of rows
numCols - the int number of columns
isSymmetric - true if the input matrix is symmetric
desiredRank - the int desired rank of eigenvectors to produce
Exception
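One way to read the isSymmetric flag: the Lanczos iteration needs a symmetric operator, so for a non-symmetric (or non-square) input A a solver can iterate on A^T A instead, whose eigenvectors are the right singular vectors of A. The sketch below shows that choice of operator in plain Java. It assumes this interpretation of the flag; the class SymmetricFlagSketch and its methods are hypothetical names and are not part of this API.

```java
import java.util.Arrays;

// Illustrative only: picks the operator applied at each Lanczos step.
final class SymmetricFlagSketch {

    // Computes A v for a dense row-major matrix.
    static double[] times(double[][] a, double[] v) {
        double[] result = new double[a.length];
        for (int i = 0; i < a.length; i++) {
            for (int j = 0; j < v.length; j++) {
                result[i] += a[i][j] * v[j];
            }
        }
        return result;
    }

    // Computes A^T w without materializing the transpose.
    static double[] transposeTimes(double[][] a, double[] w) {
        double[] result = new double[a[0].length];
        for (int i = 0; i < a.length; i++) {
            for (int j = 0; j < a[i].length; j++) {
                result[j] += a[i][j] * w[i];
            }
        }
        return result;
    }

    // Symmetric input: apply A directly. Otherwise apply the symmetric product A^T A.
    static double[] apply(double[][] a, double[] v, boolean isSymmetric) {
        return isSymmetric ? times(a, v) : transposeTimes(a, times(a, v));
    }

    public static void main(String[] args) {
        double[][] a = {{1.0, 2.0}, {3.0, 4.0}, {5.0, 6.0}}; // 3 rows, 2 columns
        double[] v = {1.0, 0.0};
        System.out.println(Arrays.toString(apply(a, v, false))); // prints [35.0, 44.0]
    }
}
```

Note that with a non-symmetric input the result has numCols entries, which is why the vectors being iterated live in the column space dimension.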
public void serializeOutput(LanczosState state, org.apache.hadoop.fs.Path outputPath) throws IOException
state - The final LanczosState to be serialized
outputPath - The path (relative to the current Configuration's FileSystem) to save the output to.
IOException
public void setConf(org.apache.hadoop.conf.Configuration configuration)
setConf
in interface org.apache.hadoop.conf.Configurable
public org.apache.hadoop.conf.Configuration getConf()
getConf
in interface org.apache.hadoop.conf.Configurable
public DistributedLanczosSolver.DistributedLanczosSolverJob job()
Copyright © 2008–2017 The Apache Software Foundation. All rights reserved.