public class SimpleTextEncodingVectorizer extends Object implements Vectorizer
Runs a Map/Reduce job that encodes FeatureVectorEncoder
the
input and writes it to the output as a sequence file.
Only works on basic text, where the value in the SequenceFile is a blob of text.
Constructor and Description |
---|
SimpleTextEncodingVectorizer() |
Modifier and Type | Method and Description |
---|---|
void |
createVectors(org.apache.hadoop.fs.Path input,
org.apache.hadoop.fs.Path output,
VectorizerConfig config) |
public void createVectors(org.apache.hadoop.fs.Path input, org.apache.hadoop.fs.Path output, VectorizerConfig config) throws IOException, ClassNotFoundException, InterruptedException
createVectors
in interface Vectorizer
IOException
ClassNotFoundException
InterruptedException
Copyright © 2008–2017 The Apache Software Foundation. All rights reserved.