public class CSVVectorizationEngine extends VectorizationEngine
| Modifier and Type | Field and Description |
|---|---|
static String |
SKIP_HEADER_KEY |
conf, configProps, inputFormat, normalizeData, outputFilename, outputFormat, printStats, reader, shuffleOn, split, writer| Constructor and Description |
|---|
CSVVectorizationEngine() |
| Modifier and Type | Method and Description |
|---|---|
void |
execute()
This is where our custom vectorization engine does its thing
|
Collection<Writable> |
vectorize(String key,
String value,
CSVInputSchema schema)
Use statistics collected from a previous pass to vectorize (or drop) each column
|
Collection<Writable> |
vectorizeToWritable(String key,
String value,
CSVInputSchema schema)
Use statistics collected from a previous pass to vectorize (or drop) each column
|
addTransform, applyTransforms, initializepublic static final String SKIP_HEADER_KEY
public void execute()
throws CanovaException,
IOException,
InterruptedException
execute in class VectorizationEngineCanovaExceptionIOExceptionInterruptedExceptionpublic Collection<Writable> vectorize(String key, String value, CSVInputSchema schema)
public Collection<Writable> vectorizeToWritable(String key, String value, CSVInputSchema schema)
Copyright © 2016. All rights reserved.