public class WordIndex extends StageAppender
Modifier and Type | Class and Description |
---|---|
static class |
WordIndex.Map
Emits each word with the total number of times the word was seen
|
static class |
WordIndex.Reduce
Writes each word,count
|
Modifier and Type | Field and Description |
---|---|
protected static String |
WORDCOUNT_THRESH |
protected static String |
WORDCOUNT_TIMETHRESH |
protected static String |
WORDCOUNT_TOPN |
Constructor and Description |
---|
WordIndex() |
WordIndex(int wordCountThreshold,
int topNWords) |
WordIndex(int wordCountThreshold,
int wordTimeThreshold,
int topNWords) |
Modifier and Type | Method and Description |
---|---|
static void |
main(String[] args) |
static LinkedHashMap<String,IndependentPair<Long,Long>> |
readWordCountLines(String path) |
static LinkedHashMap<String,IndependentPair<Long,Long>> |
readWordCountLines(String path,
String ext)
from a report output path get the words
|
void |
stage(MultiStagedJob mjob) |
static void |
writeToMatlab(String path)
Write a CSV wordIndex to a
MLCell writen to a .mat data file |
protected static final String WORDCOUNT_THRESH
protected static final String WORDCOUNT_TOPN
protected static final String WORDCOUNT_TIMETHRESH
public WordIndex(int wordCountThreshold, int topNWords)
public WordIndex(int wordCountThreshold, int wordTimeThreshold, int topNWords)
public WordIndex()
public static LinkedHashMap<String,IndependentPair<Long,Long>> readWordCountLines(String path) throws IOException
path
- IOException
public static LinkedHashMap<String,IndependentPair<Long,Long>> readWordCountLines(String path, String ext) throws IOException
path
- report output pathext
- where the words are in the pathIOException
public void stage(MultiStagedJob mjob)
stage
in class StageAppender
mjob
- what should i add myself to?public static void main(String[] args) throws IOException
IOException
public static void writeToMatlab(String path) throws IOException
MLCell
writen to a .mat data filepath
- IOException