public class WordIndex extends StageAppender
| Modifier and Type | Class and Description |
|---|---|
| static class | WordIndex.Map: Emits each word with the total number of times the word was seen |
| static class | WordIndex.Reduce: Writes each word,count |
| Modifier and Type | Field and Description |
|---|---|
| protected static String | WORDCOUNT_THRESH |
| protected static String | WORDCOUNT_TIMETHRESH |
| protected static String | WORDCOUNT_TOPN |
| Constructor and Description |
|---|
| WordIndex() |
| WordIndex(int wordCountThreshold, int topNWords) |
| WordIndex(int wordCountThreshold, int wordTimeThreshold, int topNWords) |
| Modifier and Type | Method and Description |
|---|---|
| static void | main(String[] args) |
| static LinkedHashMap<String,IndependentPair<Long,Long>> | readWordCountLines(String path) |
| static LinkedHashMap<String,IndependentPair<Long,Long>> | readWordCountLines(String path, String ext): Gets the words from a report output path |
| void | stage(MultiStagedJob mjob) |
| static void | writeToMatlab(String path): Writes a CSV wordIndex to an MLCell written to a .mat data file |
protected static final String WORDCOUNT_THRESH
protected static final String WORDCOUNT_TOPN
protected static final String WORDCOUNT_TIMETHRESH
public WordIndex(int wordCountThreshold, int topNWords)
public WordIndex(int wordCountThreshold, int wordTimeThreshold, int topNWords)
public WordIndex()
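The page does not describe what wordCountThreshold and topNWords do. One plausible reading of the names is that words seen fewer than wordCountThreshold times are dropped and only the topNWords most frequent of the rest are kept. The sketch below shows that filter on a plain map. It is an assumption for illustration, not the behaviour of WordIndex itself; the class TopWordsSketch and its method select are hypothetical, and wordTimeThreshold is left out because its meaning is not stated here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Illustrative sketch only; the semantics of both parameters are assumed. */
final class TopWordsSketch {

    /**
     * Keeps words whose count is at least wordCountThreshold, then returns the
     * topNWords most frequent of them, most frequent first.
     */
    static LinkedHashMap<String, Long> select(Map<String, Long> counts,
                                              long wordCountThreshold,
                                              int topNWords) {
        List<Map.Entry<String, Long>> kept = new ArrayList<>();
        for (Map.Entry<String, Long> entry : counts.entrySet()) {
            if (entry.getValue() >= wordCountThreshold) {
                kept.add(entry);
            }
        }
        // Descending by count; ties are broken alphabetically so the result is deterministic.
        kept.sort((a, b) -> {
            int byCount = Long.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });
        // A negative topNWords is treated as zero (an assumption of this sketch).
        int limit = Math.min(Math.max(topNWords, 0), kept.size());
        LinkedHashMap<String, Long> top = new LinkedHashMap<>();
        for (Map.Entry<String, Long> entry : kept.subList(0, limit)) {
            top.put(entry.getKey(), entry.getValue());
        }
        return top;
    }

    public static void main(String[] args) {
        Map<String, Long> counts = new HashMap<>();
        for (String word : Arrays.asList("the", "the", "the", "index", "word", "word")) {
            counts.merge(word, 1L, Long::sum);
        }
        // Threshold 2 drops "index"; the top 2 of the remainder are printed.
        System.out.println(select(counts, 2L, 2));
    }
}
```

A LinkedHashMap is used for the result so that iteration follows the ranking order, which matches the return type used by readWordCountLines on this page.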
public static LinkedHashMap<String,IndependentPair<Long,Long>> readWordCountLines(String path) throws IOException
Parameters:
path -
Throws:
IOException

public static LinkedHashMap<String,IndependentPair<Long,Long>> readWordCountLines(String path, String ext) throws IOException
Parameters:
path - report output path
ext - where the words are in the path
Throws:
IOException

public void stage(MultiStagedJob mjob)
stage in class StageAppender
Parameters:
mjob - the job this stage should add itself to

public static void main(String[] args) throws IOException
Throws:
IOException

public static void writeToMatlab(String path) throws IOException
Write a CSV wordIndex to an MLCell written to a .mat data file
Parameters:
path -
Throws:
IOException
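
The return type of readWordCountLines above maps each word to a pair of Long values in a LinkedHashMap, which preserves the order in which entries were added. The sketch below shows how "word,count" lines (the form WordIndex.Reduce is described as writing) might be parsed into such a structure. It is a stand-alone illustration under stated assumptions, not the library's implementation: the class WordCountLinesSketch, its parse method, and the CountAndLine stand-in for IndependentPair<Long,Long> are hypothetical, and reading the pair as (count, line index) is a guess that this page does not confirm.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.Map;

/** Illustrative sketch only; this is not the library's readWordCountLines. */
final class WordCountLinesSketch {

    /** Hypothetical stand-in for IndependentPair<Long,Long>: (count, line index). */
    static final class CountAndLine {
        final long count;
        final long line;

        CountAndLine(long count, long line) {
            this.count = count;
            this.line = line;
        }
    }

    /**
     * Parses lines assumed to have the form "word,count" into a map that
     * iterates in the order the words were read.
     */
    static LinkedHashMap<String, CountAndLine> parse(Iterable<String> lines) {
        LinkedHashMap<String, CountAndLine> words = new LinkedHashMap<>();
        long lineIndex = 0;
        for (String line : lines) {
            if (line.trim().isEmpty()) {
                continue; // blank lines are skipped and do not consume an index
            }
            // Split on the last comma so a word that itself contains commas stays intact.
            int comma = line.lastIndexOf(',');
            if (comma < 0) {
                throw new IllegalArgumentException("Expected \"word,count\" but got: " + line);
            }
            String word = line.substring(0, comma);
            long count = Long.parseLong(line.substring(comma + 1).trim());
            words.put(word, new CountAndLine(count, lineIndex++));
        }
        return words;
    }

    public static void main(String[] args) {
        Map<String, CountAndLine> words = parse(Arrays.asList("the,120", "index,7", "word,42"));
        for (Map.Entry<String, CountAndLine> e : words.entrySet()) {
            System.out.println(e.getKey() + " count=" + e.getValue().count + " line=" + e.getValue().line);
        }
    }
}
```

The sketch takes an Iterable<String> rather than a path so it can be run without any files; the real method reads from the report output path and declares IOException.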