- Enclosing class:
- CumulativeTimeWord
public static class CumulativeTimeWord.IntersectionUnionReduce
extends org.apache.hadoop.mapreduce.Reducer<org.apache.hadoop.io.BytesWritable,org.apache.hadoop.io.BooleanWritable,org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.BytesWritable>
Receives every word-time, either from the current time period or from past time periods.
Has this word appeared either in the past or now? union == 1
Has this word appeared both in the past and now? intersection == 1
Emits the time period with the size of the union set, the size of the intersection set and the ratio of the two, intersection / union (the Jaccard index).
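The counting described above can be sketched in plain Java, without any Hadoop types. This is only an illustration under stated assumptions, not the reducer's actual code: the class JaccardSketch, its nested Counts type and the seenInPast / seenNow flags are hypothetical names, and how the real reducer derives those flags from its BooleanWritable values is not shown here.

```java
public class JaccardSketch {

    /** Hypothetical accumulator for the words of one time period. */
    static final class Counts {
        long union;        // words seen in the past OR in the current period
        long intersection; // words seen in the past AND in the current period

        /** Folds in one word, given where it has been seen. */
        void add(boolean seenInPast, boolean seenNow) {
            if (seenInPast || seenNow) {
                union++;
            }
            if (seenInPast && seenNow) {
                intersection++;
            }
        }

        /** Jaccard index: |intersection| / |union|, defined here as 0 for an empty union. */
        double jaccard() {
            return union == 0 ? 0.0 : (double) intersection / union;
        }
    }

    public static void main(String[] args) {
        Counts counts = new Counts();
        counts.add(true, true);   // word seen in the past and now
        counts.add(true, false);  // word seen only in the past
        counts.add(false, true);  // word seen only now
        System.out.println(counts.union + " " + counts.intersection + " " + counts.jaccard());
    }
}
```

With the three words in main, the union has size 3 and the intersection has size 1, so the Jaccard index is 1/3. Returning 0 for an empty union is a choice made for this sketch.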
- Author:
- Sina Samangooei (ss@ecs.soton.ac.uk)