textmaven.application.analyzer.extractor
Class Word

java.lang.Object
  extended bytextmaven.application.analyzer.extractor.Word

public class Word
extends java.lang.Object

Representing a word including its distribution

Version:
1.0
Author:
Thomas Krebs

Field Summary
 long mCount
          number counting occurences
 long mOccurence
          Location identification where word first occured
 java.lang.String mOriginalWord
          represents original word found
 java.lang.String mWord
          represented word reduced to its stem (if normalization is used)
 
Constructor Summary
Word(java.lang.String word, long count, long occurence)
           
Word(java.lang.String word, long count, long occurence, java.lang.String original)
           
 
Method Summary
 void addAll(java.util.List sentences)
          Adds all sentences
 void addSentence(java.lang.StringBuffer sentence)
          Adds sentence if it does not already exist
static java.util.Comparator getAscDistrComparator()
          Return comparator comparing words by their occurences in ascending order
static java.util.Comparator getDescDistrComparator()
          Return comparator comparing words by their occurences in descending order
static java.util.Comparator getOccComparator()
          Return comparator comparing words by their first occurence
 java.util.List getSentences()
          Returns sentences the word is contained in
static java.util.Comparator getWordComparator()
          Return comparator comparing words lexically
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

mWord

public java.lang.String mWord
represented word reduced to its stem (if normalization is used)


mOriginalWord

public java.lang.String mOriginalWord
represents original word found


mCount

public long mCount
number counting occurences


mOccurence

public long mOccurence
Location identification where word first occured

Constructor Detail

Word

public Word(java.lang.String word,
            long count,
            long occurence)

Word

public Word(java.lang.String word,
            long count,
            long occurence,
            java.lang.String original)
Method Detail

addSentence

public void addSentence(java.lang.StringBuffer sentence)
Adds sentence if it does not already exist

Parameters:
sentence - sentence to add

addAll

public void addAll(java.util.List sentences)
Adds all sentences

Parameters:
sentences - sentences to add

getSentences

public java.util.List getSentences()
Returns sentences the word is contained in

Returns:
list of sentences containing StringBuffers

getAscDistrComparator

public static java.util.Comparator getAscDistrComparator()
Return comparator comparing words by their occurences in ascending order

Returns:
Comparator

getDescDistrComparator

public static java.util.Comparator getDescDistrComparator()
Return comparator comparing words by their occurences in descending order

Returns:
Comparator

getWordComparator

public static java.util.Comparator getWordComparator()
Return comparator comparing words lexically

Returns:
Comparator

getOccComparator

public static java.util.Comparator getOccComparator()
Return comparator comparing words by their first occurence

Returns:
Comparator


Copyright © 2002-2005 Sourceforge. All Rights Reserved.