idl.tmt.documentparsing
Interface WordParsingListener

All Superinterfaces:
ParsingListener
All Known Implementing Classes:
BodyTextRepresentationBuilder, H1TextRepresentationBuilder, LinkTextRepresentationBuilder, TitleTextRepresentationBuilder

public interface WordParsingListener
extends ParsingListener

A parsing listener for parsing word text. Created on Jan 23, 2004

Author:
jelsas

Field Summary
static java.lang.String ALLOWWORD_LIST
          Property value for setting the allowword file.
static java.lang.String DELIMITER
          Property value for setting the delimiter to identify the end of one word and the beginning of another
static java.lang.String STOPWORD_LIST
          Property value for setting the stopword file.
 
Method Summary
 void word(java.lang.String word, int pos)
          Indicates that a word has been encountered in the document being parsed.
 
Methods inherited from interface idl.tmt.documentparsing.ParsingListener
documentCollectionComplete, documentComplete, newDocument
 

Field Detail

DELIMITER

public static final java.lang.String DELIMITER
Property value for setting the delimiter to identify the end of one word and the beginning of another

STOPWORD_LIST

public static final java.lang.String STOPWORD_LIST
Property value for setting the stopword file.

ALLOWWORD_LIST

public static final java.lang.String ALLOWWORD_LIST
Property value for setting the allowword file.
Method Detail

word

public void word(java.lang.String word,
                 int pos)
Indicates that a word has been encountered in the document being parsed.
Parameters:
word - The word encountered
pos - The character position of the word in the document