Overview
Package
Class
Use
Tree
Deprecated
Index
Help
PREV LETTER
NEXT LETTER
FRAMES
NO FRAMES
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
R
S
T
U
V
W
X
Z
_
D
data
- Variable in class idl.tmt.representation.
TmtClustering
debug
- Variable in class idl.tmt.representation.
BagOfWordsRepresentationBuilder
DEFAULT_ENTITY_REGEX
- Static variable in class idl.tmt.documentparsing.filters.
HtmlTagsFilter
DEFAULT_HTML_TAG_REGEX
- Static variable in class idl.tmt.documentparsing.filters.
HtmlTagsFilter
DEFAULT_INDEX
- Static variable in class idl.tmt.documentsource.webcrawl.
WgetWebCrawlContext
DEFAULT_INDEX
- Static variable in class idl.tmt.documentsource.webcrawl.
StringReplacementURLMapperTest
DEFAULT_INDEX_NAME
- Static variable in class idl.tmt.documentsource.webcrawl.
StringReplacementURLMapper
DEFAULT_REGEX
- Static variable in class idl.tmt.documentparsing.filters.
WordFilter
DEFAULT_SCRIPT_REGEX
- Static variable in class idl.tmt.documentparsing.filters.
HtmlTagsFilter
defaultIndex
- Variable in class idl.tmt.documentsource.webcrawl.
MultiMapURLMapper
defaultIndexStr
- Variable in class idl.tmt.documentsource.webcrawl.
StringReplacementURLMapper
DEFUALT_URL_REGEX
- Static variable in class idl.tmt.documentparsing.filters.
HtmlTagsFilter
delegateToTextParser
- Variable in class idl.tmt.documentparsing.
HTMLDocumentParser
deleteDir(File)
- Method in class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProviderTest
recursively empties a directory, then deletes that directory.
deleteTermsAfterIndex(int)
- Method in class idl.tmt.representation.
TermList
Deletes the terms in the termlist with indexes greater than the specified index.
DELIMITER
- Static variable in interface idl.tmt.documentparsing.
WordParsingListener
Property value for setting the delimiter to identify the end of one word and the beginning of another
delimiters
- Variable in class idl.tmt.documentparsing.
TextDocumentParser
DELIMITERS_PROP
- Static variable in class idl.tmt.documentparsing.
TextDocumentParser
Property key for setting the delimiters to use for parsing.
DenseMatrixReader
- class idl.tmt.representation.matrix.
DenseMatrixReader
.
Class for reading dense matrices from disk.
DenseMatrixReader()
- Constructor for class idl.tmt.representation.matrix.
DenseMatrixReader
DensityBasedClustering
- class idl.tmt.representation.
DensityBasedClustering
.
Created on Apr 14, 2004
DensityBasedClustering.DensityBasedSoftCluster
- class idl.tmt.representation.
DensityBasedClustering.DensityBasedSoftCluster
.
DensityBasedClustering.DensityBasedSoftCluster(DensityBasedClustering, double[], double)
- Constructor for class idl.tmt.representation.
DensityBasedClustering.DensityBasedSoftCluster
DensityBasedClustering(DensityBasedClusterer, IndexedCollection)
- Constructor for class idl.tmt.representation.
DensityBasedClustering
df
- Variable in class idl.tmt.util.
HTMLOutputUtil
digestConfiguration(String)
- Static method in class idl.tmt.
ConfigurationDigester
dimension
- Variable in class idl.tmt.representation.matrix.
VectorNormalizer
docCount
- Variable in class idl.tmt.documentsource.webcrawl.
WgetWebCrawlContext
docIDMap
- Variable in class idl.tmt.documentsource.webcrawl.
WgetWebCrawlContext
docIDMap
- Variable in class idl.tmt.documentparsing.
HTMLDocumentParser
docIDs
- Variable in class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
docList
- Variable in class idl.tmt.representation.
IndexedCollection
docsClusters
- Variable in class idl.tmt.representation.
DensityBasedClustering
DOCUMENT_ID_UNKNOWN
- Static variable in class idl.tmt.documentsource.webcrawl.
MultiMapDocumentIDMap
DocumentClassification
- interface idl.tmt.classification.
DocumentClassification
.
Created on Jun 21, 2004
documentClassifications
- Variable in class idl.tmt.representation.
IndexedCollection
documentCollectionComplete()
- Method in class idl.tmt.representation.
BodyTextRepresentationBuilder
Indicates the completion of the document collection, and builds the matrix representation.
documentCollectionComplete()
- Method in class idl.tmt.representation.
LinkTextRepresentationBuilder
Indicates that parsing of the collection is done, and this object builds the representation matrix.
documentCollectionComplete()
- Method in class idl.tmt.representation.
TitleTextRepresentationBuilder
Builds the document representation matrix
documentCollectionComplete()
- Method in class idl.tmt.representation.
MetaTextRepresentationBuilder
Builds the document representation matrix
documentCollectionComplete()
- Method in class idl.tmt.representation.
H1TextRepresentationBuilder
Builds the document representation matrix
documentCollectionComplete()
- Method in class idl.tmt.classification.
HTMLMetricsClassifier
documentCollectionComplete()
- Method in interface idl.tmt.documentparsing.
ParsingListener
Indicates that the parsing of the entire collection of documents is complete.
documentCollectionComplete()
- Method in class idl.tmt.util.
TitleSaver
documentComplete()
- Method in class idl.tmt.representation.
BodyTextRepresentationBuilder
Indicates the completion of a document.
documentComplete()
- Method in class idl.tmt.representation.
LinkTextRepresentationBuilder
Ignored in this representation builder because we're just interested in the documents linked to, not the current document.
documentComplete()
- Method in class idl.tmt.representation.
TitleTextRepresentationBuilder
documentComplete()
- Method in class idl.tmt.representation.
MetaTextRepresentationBuilder
documentComplete()
- Method in class idl.tmt.representation.
H1TextRepresentationBuilder
documentComplete()
- Method in class idl.tmt.classification.
HTMLMetricsClassifier
documentComplete()
- Method in interface idl.tmt.documentparsing.
ParsingListener
Indicates that the parsing of the current document has completed.
documentComplete()
- Method in class idl.tmt.util.
TitleSaver
documentCount()
- Method in class idl.tmt.documentsource.webcrawl.
WgetWebCrawlContext
Returns a count of documents
documentCount()
- Method in interface idl.tmt.documentsource.filesystem.
FilesystemDocumentProvider
Retrieves an estimate the total number of documents this provider is able to produce.
documentCount()
- Method in class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProvider
Returns a count of the documents left in this provier
DocumentIDMapper
- interface idl.tmt.documentsource.
DocumentIDMapper
.
Provides the mapping from the Document to the DocumentID Created on Mar 18, 2004
documentIndex
- Variable in class idl.tmt.representation.
IndexedCollection
DocumentIndexWriter
- class idl.tmt.util.
DocumentIndexWriter
.
Builds an inverted index with format
termid docid count
per line, writing to a file on disk.
DocumentIndexWriter(String, TmtMatrix)
- Constructor for class idl.tmt.util.
DocumentIndexWriter
DocumentParser
- interface idl.tmt.documentparsing.
DocumentParser
.
Interface which provides parsing capabilities.
DocumentProvider
- interface idl.tmt.documentsource.
DocumentProvider
.
This interface defines an entry point into a source of documents.
DocumentProviderException
- exception idl.tmt.documentsource.
DocumentProviderException
.
Generic Exception indicate Created on Feb 25, 2004
DocumentProviderException()
- Constructor for class idl.tmt.documentsource.
DocumentProviderException
Constructs a new DocumentProviderException
DocumentProviderException(String)
- Constructor for class idl.tmt.documentsource.
DocumentProviderException
Constructs a new DocumentProviderException with the specified message
DocumentProviderException(String, Throwable)
- Constructor for class idl.tmt.documentsource.
DocumentProviderException
DocumentProviderException(Throwable)
- Constructor for class idl.tmt.documentsource.
DocumentProviderException
doEM()
- Method in class idl.tmt.clusterers.
EnhancedEM
Perform the EM algorithm
doTasks(LinkedList)
- Method in class idl.tmt.tasks.
TaskManager
doublec(int)
- Method in class idl.tmt.documentparsing.filters.
Stemmer
Overview
Package
Class
Use
Tree
Deprecated
Index
Help
PREV LETTER
NEXT LETTER
FRAMES
NO FRAMES
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
R
S
T
U
V
W
X
Z
_