A B C D E F G H I J K L M N O P R S T U V W X Z _

D

data - Variable in class idl.tmt.representation.TmtClustering
 
debug - Variable in class idl.tmt.representation.BagOfWordsRepresentationBuilder
 
DEFAULT_ENTITY_REGEX - Static variable in class idl.tmt.documentparsing.filters.HtmlTagsFilter
 
DEFAULT_HTML_TAG_REGEX - Static variable in class idl.tmt.documentparsing.filters.HtmlTagsFilter
 
DEFAULT_INDEX - Static variable in class idl.tmt.documentsource.webcrawl.WgetWebCrawlContext
 
DEFAULT_INDEX - Static variable in class idl.tmt.documentsource.webcrawl.StringReplacementURLMapperTest
 
DEFAULT_INDEX_NAME - Static variable in class idl.tmt.documentsource.webcrawl.StringReplacementURLMapper
 
DEFAULT_REGEX - Static variable in class idl.tmt.documentparsing.filters.WordFilter
 
DEFAULT_SCRIPT_REGEX - Static variable in class idl.tmt.documentparsing.filters.HtmlTagsFilter
 
defaultIndex - Variable in class idl.tmt.documentsource.webcrawl.MultiMapURLMapper
 
defaultIndexStr - Variable in class idl.tmt.documentsource.webcrawl.StringReplacementURLMapper
 
DEFUALT_URL_REGEX - Static variable in class idl.tmt.documentparsing.filters.HtmlTagsFilter
 
delegateToTextParser - Variable in class idl.tmt.documentparsing.HTMLDocumentParser
 
deleteDir(File) - Method in class idl.tmt.documentsource.filesystem.GenericFilesystemDocumentProviderTest
Recursively empties a directory, then deletes that directory.
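
The description above can be sketched as follows. This is only an illustration of the "empty recursively, then delete" idea: the class name `DeleteDirSketch` and the `boolean` return type are assumptions, since this index shows the signature `deleteDir(File)` but not the method body or its return type.

```java
import java.io.File;

final class DeleteDirSketch {
    // Hypothetical stand-in for deleteDir(File); the real body is not shown in this index.
    // Returns true only if the directory and everything beneath it were deleted.
    static boolean deleteDir(File dir) {
        // listFiles() returns null when dir is not a directory or cannot be read.
        File[] children = dir.listFiles();
        if (children != null) {
            for (File child : children) {
                if (child.isDirectory()) {
                    // Empty and remove the subdirectory first.
                    if (!deleteDir(child)) {
                        return false;
                    }
                } else if (!child.delete()) {
                    return false;
                }
            }
        }
        // The directory is now empty, so it can be deleted itself.
        return dir.delete();
    }
}
```

Deleting the children before the parent matters because `File.delete()` refuses to remove a directory that is not empty.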
deleteTermsAfterIndex(int) - Method in class idl.tmt.representation.TermList
Deletes the terms in the termlist with indexes greater than the specified index.
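
A minimal sketch of this truncation, assuming the terms are held in a `java.util.List<String>` (the actual storage used by `TermList` is not shown in this index, and `TermListSketch` is a hypothetical name). It assumes `index` is at least -1.

```java
import java.util.List;

final class TermListSketch {
    // Hypothetical illustration: removes every element whose position is greater than index.
    static void deleteTermsAfterIndex(List<String> terms, int index) {
        // Nothing to delete when index already points at or past the last element.
        if (index < terms.size() - 1) {
            // subList is a view on the backing list, so clearing it removes the tail in place.
            terms.subList(index + 1, terms.size()).clear();
        }
    }
}
```

For example, calling the sketch with the list `[a, b, c, d]` and index `1` leaves `[a, b]`.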
DELIMITER - Static variable in interface idl.tmt.documentparsing.WordParsingListener
Property value for setting the delimiter that marks the end of one word and the beginning of another.
delimiters - Variable in class idl.tmt.documentparsing.TextDocumentParser
 
DELIMITERS_PROP - Static variable in class idl.tmt.documentparsing.TextDocumentParser
Property key for setting the delimiters to use for parsing.
DenseMatrixReader - class idl.tmt.representation.matrix.DenseMatrixReader.
Class for reading dense matrices from disk.
DenseMatrixReader() - Constructor for class idl.tmt.representation.matrix.DenseMatrixReader
 
DensityBasedClustering - class idl.tmt.representation.DensityBasedClustering.
Created on Apr 14, 2004
DensityBasedClustering.DensityBasedSoftCluster - class idl.tmt.representation.DensityBasedClustering.DensityBasedSoftCluster.
 
DensityBasedClustering.DensityBasedSoftCluster(DensityBasedClustering, double[], double) - Constructor for class idl.tmt.representation.DensityBasedClustering.DensityBasedSoftCluster
 
DensityBasedClustering(DensityBasedClusterer, IndexedCollection) - Constructor for class idl.tmt.representation.DensityBasedClustering
 
df - Variable in class idl.tmt.util.HTMLOutputUtil
 
digestConfiguration(String) - Static method in class idl.tmt.ConfigurationDigester
 
dimension - Variable in class idl.tmt.representation.matrix.VectorNormalizer
 
docCount - Variable in class idl.tmt.documentsource.webcrawl.WgetWebCrawlContext
 
docIDMap - Variable in class idl.tmt.documentsource.webcrawl.WgetWebCrawlContext
 
docIDMap - Variable in class idl.tmt.documentparsing.HTMLDocumentParser
 
docIDs - Variable in class idl.tmt.representation.GenericHardClustering.GenericHardCluster
 
docList - Variable in class idl.tmt.representation.IndexedCollection
 
docsClusters - Variable in class idl.tmt.representation.DensityBasedClustering
 
DOCUMENT_ID_UNKNOWN - Static variable in class idl.tmt.documentsource.webcrawl.MultiMapDocumentIDMap
 
DocumentClassification - interface idl.tmt.classification.DocumentClassification.
Created on Jun 21, 2004
documentClassifications - Variable in class idl.tmt.representation.IndexedCollection
 
documentCollectionComplete() - Method in class idl.tmt.representation.BodyTextRepresentationBuilder
Indicates the completion of the document collection, and builds the matrix representation.
documentCollectionComplete() - Method in class idl.tmt.representation.LinkTextRepresentationBuilder
Indicates that parsing of the collection is done, and this object builds the representation matrix.
documentCollectionComplete() - Method in class idl.tmt.representation.TitleTextRepresentationBuilder
Builds the document representation matrix.
documentCollectionComplete() - Method in class idl.tmt.representation.MetaTextRepresentationBuilder
Builds the document representation matrix.
documentCollectionComplete() - Method in class idl.tmt.representation.H1TextRepresentationBuilder
Builds the document representation matrix.
documentCollectionComplete() - Method in class idl.tmt.classification.HTMLMetricsClassifier
 
documentCollectionComplete() - Method in interface idl.tmt.documentparsing.ParsingListener
Indicates that the parsing of the entire collection of documents is complete.
documentCollectionComplete() - Method in class idl.tmt.util.TitleSaver
 
documentComplete() - Method in class idl.tmt.representation.BodyTextRepresentationBuilder
Indicates the completion of a document.
documentComplete() - Method in class idl.tmt.representation.LinkTextRepresentationBuilder
Ignored in this representation builder because we're just interested in the documents linked to, not the current document.
documentComplete() - Method in class idl.tmt.representation.TitleTextRepresentationBuilder
 
documentComplete() - Method in class idl.tmt.representation.MetaTextRepresentationBuilder
 
documentComplete() - Method in class idl.tmt.representation.H1TextRepresentationBuilder
 
documentComplete() - Method in class idl.tmt.classification.HTMLMetricsClassifier
 
documentComplete() - Method in interface idl.tmt.documentparsing.ParsingListener
Indicates that the parsing of the current document has completed.
documentComplete() - Method in class idl.tmt.util.TitleSaver
 
documentCount() - Method in class idl.tmt.documentsource.webcrawl.WgetWebCrawlContext
Returns a count of documents.
documentCount() - Method in interface idl.tmt.documentsource.filesystem.FilesystemDocumentProvider
Retrieves an estimate of the total number of documents this provider is able to produce.
documentCount() - Method in class idl.tmt.documentsource.filesystem.GenericFilesystemDocumentProvider
Returns a count of the documents left in this provider.
DocumentIDMapper - interface idl.tmt.documentsource.DocumentIDMapper.
Provides the mapping from the Document to the DocumentID. Created on Mar 18, 2004.
documentIndex - Variable in class idl.tmt.representation.IndexedCollection
 
DocumentIndexWriter - class idl.tmt.util.DocumentIndexWriter.
Builds an inverted index with format
termid docid count
per line, writing to a file on disk.
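
The line format described above can be illustrated with a short sketch. Everything here beyond the `termid docid count` layout is an assumption: the `int[][]` input stands in for `TmtMatrix` (whose API is not shown in this index), fields are separated by a single space, zero counts are skipped, and `IndexFormatSketch` and `writeIndex` are hypothetical names.

```java
import java.io.IOException;
import java.io.Writer;

final class IndexFormatSketch {
    // counts[doc][term] is a hypothetical dense layout used only for illustration.
    static void writeIndex(int[][] counts, Writer out) throws IOException {
        int numTerms = counts.length == 0 ? 0 : counts[0].length;
        // Iterate term-first so that all postings for one term are grouped together,
        // which is what makes the output an inverted index.
        for (int term = 0; term < numTerms; term++) {
            for (int doc = 0; doc < counts.length; doc++) {
                int count = counts[doc][term];
                if (count > 0) {
                    out.write(term + " " + doc + " " + count + "\n");
                }
            }
        }
    }
}
```

Passing a `java.io.FileWriter` as `out` would write the lines to a file on disk; a `java.io.StringWriter` is convenient for inspecting the output in memory.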
DocumentIndexWriter(String, TmtMatrix) - Constructor for class idl.tmt.util.DocumentIndexWriter
 
DocumentParser - interface idl.tmt.documentparsing.DocumentParser.
Interface which provides parsing capabilities.
DocumentProvider - interface idl.tmt.documentsource.DocumentProvider.
This interface defines an entry point into a source of documents.
DocumentProviderException - exception idl.tmt.documentsource.DocumentProviderException.
Generic Exception indicate Created on Feb 25, 2004
DocumentProviderException() - Constructor for class idl.tmt.documentsource.DocumentProviderException
Constructs a new DocumentProviderException
DocumentProviderException(String) - Constructor for class idl.tmt.documentsource.DocumentProviderException
Constructs a new DocumentProviderException with the specified message
DocumentProviderException(String, Throwable) - Constructor for class idl.tmt.documentsource.DocumentProviderException
 
DocumentProviderException(Throwable) - Constructor for class idl.tmt.documentsource.DocumentProviderException
 
doEM() - Method in class idl.tmt.clusterers.EnhancedEM
Performs the EM algorithm.
doTasks(LinkedList) - Method in class idl.tmt.tasks.TaskManager
 
doublec(int) - Method in class idl.tmt.documentparsing.filters.Stemmer
 
