Overview
Package
Class
Use
Tree
Deprecated
Index
Help
PREV LETTER
NEXT LETTER
FRAMES
NO FRAMES
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
R
S
T
U
V
W
X
Z
_
G
generateOutput()
- Method in class idl.tmt.util.
RBInputGenerator
GenericFileReader
- class idl.tmt.representation.matrix.
GenericFileReader
.
Reads the contents of a file on disk, one line at a time
GenericFileReader(String)
- Constructor for class idl.tmt.representation.matrix.
GenericFileReader
GenericFilesystemDocumentProvider
- class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProvider
.
This class implements a filesystemDocumentProvide for a generic filesystems.
GenericFilesystemDocumentProvider.DirectoryFilter
- class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProvider.DirectoryFilter
.
GenericFilesystemDocumentProvider.DirectoryFilter(GenericFilesystemDocumentProvider)
- Constructor for class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProvider.DirectoryFilter
GenericFilesystemDocumentProvider.HTMLFileFilter
- class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProvider.HTMLFileFilter
.
GenericFilesystemDocumentProvider.HTMLFileFilter(GenericFilesystemDocumentProvider)
- Constructor for class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProvider.HTMLFileFilter
GenericFilesystemDocumentProvider(File, FilenameFilter)
- Constructor for class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProvider
Creates a new GenericFilesystemDocumentProvider.
GenericFilesystemDocumentProviderTest
- class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProviderTest
.
Created on Feb 26, 2004
GenericFilesystemDocumentProviderTest(String)
- Constructor for class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProviderTest
Constructor for GenericFilesystemDocumentProviderTest.
GenericHardClustering
- class idl.tmt.representation.
GenericHardClustering
.
A generic hard clustering that can be built from a soft clustering.
GenericHardClustering.GenericHardCluster
- class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
.
GenericHardClustering.GenericHardCluster(GenericHardClustering, int[], int, double)
- Constructor for class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
GenericHardClustering(SoftClustering, IndexedCollection)
- Constructor for class idl.tmt.representation.
GenericHardClustering
genFDP
- Variable in class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProviderTest
genUM
- Variable in class idl.tmt.documentsource.webcrawl.
StringReplacementURLMapperTest
getAllowMetaNames()
- Method in class idl.tmt.representation.
MetaTextRepresentationBuilder
Gets a set of the Strings that are allowed for "meta" names in this document representation.
getAllPositiveLogOddsTerms()
- Method in interface idl.tmt.representation.
HardCluster
getAllPositiveLogOddsTerms()
- Method in class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
Returns an array of all terms in sorted order with a positive log-odds-ratio.
getClasses(int)
- Method in class idl.tmt.classification.
SoftClusteringClassification
Gets the classes that the given document belongs to.
getClasses(int)
- Method in class idl.tmt.classification.
SimpleClassification
getClasses(int)
- Method in interface idl.tmt.classification.
DocumentClassification
getClassification()
- Method in interface idl.tmt.classification.
ClassificationBuilder
getClassification()
- Method in class idl.tmt.classification.
HTMLMetricsClassifier
getClassName(int)
- Method in class idl.tmt.classification.
SoftClusteringClassification
getClassName(int)
- Method in class idl.tmt.classification.
SimpleClassification
getClassName(int)
- Method in interface idl.tmt.classification.
DocumentClassification
getCluster(int)
- Method in class idl.tmt.representation.
DensityBasedClustering
Gets the cluster corresponding to the given cluster index.
getCluster(int)
- Method in interface idl.tmt.representation.
Clustering
getCluster(int)
- Method in class idl.tmt.representation.
GenericHardClustering
Returns the cluster associated with this cluster ID
getClusterer()
- Method in class idl.tmt.tasks.
ClusteringTask
getClusterer(String)
- Method in class idl.tmt.
ToolkitEnvironment
Retrieves a clusterer from the shared repository.
getClustererName()
- Method in class idl.tmt.tasks.
AnalysisTask
getClusterMembership(int)
- Method in interface idl.tmt.representation.
HardClustering
getClusterMembership(int)
- Method in class idl.tmt.representation.
GenericHardClustering
returns the clusterID that this document belongs to
getClusterMembershipProbabilities(int)
- Method in class idl.tmt.representation.
DensityBasedClustering
Gets the probabilities of cluster membership for the given document ID
getClusterMembershipProbabilities(int)
- Method in interface idl.tmt.representation.
SoftClustering
getClusterModelsNumericAtts()
- Method in class idl.tmt.clusterers.
EnhancedEM
Return the normal distributions for the cluster models
getClusterPriors()
- Method in class idl.tmt.clusterers.
EnhancedEM
Return the priors for the clusters
getCollection(String)
- Method in class idl.tmt.
ToolkitEnvironment
Retrieves a collection from the shared repository.
getCollectionName()
- Method in class idl.tmt.tasks.
IndexingTask
getCollectionName()
- Method in class idl.tmt.tasks.
ClusteringTask
getCollectionName()
- Method in class idl.tmt.tasks.
AnalysisTask
getColumn(int)
- Method in class idl.tmt.representation.matrix.
TmtMatrix
get a column from the matrix
getColumnCentered()
- Method in class idl.tmt.representation.matrix.
TmtMatrix
returns column-centered version of the matrix
getColumnNormalized()
- Method in class idl.tmt.representation.matrix.
TmtMatrix
returns column-normalized (unit length) version of the matrix
getColumns(int, int)
- Method in class idl.tmt.representation.matrix.
TmtMatrix
construct a new matrix containing a subset of the columns of this matrix
getConfig()
- Method in class idl.tmt.tasks.
AnalysisTask
getCorrelationMatrix()
- Method in class idl.tmt.representation.matrix.
TmtMatrix
computes and returns the correlation matrix of this matrix
getDebug()
- Method in class idl.tmt.clusterers.
EnhancedEM
Get debug mode
getDefaultIndexName()
- Method in interface idl.tmt.documentsource.webcrawl.
URLMapper
Returns the default name given to remote URLs which do not have a file name component.
getDefaultIndexName()
- Method in class idl.tmt.documentsource.webcrawl.
StringReplacementURLMapper
Returns the default index name -- used for cases where the remote url ends in a slash
getDefaultIndexName()
- Method in class idl.tmt.documentsource.webcrawl.
MultiMapURLMapper
Returns the default index name
getDocFile(int)
- Method in class idl.tmt.representation.
IndexedCollection
getDocID(File)
- Method in class idl.tmt.documentsource.webcrawl.
MultiMapDocumentIDMap
Returns the Document ID referred to by the File of the local document provided.
getDocID(File)
- Method in interface idl.tmt.documentsource.
DocumentIDMapper
getDocTermIDs(int)
- Method in class idl.tmt.representation.
IndexedCollection
getDocTerms(int)
- Method in class idl.tmt.representation.
IndexedCollection
getDocumentClassifications()
- Method in class idl.tmt.representation.
IndexedCollection
getDocumentIDByRank(int)
- Method in interface idl.tmt.representation.
Cluster
getDocumentIDByRank(int)
- Method in class idl.tmt.representation.
DensityBasedClustering.DensityBasedSoftCluster
Returns the document ID of the document with the given rank, in the document ordering by probabilities.
getDocumentIDByRank(int)
- Method in class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
For now, just returns a document ID, in no particular order.
getDocumentIDMapper()
- Method in class idl.tmt.representation.
IndexedCollection
getDocumentIDMapper()
- Method in class idl.tmt.documentsource.webcrawl.
WgetWebCrawlContext
Returns the DocumentIDMapper for this web crawl
getDocumentIDMapper()
- Method in interface idl.tmt.documentsource.webcrawl.
WebCrawlContext
Retrieves the DocumentIDMapper object associated with this web crawl.
getDocumentIDs()
- Method in interface idl.tmt.representation.
HardCluster
getDocumentIDs()
- Method in class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
Returns an array of docuemnt ID's in this cluster
getDocumentProvider()
- Method in class idl.tmt.documentsource.webcrawl.
WgetWebCrawlContext
Returns a reference to the FilesystemDocumentProvider
getDocumentProvider()
- Method in interface idl.tmt.documentsource.webcrawl.
WebCrawlContext
Provides the document provider which will traverse the local filesystem and retrieve all the documents mirrored with this web crawl.
getEigenvalueDecomposition(DoubleMatrix2D)
- Method in class idl.tmt.representation.matrix.
MatrixManipulator
getElapsedTimeMillis()
- Method in class idl.tmt.util.
Stopwatch
getElapsedTimeSecs()
- Method in class idl.tmt.util.
Stopwatch
getFilter()
- Method in class idl.tmt.documentparsing.
TextDocumentParser
getFilter(String, Map)
- Method in class idl.tmt.documentparsing.filters.
FilterFactory
Creates a new Filter object based on the name and parameters.
getFilters()
- Method in class idl.tmt.tasks.
IndexingTask
getGlobalTermCounts(TmtMatrix)
- Method in class idl.tmt.representation.transformations.
TermOccurrenceFilter
getHardClusterForDoc(int)
- Method in class idl.tmt.representation.
TmtClustering
getIDFromValue(Object)
- Method in class idl.tmt.util.
IDMap
getIndexThreshold()
- Method in class idl.tmt.classification.
HTMLMetricsClassifier
getInitializer()
- Method in class idl.tmt.clusterers.
EnhancedEM
getInputDirName()
- Method in class idl.tmt.tasks.
IndexingTask
getInputFileName()
- Method in class idl.tmt.tasks.
IndexingTask
getInstance()
- Static method in class idl.tmt.
ToolkitEnvironment
getInstance()
- Static method in class idl.tmt.documentparsing.filters.
FilterFactory
getInstanceCount()
- Method in class idl.tmt.training.
RandomSelector
getLeftSingularVectors()
- Method in class idl.tmt.representation.matrix.
TmtMatrix
returns a TmtMatrix containing the left singular vectors of this matrix
getLocalDoc(int)
- Method in class idl.tmt.documentsource.webcrawl.
MultiMapDocumentIDMap
Returns the local URL of the document corresponding to the passed in Document ID.
getLocalDoc(int)
- Method in interface idl.tmt.documentsource.
DocumentIDMapper
getLocalFileIterator()
- Method in class idl.tmt.documentsource.webcrawl.
MultiMapURLMapper
getLogFile()
- Method in class idl.tmt.
ToolkitEnvironment
getLogFile()
- Method in class idl.tmt.tasks.
TaskManager
getLogLikely()
- Method in class idl.tmt.clusterers.
EnhancedEM
getLogOddsForTerm(int, int)
- Method in class idl.tmt.util.
LogOddsUtil
getMatrix()
- Method in class idl.tmt.representation.
IndexedCollection
getMatrix()
- Method in class idl.tmt.representation.matrix.
TmtMatrix
getMaxClusterersToBuild()
- Method in class idl.tmt.clusterers.
EnhancedEM
Get the maximum number of clusterers to build when seeking the best one.
getMaxIterations()
- Method in class idl.tmt.clusterers.
EnhancedEM
Get the maximum number of iterations
getMean()
- Method in class idl.tmt.representation.matrix.
VectorNormalizer
getMinLength()
- Method in class idl.tmt.documentparsing.filters.
LengthFilter
getMinOccurrences()
- Method in class idl.tmt.representation.transformations.
TermOccurrenceFilter
getMinStdDev()
- Method in class idl.tmt.clusterers.
EnhancedEM
Get the minimum allowable standard deviation.
getName()
- Method in class idl.tmt.tasks.
ClusteringTask
getName()
- Method in class idl.tmt.tasks.
AnalysisTask
getName()
- Method in class idl.tmt.classification.
SoftClusteringClassification
getName()
- Method in class idl.tmt.classification.
SimpleClassification
getName()
- Method in interface idl.tmt.classification.
DocumentClassification
getNextDocument()
- Method in class idl.tmt.documentsource.webcrawl.
WgetWebCrawlContext
Returns the next document URL
getNextDocument()
- Method in interface idl.tmt.documentsource.
DocumentProvider
Returns a URL of the next available document.
getNextDocument()
- Method in class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProvider
Returns the next document in our queue
getNorm()
- Method in class idl.tmt.representation.matrix.
VectorNormalizer
getNumClasses()
- Method in class idl.tmt.classification.
SoftClusteringClassification
getNumClasses()
- Method in class idl.tmt.classification.
SimpleClassification
getNumClasses()
- Method in interface idl.tmt.classification.
DocumentClassification
getNumClusters()
- Method in class idl.tmt.representation.
TmtClustering
getNumClusters()
- Method in class idl.tmt.representation.
DensityBasedClustering
gets the number of clusters in this clustering.
getNumClusters()
- Method in interface idl.tmt.representation.
Clustering
getNumClusters()
- Method in class idl.tmt.representation.
GenericHardClustering
returns the number of clusters in this clustering
getNumClusters()
- Method in class idl.tmt.clusterers.
EnhancedEM
Get the number of clusters
getNumDocs()
- Method in class idl.tmt.representation.
IndexedCollection
getNumDocsWithTerm(int, int)
- Method in class idl.tmt.util.
TermFrequencyUtil
getNumDocuments()
- Method in interface idl.tmt.representation.
HardCluster
getNumDocuments()
- Method in class idl.tmt.representation.
DensityBasedClustering
Returns the number of documents in this clustering
getNumDocuments()
- Method in interface idl.tmt.representation.
Clustering
getNumDocuments()
- Method in class idl.tmt.representation.
GenericHardClustering
Returns the number of documents in this clustering
getNumDocuments()
- Method in class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
Returns the number of docs in this cluster
getNumIterations()
- Method in class idl.tmt.clusterers.
BestKMeansEMInitializer
getNumTerms()
- Method in class idl.tmt.representation.
IndexedCollection
getOptions()
- Method in class idl.tmt.clusterers.
EnhancedEM
Gets the current settings of EM.
getOutputDir()
- Method in class idl.tmt.
ToolkitEnvironment
getOutputDir()
- Method in class idl.tmt.tasks.
TaskManager
getOutputFile()
- Method in class idl.tmt.util.
RBInputGenerator
getPrior()
- Method in interface idl.tmt.representation.
Cluster
getPrior()
- Method in class idl.tmt.representation.
DensityBasedClustering.DensityBasedSoftCluster
getPrior()
- Method in class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
getPrior(int)
- Method in class idl.tmt.representation.
TmtClustering
getProbabilityForDocID(int)
- Method in interface idl.tmt.representation.
SoftCluster
getProbabilityForDocID(int)
- Method in class idl.tmt.representation.
DensityBasedClustering.DensityBasedSoftCluster
Returns the probability of membership to this cluster for the given doc id.
getProbabilityThreshold()
- Method in class idl.tmt.classification.
SoftClusteringClassification
getRegex()
- Method in class idl.tmt.training.
RegexSelector
getRemoteCrawlRoots()
- Method in class idl.tmt.documentsource.webcrawl.
WgetWebCrawlContext
This method is unsupported for this implementation, and returns null.
getRemoteCrawlRoots()
- Method in interface idl.tmt.documentsource.webcrawl.
WebCrawlContext
Retrieves the URLs used as the remote root of the crawl.
getRemoteURL(File)
- Method in class idl.tmt.representation.
IndexedCollection
getRepresentation()
- Method in class idl.tmt.representation.
BagOfWordsRepresentationBuilder
getRepresentation()
- Method in interface idl.tmt.representation.
RepresentationBuilder
getResultBuffer()
- Method in class idl.tmt.documentparsing.filters.
Stemmer
Returns a reference to a character buffer containing the results of the stemming process.
getResultLength()
- Method in class idl.tmt.documentparsing.filters.
Stemmer
Returns the length of the word resulting from the stemming process.
getRightSingularVectors()
- Method in class idl.tmt.representation.matrix.
TmtMatrix
returns a TmtMatrix containing the right singular vectors of this matrix
getRoot()
- Method in class idl.tmt.documentsource.webcrawl.
WgetWebCrawlContext
Returns the local root directory where the mirrored documents are located.
getRoot()
- Method in interface idl.tmt.documentsource.filesystem.
FilesystemDocumentProvider
Retrieves the root directory of the filesystem where documents are returned from.
getRoot()
- Method in class idl.tmt.documentsource.filesystem.
GenericFilesystemDocumentProvider
Returns the root directory of this document provider
getRow(int)
- Method in class idl.tmt.representation.matrix.
TmtMatrix
get a row from the matrix
getRows(int, int)
- Method in class idl.tmt.representation.matrix.
TmtMatrix
construct a new matrix containing a subset of the columns of this matrix
getSeed()
- Method in class idl.tmt.training.
RandomSelector
getSeed()
- Method in class idl.tmt.clusterers.
EnhancedEM
Get the random number seed
getSingularValueDecomposition(DoubleMatrix2D)
- Method in class idl.tmt.representation.matrix.
MatrixManipulator
getSingularValues()
- Method in class idl.tmt.representation.matrix.
TmtMatrix
returns a diagonal TmtMatrix containing the singular values of this matrix
getSortedOrdering(double[], boolean)
- Static method in class idl.tmt.util.
ArrayUtils
getStatus()
- Method in class idl.tmt.training.
RegexSelector
getStatus()
- Method in class idl.tmt.training.
RandomSelector
getStatus()
- Method in interface idl.tmt.training.
TrainingSetConstructor
getStatus()
- Method in class idl.tmt.training.
FullCollectionSelector
getStatus()
- Method in class idl.tmt.representation.transformations.
MatrixRowNormalizer
getStatus()
- Method in class idl.tmt.representation.transformations.
StopListFilter
getStatus()
- Method in class idl.tmt.representation.transformations.
TermOccurrenceFilter
getStatus()
- Method in interface idl.tmt.representation.transformations.
CollectionTransformation
Provides a short string description of the collection transformation.
getStatus()
- Method in class idl.tmt.representation.transformations.
BinarizeAllTermsTransformation
getStatus()
- Method in class idl.tmt.representation.transformations.
TfIdfWeighter
getStatus()
- Method in class idl.tmt.representation.transformations.
MatrixColumnCenterer
getStopWordFile()
- Method in class idl.tmt.documentparsing.filters.
StopWordFilter
getStopWordsFile()
- Method in class idl.tmt.representation.transformations.
StopListFilter
getTableThreshold()
- Method in class idl.tmt.classification.
HTMLMetricsClassifier
getTerm(int)
- Method in class idl.tmt.representation.
TermList
Gets the term corresponding to this term ID.
getTerm(int)
- Method in class idl.tmt.representation.
IndexedCollection
getTermDocFrequency(int)
- Method in interface idl.tmt.representation.
HardCluster
getTermDocFrequency(int)
- Method in class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
Returns the number of documents in this cluster that contain the given term
getTermDocs(int)
- Method in class idl.tmt.representation.
IndexedCollection
getTermID(String)
- Method in class idl.tmt.representation.
TermList
Gets the integer term ID corresponding to this term.
getTermIDByDocFreqency(int)
- Method in interface idl.tmt.representation.
HardCluster
getTermIDByDocFreqency(int)
- Method in class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
Returns the termID with the given rank in a sorting by the number of documents the term occurs in.
getTermIDByLORRank(int)
- Method in interface idl.tmt.representation.
HardCluster
getTermIDByLORRank(int)
- Method in class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
Returns the term ID for this cluster with the given rank.
getTermIDByRank(int, int)
- Method in class idl.tmt.util.
LogOddsUtil
getTermIDByRank(int, int)
- Method in class idl.tmt.util.
TermFrequencyUtil
getTermList()
- Method in class idl.tmt.representation.
BagOfWordsRepresentationBuilder
getTermList()
- Method in class idl.tmt.representation.
IndexedCollection
getTermList()
- Method in interface idl.tmt.representation.
RepresentationBuilder
getTermLogOddsRatio(int)
- Method in interface idl.tmt.representation.
HardCluster
getTermLogOddsRatio(int)
- Method in class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
Returns the log-odds-ratio for the given term in this cluster
getTopNDocumentIDs(int)
- Method in interface idl.tmt.representation.
SoftCluster
getTopNDocumentIDs(int)
- Method in class idl.tmt.representation.
DensityBasedClustering.DensityBasedSoftCluster
Returns an array of n document ID's, sorted decending by their probability of membership to this cluster.
getTopNFrequentTerms(int)
- Method in interface idl.tmt.representation.
HardCluster
getTopNFrequentTerms(int)
- Method in class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
Returns the top N termID's, ordered by frequency
getTopNLogOddsTerms(int)
- Method in interface idl.tmt.representation.
HardCluster
getTopNLogOddsTerms(int)
- Method in class idl.tmt.representation.
GenericHardClustering.GenericHardCluster
Returns the top n term IDs, in sorted order, for this cluster
getTopNterms(int, int)
- Method in class idl.tmt.util.
LogOddsUtil
getTopNterms(int, int)
- Method in class idl.tmt.util.
TermFrequencyUtil
getTranspose(double[][])
- Static method in class idl.tmt.util.
ArrayUtils
getType()
- Method in class idl.tmt.tasks.
AnalysisTask
getURL()
- Method in class idl.tmt.documentsource.webcrawl.
MultiMapURLMapper.URLWrapper
getURLMapper()
- Method in class idl.tmt.representation.
IndexedCollection
getURLMapper()
- Method in class idl.tmt.documentsource.webcrawl.
WgetWebCrawlContext
returns the URL Mapper object
getURLMapper()
- Method in interface idl.tmt.documentsource.webcrawl.
WebCrawlContext
Retrieves the URLMapper object associated with this web crawl.
getUseClustering()
- Method in class idl.tmt.classification.
SoftClusteringClassification
getUseCollection()
- Method in class idl.tmt.util.
RBInputGenerator
getValue()
- Method in class idl.tmt.representation.transformations.
BinarizeAllTermsTransformation
getValue(int, int)
- Method in class idl.tmt.representation.matrix.
TmtMatrix
getValueFromID(int)
- Method in class idl.tmt.util.
IDMap
getWeight()
- Method in class idl.tmt.representation.
BagOfWordsRepresentationBuilder
getWeight()
- Method in interface idl.tmt.representation.
RepresentationBuilder
globalInfo()
- Method in class idl.tmt.clusterers.
EnhancedEM
Returns a string describing this clusterer
goodWordSet
- Variable in class idl.tmt.documentparsing.filters.
WordNetFilter
Overview
Package
Class
Use
Tree
Deprecated
Index
Help
PREV LETTER
NEXT LETTER
FRAMES
NO FRAMES
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
R
S
T
U
V
W
X
Z
_