Uses of Interface
cc.mallet.extract.Tokenization

Packages that use Tokenization
cc.mallet.extract Unimplemented. 
cc.mallet.grmm.learning.extract   
 

Uses of Tokenization in cc.mallet.extract
 

Classes in cc.mallet.extract that implement Tokenization
 class StringTokenization
           
 

Methods in cc.mallet.extract that return Tokenization
 Tokenization DocumentExtraction.getInput()
           
 

Methods in cc.mallet.extract with parameters of type Tokenization
 LabeledSpans TokenizationFilter.constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
          Converts a the sequence of labels into a set of labeled spans.
 LabeledSpans HierarchicalTokenizationFilter.constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
           
 LabeledSpans DefaultTokenizationFilter.constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
           
 LabeledSpans ConfidenceTokenizationFilter.constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
           
 LabeledSpans BIOTokenizationFilter.constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
           
protected  Span BIOTokenizationFilterWithTokenIndices.createSpan(Tokenization input, int startTokenIdx, int endTokenIdx)
           
protected  Span BIOTokenizationFilter.createSpan(Tokenization input, int startTokenIdx, int endTokenIdx)
           
 Extraction Extractor.extract(Tokenization toks)
          Performs extraction from an object that has been already been tokenized.
 Extraction CRFExtractor.extract(Tokenization spans)
           
 

Constructors in cc.mallet.extract with parameters of type Tokenization
DocumentExtraction(java.lang.String name, LabelAlphabet dict, Tokenization input, LabeledSpans predictedSpans, LabeledSpans trueSpans, java.lang.String background)
           
DocumentExtraction(java.lang.String name, LabelAlphabet dict, Tokenization input, Sequence predicted, Sequence target, java.lang.String background)
           
DocumentExtraction(java.lang.String name, LabelAlphabet dict, Tokenization input, Sequence predicted, Sequence target, java.lang.String background, TokenizationFilter filter)
           
DocumentExtraction(java.lang.String name, LabelAlphabet dict, Tokenization input, Sequence predicted, java.lang.String background)
           
Extraction(Extractor extractor, LabelAlphabet dict, java.lang.String name, Tokenization input, Sequence output, java.lang.String background)
          Creates an extration given a sequence output by some kind of per-sequece labeler, like an HMM or a CRF.
 

Uses of Tokenization in cc.mallet.grmm.learning.extract
 

Methods in cc.mallet.grmm.learning.extract with parameters of type Tokenization
 Extraction ACRFExtractor.extract(Tokenization toks)