Methods in cc.mallet.extract with parameters of type Tokenization |
LabeledSpans |
TokenizationFilter.constructLabeledSpans(LabelAlphabet dict,
java.lang.Object document,
Label backgroundTag,
Tokenization input,
Sequence seq)
Converts the sequence of labels into a set of labeled spans. |
LabeledSpans |
HierarchicalTokenizationFilter.constructLabeledSpans(LabelAlphabet dict,
java.lang.Object document,
Label backgroundTag,
Tokenization input,
Sequence seq)
|
LabeledSpans |
DefaultTokenizationFilter.constructLabeledSpans(LabelAlphabet dict,
java.lang.Object document,
Label backgroundTag,
Tokenization input,
Sequence seq)
|
LabeledSpans |
ConfidenceTokenizationFilter.constructLabeledSpans(LabelAlphabet dict,
java.lang.Object document,
Label backgroundTag,
Tokenization input,
Sequence seq)
|
LabeledSpans |
BIOTokenizationFilter.constructLabeledSpans(LabelAlphabet dict,
java.lang.Object document,
Label backgroundTag,
Tokenization input,
Sequence seq)
|
protected Span |
BIOTokenizationFilterWithTokenIndices.createSpan(Tokenization input,
int startTokenIdx,
int endTokenIdx)
|
protected Span |
BIOTokenizationFilter.createSpan(Tokenization input,
int startTokenIdx,
int endTokenIdx)
|
Extraction |
Extractor.extract(Tokenization toks)
Performs extraction from an object that has
already been tokenized. |
Extraction |
CRFExtractor.extract(Tokenization spans)
|
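The constructLabeledSpans methods above are described as converting a sequence of labels into a set of labeled spans. The following is a minimal, library-free sketch of that idea, not MALLET's implementation. The names SpanSketch, LabeledSpan, and toSpans are hypothetical. It assumes one label per token, merges adjacent tokens that share a label into one span, and drops runs carrying the background label. Whether the library keeps or flags background runs is not stated here.

```java
import java.util.ArrayList;
import java.util.List;

public class SpanSketch {
    /** Hypothetical stand-in for a labeled span: tokens [start, end) share one label. */
    static final class LabeledSpan {
        final String label;
        final int start; // index of the first token in the span
        final int end;   // index one past the last token in the span

        LabeledSpan(String label, int start, int end) {
            this.label = label;
            this.start = start;
            this.end = end;
        }

        @Override
        public String toString() {
            return label + "[" + start + "," + end + ")";
        }
    }

    /**
     * Groups consecutive tokens with the same label into spans.
     * Runs labeled with the background tag are skipped (an assumption of this sketch).
     */
    static List<LabeledSpan> toSpans(List<String> labels, String background) {
        List<LabeledSpan> spans = new ArrayList<>();
        int start = 0; // start index of the current run of identical labels
        for (int i = 1; i <= labels.size(); i++) {
            // A run ends at the end of the input or where the label changes.
            boolean runEnds = i == labels.size()
                    || !labels.get(i).equals(labels.get(start));
            if (runEnds) {
                String label = labels.get(start);
                if (!label.equals(background)) {
                    spans.add(new LabeledSpan(label, start, i));
                }
                start = i;
            }
        }
        return spans;
    }

    public static void main(String[] args) {
        List<String> labels = List.of("O", "PER", "PER", "O", "LOC");
        System.out.println(toSpans(labels, "O")); // prints [PER[1,3), LOC[4,5)]
    }
}
```

A BIO-style filter would differ in one respect: a new span would also begin at each "B-" prefixed label, even when the entity type matches the preceding token.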
Constructors in cc.mallet.extract with parameters of type Tokenization |
DocumentExtraction(java.lang.String name,
LabelAlphabet dict,
Tokenization input,
LabeledSpans predictedSpans,
LabeledSpans trueSpans,
java.lang.String background)
|
DocumentExtraction(java.lang.String name,
LabelAlphabet dict,
Tokenization input,
Sequence predicted,
Sequence target,
java.lang.String background)
|
DocumentExtraction(java.lang.String name,
LabelAlphabet dict,
Tokenization input,
Sequence predicted,
Sequence target,
java.lang.String background,
TokenizationFilter filter)
|
DocumentExtraction(java.lang.String name,
LabelAlphabet dict,
Tokenization input,
Sequence predicted,
java.lang.String background)
|
Extraction(Extractor extractor,
LabelAlphabet dict,
java.lang.String name,
Tokenization input,
Sequence output,
java.lang.String background)
Creates an extraction given a sequence output by some kind of per-sequence labeler, like an
HMM or a CRF. |